General Motors (GM) Interview Question

Derive policy gradient algorithm on the board