NPTEL Video Course : NOC:Stochastic Approximation: Theory and Applications
Lecture 43 - Best Policy Algorithm for Q-Value Functions: A Stochastic Approximation Formulation