Reinforcement Learning

Published on Feb 4, 2025 | 29295 views

MDPs/VI, Q-learning (w/ proof), TD(lambda), function approximation, options, PSRs