![Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret thumbnail](https://apiminio.videolectures.net/vln/lectures/23349/1/en/thumbnail.jpg?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=masoud%2F20250117%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20250117T153342Z&X-Amz-Expires=604800&X-Amz-SignedHeaders=host&X-Amz-Signature=5ec78a211fa5fa4acc7c7ef364e8d091e066bed85aa4f9c8c0db1d600e766437)
Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret
Published on Sep 27, 20151648 Views
Lifelong reinforcement learning provides a promising framework for developing versatile agents that can accumulate knowledge over a lifetime of experience and rapidly learn new tasks by building upon