Exploration-exploitation trade-off for continuous-time episodic reinforcement learning with linear-convex models
Preprint

Lukasz Szpruch, Tanut Treetanthiploet and Yufei Zhang
19/12/2021

Abstract

Computer Science - Learning; Mathematics - Optimization and Control; Mathematics - Probability; Statistics - Machine Learning
