Linked e-resources
Details
Table of Contents
Prediction Error and Actor-Critic Hypotheses in the Brain
Reviewing on-policy / o-policy critic learning in the context of Temporal Dierences and Residual Learning
Reward Function Design in Reinforcement Learning
Exploration Methods In Sparse Reward Environments
A Survey on Constraining Policy Updates Using the KL Divergence
Fisher Information Approximations in Policy Gradient Methods
Benchmarking the Natural gradient in Policy Gradient Methods and Evolution Strategies
Information-Loss-Bounded Policy Optimization
Persistent Homology for Dimensionality Reduction
Model-free Deep Reinforcement Learning Algorithms and Applications
Actor vs Critic
Bring Color to Deep Q-Networks
Distributed Methods for Reinforcement Learning
Model-Based Reinforcement Learning
Challenges of Model Predictive Control in a Black Box Environment
Control as Inference?
Reviewing on-policy / o-policy critic learning in the context of Temporal Dierences and Residual Learning
Reward Function Design in Reinforcement Learning
Exploration Methods In Sparse Reward Environments
A Survey on Constraining Policy Updates Using the KL Divergence
Fisher Information Approximations in Policy Gradient Methods
Benchmarking the Natural gradient in Policy Gradient Methods and Evolution Strategies
Information-Loss-Bounded Policy Optimization
Persistent Homology for Dimensionality Reduction
Model-free Deep Reinforcement Learning Algorithms and Applications
Actor vs Critic
Bring Color to Deep Q-Networks
Distributed Methods for Reinforcement Learning
Model-Based Reinforcement Learning
Challenges of Model Predictive Control in a Black Box Environment
Control as Inference?