How to tune hypeparametes in A2C-ppo?

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning

Scout Monitoring - Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
  • baselines

    OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

  • Im currently working with A2C. The model was able to learn open ai pong, i ran this as a sanity check that i havent made any bugs. Now im trying to make the model play breakout, but still after 10m steps the model has not made any significant progress. Im using baseline hyperparameters which can be found here https://github.com/openai/baselines/blob/master/baselines/a2c/a2c.py, except my buffersize have been from 512 to 4096. Ive noticed that entropy decreases extremely slowly given the buffersize from the interval which i just gave. So my questions are how to make entropy decrease and how to increase rewards per buffer? Ive tried to decrease the entropy coefficient to almost zero, but still it acts very weirdly.

  • ppo-implementation-details

    The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

  • You might find our PPO blog post helpful - https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Membuat Project Python yang mudah untuk dimaintain

    1 project | dev.to | 1 Jun 2024
  • Make Maintainable Python Project

    1 project | dev.to | 1 Jun 2024
  • Download Paul Graham essays in ePub format

    1 project | news.ycombinator.com | 1 Jun 2024
  • Building an Open Source AI Quality Assurance for Web Applications

    1 project | news.ycombinator.com | 1 Jun 2024
  • CSS Written in Pure Go

    2 projects | news.ycombinator.com | 1 Jun 2024