JAX Implementations of Actor-Critic Algorithms

stable-baselines3

Mentions: 46 | Stars: 8,115 | Activity: 8.2 | Language: Python

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

In some cases I even get the PyTorch implementation to run faster (I created a branch with PyTorch optimizations that gives a 5% speed improvement: https://github.com/DLR-RM/stable-baselines3/tree/exp/torch-optim).

Ray

Mentions: 43 | Stars: 31,566 | Activity: 10.0 | Language: Python

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Folks like me who use RLlib have observed this behavior: https://github.com/ray-project/ray/issues/12494

stable-baselines

Mentions: 10 | Stars: 4,068 | Activity: 0.0 | Language: Python

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

- TF2 speed: https://github.com/hill-a/stable-baselines/issues/576#issuecomment-573331715

rl-baselines3-zoo

Mentions: 11 | Stars: 1,824 | Activity: 6.2 | Language: Python

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

For PyTorch, use the RL Zoo (https://github.com/DLR-RM/rl-baselines3-zoo) together with SB3 ;) (https://github.com/DLR-RM/stable-baselines3)

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.

Related posts

[P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials)

4 projects | /r/reinforcementlearning | 24 Aug 2023

[Question] Why there is so few algorithms implemented in SB3?

1 project | /r/reinforcementlearning | 22 Jul 2023

Stable baselines! Where my people at?

1 project | /r/reinforcementlearning | 5 Jul 2023

SB3 - NotImplementedError: Box([-1. -1. -8.], [1. 1. 8.], (3,), <class 'numpy.float32'>) observation space is not supported

2 projects | /r/reinforcementlearning | 19 Jun 2023

Exporting an A2C model created with stable-baselines3 to PyTorch

1 project | /r/reinforcementlearning | 5 Jun 2023

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning.
Tags: reinforcement-learning, Machine Learning, Gym, openai, Python
Post date: 10 Jan 2021
