TransformerXL + PPO Baseline + MemoryGym

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

episodic-transformer-memory-ppo

5 111 2.5 Python

Clean baseline implementation of PPO using an episodic TransformerXL memory

We finally completed a lightweight implementation of a memory-based agent using PPO and TransformerXL (and Gated TransformerXL).

brain-agent

2 92 3.0 Python

Brain Agent for Large-Scale and Multi-Task Agent Learning

Brain Agent

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
DI-engine

3 2,617 8.7 Python

OpenDILab Decision AI Engine

DI Engine

Ray

43 31,414 10.0 Python

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

RLlib

endless-memory-gym

1 67 6.7 Python

Challenging Memory-based Deep Reinforcement Learning Agents

Code: https://github.com/MarcoMeter/drl-memory-gym

adaptive-transformers-in-rl

1 129 10.0 Python

Adaptive Attention Span for Reinforcement Learning

Found relevant code at https://github.com/jerrodparker20/adaptive-transformers-in-rl + all code implementations here

popgym

4 147 6.1 Python

Partially Observable Process Gym

Have you seen this other ICLR paper, POPGym? Paper: https://openreview.net/forum?id=chDrutUTs0K Code: https://github.com/smorad/popgym

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
Gymnasium

12 5,859 9.3 Python

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Thanks! It really depends on the task that you want to implement. But in general, sticking to the standard gymnasium API is important. If you want to implement a 2D environment then PyGame is promising. If it's more like a game, check out Unity ML-Agents or Godot RL Agents. Anything simpler can also be just pure python code. You also need to carefully design your observation space, action space and reward function. My advice is to explore design choices of related environments.

ml-agents

60 16,435 8.0 C#

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

Thanks! It really depends on the task that you want to implement. But in general, sticking to the standard gymnasium API is important. If you want to implement a 2D environment then PyGame is promising. If it's more like a game, check out Unity ML-Agents or Godot RL Agents. Anything simpler can also be just pure python code. You also need to carefully design your observation space, action space and reward function. My advice is to explore design choices of related environments.

godot_rl_agents

5 768 9.1 Python

An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents

Thanks! It really depends on the task that you want to implement. But in general, sticking to the standard gymnasium API is important. If you want to implement a 2D environment then PyGame is promising. If it's more like a game, check out Unity ML-Agents or Godot RL Agents. Anything simpler can also be just pure python code. You also need to carefully design your observation space, action space and reward function. My advice is to explore design choices of related environments.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

[P] 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!

2 projects | /r/MachineLearning | 7 Jul 2023
Why did Stability not copy Midjourney's RLHF process? And what's the future of Stable Diffusion?

3 projects | /r/StableDiffusion | 9 Apr 2023
[P] 10x faster reinforcement learning HPO - now with CNNs!

3 projects | /r/MachineLearning | 5 Apr 2023
ACTorch: a PyTorch-based deep reinforcement learning framework for fast prototyping

1 project | /r/reinforcementlearning | 6 Mar 2023
Using AI to infer depth information from images in Godot 4 .NET 6 using the MiDaS monocular depth model

1 project | /r/godot | 2 Dec 2022

TransformerXL + PPO Baseline + MemoryGym

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning
reinforcement-learning deep-reinforcement-learning Pytorch Deep Learning Machine Learning
Post date: 15 Feb 2023

episodic-transformer-memory-ppo

brain-agent

InfluxDB

DI-engine

Ray

endless-memory-gym

adaptive-transformers-in-rl

popgym

SaaSHub

Gymnasium

ml-agents

godot_rl_agents

Related posts

[P] 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!

Why did Stability not copy Midjourney's RLHF process? And what's the future of Stable Diffusion?

[P] 10x faster reinforcement learning HPO - now with CNNs!

ACTorch: a PyTorch-based deep reinforcement learning framework for fast prototyping

Using AI to infer depth information from images in Godot 4 .NET 6 using the MiDaS monocular depth model

TransformerXL + PPO Baseline + MemoryGym

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning reinforcement-learning deep-reinforcement-learning Pytorch Deep Learning Machine Learning Post date: 15 Feb 2023

Related posts

[P] 10x faster reinforcement learning hyperparameter optimization than SOTA - now with distributed training!

Why did Stability not copy Midjourney's RLHF process? And what's the future of Stable Diffusion?

[P] 10x faster reinforcement learning HPO - now with CNNs!

ACTorch: a PyTorch-based deep reinforcement learning framework for fast prototyping

Using AI to infer depth information from images in Godot 4 .NET 6 using the MiDaS monocular depth model

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning
reinforcement-learning deep-reinforcement-learning Pytorch Deep Learning Machine Learning
Post date: 15 Feb 2023