TransformerXL + PPO Baseline + MemoryGym

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning.

  • episodic-transformer-memory-ppo

    Clean baseline implementation of PPO using an episodic TransformerXL memory

  • We finally completed a lightweight implementation of a memory-based agent using PPO and TransformerXL (and Gated TransformerXL).

  • brain-agent

    Brain Agent for Large-Scale and Multi-Task Agent Learning

  • Brain Agent

  • DI-engine

    OpenDILab Decision AI Engine

  • DI Engine

  • Ray

    Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

  • RLlib

  • endless-memory-gym

    Challenging Memory-based Deep Reinforcement Learning Agents

  • Code: https://github.com/MarcoMeter/drl-memory-gym

  • adaptive-transformers-in-rl

    Adaptive Attention Span for Reinforcement Learning

  • Found relevant code at https://github.com/jerrodparker20/adaptive-transformers-in-rl

  • popgym

    Partially Observable Process Gym

  • Have you seen this other ICLR paper, POPGym? Paper: https://openreview.net/forum?id=chDrutUTs0K Code: https://github.com/smorad/popgym

  • Gymnasium

    An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

  • Thanks! It really depends on the task that you want to implement. But in general, sticking to the standard Gymnasium API is important. If you want to implement a 2D environment, then PyGame is promising. If it's more like a game, check out Unity ML-Agents or Godot RL Agents. Anything simpler can also be just pure Python code. You also need to carefully design your observation space, action space, and reward function. My advice is to explore the design choices of related environments.

  • ml-agents

    The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

  • godot_rl_agents

    An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents

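The environment-design advice quoted in the list above (keep to the Gymnasium API, and think carefully about the observation space, action space and reward function) can be illustrated with a small pure-Python sketch. This is not code from any of the listed projects. The class `RecallCueEnv` and its parameters `num_cues` and `delay` are hypothetical names chosen for illustration. The sketch only mirrors the shape of Gymnasium's `reset` and `step` methods without importing the library; a real environment would typically subclass `gymnasium.Env` and declare `observation_space` and `action_space` explicitly.

```python
import random


class RecallCueEnv:
    """Hypothetical memory task: a cue is visible only in the first
    observation and must be repeated as the action on the final step."""

    def __init__(self, num_cues=2, delay=3):
        self.num_cues = num_cues  # number of discrete actions (and cue values)
        self.delay = delay        # blank steps between the cue and the final step
        self._rng = random.Random()
        self._cue = 0
        self._t = 0

    def _observe(self):
        # Observation: one-hot cue at t == 0, all zeros afterwards.
        obs = [0.0] * self.num_cues
        if self._t == 0:
            obs[self._cue] = 1.0
        return obs

    def reset(self, seed=None, options=None):
        # Same return shape as Gymnasium's reset: (observation, info).
        if seed is not None:
            self._rng = random.Random(seed)
        self._cue = self._rng.randrange(self.num_cues)
        self._t = 0
        return self._observe(), {}

    def step(self, action):
        # Same return shape as Gymnasium's step:
        # (observation, reward, terminated, truncated, info).
        self._t += 1
        terminated = self._t > self.delay
        # Reward: 1.0 only if the final action matches the hidden cue.
        reward = 1.0 if terminated and action == self._cue else 0.0
        return self._observe(), reward, terminated, False, {}


def run_episode(env, policy, seed=None):
    """Roll out one episode; `policy` maps (first_obs, current_obs) to an action."""
    first_obs, _ = env.reset(seed=seed)
    obs, total, done = first_obs, 0.0, False
    while not done:
        obs, reward, terminated, truncated, _ = env.step(policy(first_obs, obs))
        total += reward
        done = terminated or truncated
    return total
```

A policy that remembers the first observation, such as `lambda first, obs: first.index(1.0)`, collects the full reward, whereas a policy that only sees the current (blank) observation cannot do better than guessing. That is the kind of design property worth checking when building a memory benchmark.
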
NOTE: The number of mentions on this list indicates mentions on common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.
