Python reinforcement-learning

Open-source Python projects categorized as reinforcement-learning

Top 23 Python reinforcement-learning Projects

  • Ray

    Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

  • Project mention: Ray: Unified framework for scaling AI and Python applications | news.ycombinator.com | 2024-05-03
  • d2l-en

    Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

  • reinforcement-learning-an-introduction

    Python Implementation of Reinforcement Learning: An Introduction

  • wandb

    🔥 A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.

  • Project mention: A list of SaaS, PaaS and IaaS offerings that have free tiers of interest to devops and infradev | dev.to | 2024-02-05

    Weights & Biases - The developer-first MLOps platform. Build better models faster with experiment tracking, dataset versioning, and model management. Free tier for personal projects only, with 100 GB of storage included.

  • machine_learning_examples

    A collection of machine learning examples and tutorials.

  • Project mention: Doubt about numpy's eigen calculation | /r/learnmachinelearning | 2023-05-25

    Does that mean that the example I found on the internet is wrong (I think it comes from a DL course, so I'd imagine it is not wrong)? Or does it mean that I am comparing two different things? I guess this has to do with right and left eigenvectors, as u/JanneJM pointed out in her comment?

  • stable-baselines3

    PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

  • Project mention: Sim-to-real RL pipeline for open-source wheeled bipeds | /r/robotics | 2023-12-09

    The latest release (v3.0.0) of Upkie's software brings a functional sim-to-real reinforcement learning pipeline based on Stable Baselines3, with standard sim-to-real tricks. The pipeline trains on the Gymnasium environments distributed in upkie.envs (setup: pip install upkie) and is implemented in the PPO balancer. Here is a policy running on an Upkie:

  • trax

    Trax - Deep Learning with Clear Code and Speed

  • Project mention: Maxtext: A simple, performant and scalable Jax LLM | news.ycombinator.com | 2024-04-23

    Is t5x an encoder/decoder architecture?

    Some more general options.

    The Flax ecosystem

    https://github.com/google/flax?tab=readme-ov-file

    or dm-haiku

    https://github.com/google-deepmind/dm-haiku

    were some of the best developed communities in the Jax AI field

    Perhaps the "trax" repo? https://github.com/google/trax

    Some HF examples https://github.com/huggingface/transformers/tree/main/exampl...

    Sadly it seems much of the work is proprietary these days, but one example could be Grok-1, if you customize the details. https://github.com/xai-org/grok-1/blob/main/run.py

  • pysc2

    StarCraft II Learning Environment

  • PaLM-rlhf-pytorch

    Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

  • TensorLayer

    Deep Learning and Reinforcement Learning Library for Scientists and Engineers

  • Gymnasium

    An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

  • Project mention: NASA JPL Open Source Rover That Runs ROS 2 | news.ycombinator.com | 2023-09-22

    "Show HN: Ghidra Plays Mario" (2023) https://news.ycombinator.com/item?id=37475761 :

    [RL, MuZero reduxxxx ]

    > Farama-Foundation/Gymnasium is a fork of OpenAI/gym and it has support for additional Environments like MuJoCo: https://github.com/Farama-Foundation/Gymnasium#environments

    > Farama-Foundation/MO-Gymnasium: "Multi-objective Gymnasium environments for reinforcement learning": https://github.com/Farama-Foundation/MO-Gymnasium

  • keras-rl

    Deep Reinforcement Learning for Keras.

  • cleanrl

    High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

  • Project mention: [P] PettingZoo 1.24.0 has been released (including Stable-Baselines3 tutorials) | /r/reinforcementlearning | 2023-08-24

    PettingZoo 1.24.0 is now live! This release includes Python 3.11 support, updated Chess and Hanabi environment versions, and many bugfixes, documentation updates and testing expansions. We are also very excited to announce 3 tutorials using Stable-Baselines3, and a full training script using CleanRL with TensorBoard and WandB.

  • trlx

    A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

  • Project mention: Recapping the AI, Machine Learning and Data Science Meetup - May 2, 2024 | dev.to | 2024-05-02

    Transformer Reinforcement Learning X on GitHub

  • stable-baselines

    A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

  • dm_control

    Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

  • polyaxon

    MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle

  • ElegantRL

    Massively Parallel Deep Reinforcement Learning. 🔥

  • pytorch-a2c-ppo-acktr-gail

    PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

  • acme

    A library of reinforcement learning components and agents

  • tensorforce

    Tensorforce: a TensorFlow library for applied reinforcement learning

  • football

    Check out the new game server:

  • catalyst

    Accelerated deep learning R&D (by catalyst-team)

  • Project mention: Instance segmentation of small objects in grainy drone imagery | /r/computervision | 2023-12-09

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).
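
The Gymnasium entry above describes an API standard for single-agent environments. As a rough illustration of what such a convention looks like, here is a minimal pure-Python sketch. It does not import Gymnasium, and `CoinFlipEnv` and `run_episode` are hypothetical names made up for this example. It only mirrors the general shape of the interface, in which `reset` returns an observation and an info dict, and `step` returns an observation, a reward, a `terminated` flag, a `truncated` flag, and an info dict.

```python
import random
from typing import Any, Callable, Dict, Optional, Tuple


class CoinFlipEnv:
    """Toy environment (hypothetical): the observation is a coin value (0 or 1)
    and the agent earns a reward of 1.0 when its action matches that value."""

    def __init__(self, max_steps: int = 10) -> None:
        self.max_steps = max_steps
        self._rng = random.Random()
        self._steps = 0
        self._coin = 0

    def reset(self, seed: Optional[int] = None) -> Tuple[int, Dict[str, Any]]:
        # Re-seed only when a seed is supplied, then start a fresh episode.
        if seed is not None:
            self._rng = random.Random(seed)
        self._steps = 0
        self._coin = self._rng.randint(0, 1)
        return self._coin, {}

    def step(self, action: int) -> Tuple[int, float, bool, bool, Dict[str, Any]]:
        # Score the action against the coin the agent was shown.
        reward = 1.0 if action == self._coin else 0.0
        self._steps += 1
        self._coin = self._rng.randint(0, 1)
        terminated = False  # this toy task has no natural end state
        truncated = self._steps >= self.max_steps  # stop at the step limit
        return self._coin, reward, terminated, truncated, {}


def run_episode(env: CoinFlipEnv, policy: Callable[[int], int], seed: int = 0) -> float:
    """Run one episode and return the total reward collected."""
    obs, _info = env.reset(seed=seed)
    total = 0.0
    done = False
    while not done:
        obs, reward, terminated, truncated, _info = env.step(policy(obs))
        total += reward
        done = terminated or truncated
    return total
```

A policy that simply echoes the observation, such as `run_episode(CoinFlipEnv(), lambda obs: obs)`, matches the coin at every step. Libraries on this list such as stable-baselines3 and cleanrl train against real environments exposing this kind of interface rather than a toy like this one.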

Python reinforcement-learning related posts

  • Deep Reinforcement Learning: Zero to Hero

    3 projects | news.ycombinator.com | 5 May 2024
  • Recapping the AI, Machine Learning and Data Science Meetup - May 2, 2024

    2 projects | dev.to | 2 May 2024
  • Bayesianbandits: A Pythonic microframework for multi-armed bandit problems

    1 project | news.ycombinator.com | 14 Mar 2024
  • Adding Weapons

    1 project | dev.to | 24 Jan 2024
  • Understand how transformers work by demystifying all the math behind them

    1 project | news.ycombinator.com | 4 Jan 2024
  • Show HN: An end-to-end reinforcement learning library for infinite horizon tasks

    1 project | news.ycombinator.com | 29 Dec 2023
  • Show HN: Easily train AlphaZero-like agents on any environment you want

    2 projects | news.ycombinator.com | 20 Dec 2023

Index

What are some of the best open-source reinforcement-learning projects in Python? This list will help you:

Rank Project Stars
1 Ray 31,414
2 d2l-en 21,922
3 reinforcement-learning-an-introduction 13,229
4 wandb 8,328
5 machine_learning_examples 8,114
6 stable-baselines3 8,082
7 trax 7,964
8 pysc2 7,931
9 PaLM-rlhf-pytorch 7,609
10 TensorLayer 7,296
11 Gymnasium 5,859
12 keras-rl 5,492
13 cleanrl 4,564
14 trlx 4,344
15 stable-baselines 4,000
16 dm_control 3,578
17 polyaxon 3,494
18 ElegantRL 3,478
19 pytorch-a2c-ppo-acktr-gail 3,423
20 acme 3,398
21 tensorforce 3,284
22 football 3,253
23 catalyst 3,234
