Grow your team on GitHub
GitHub is home to over 40 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign up
Pinned repositories
-
gym
A toolkit for developing and comparing reinforcement learning algorithms.
-
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
-
spinningup
An educational resource to help anyone learn deep reinforcement learning.
-
gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
-
mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
-
-
lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
-
gpt-2-output-dataset
Dataset of GPT-2 outputs for research in detection, biases, and more
-
multi-agent-emergence-environments
Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"
-
neural-mmo
Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"
-
retro
Retro Games in Gym
-
evolution-strategies-starter
Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
-
box2d-py
Forked from jonasschneider/box2d-py -
mujoco-worldgen
Automatic object XML generation for Mujoco
-
-
roboschool
DEPRECATED: Open-source software for robot simulation, integrated with OpenAI Gym.
-
doom-py
ViZDoom Python wrapper
-
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
-
pixel
Code for the single pixel debate game from the paper "AI safety via debate" (https://arxiv.org/abs/1805.00899)
-
atari-py
A packaged and slightly-modified version of https://github.com/bbitmaster/ale_python_interface
-
coinrun
Code for the paper "Quantifying Transfer in Reinforcement Learning"
-
generating-reviews-discovering-sentiment
Code for "Learning to Generate Reviews and Discovering Sentiment"
-
blocksparse
Efficient GPU kernels for block-sparse matrix multiplication and convolution
-
gym-http-api
API to access OpenAI Gym from other languages via HTTP
-
orrb
Code for the paper "OpenAI Remote Rendering Backend"
-
kubernetes-ec2-autoscaler Archived
A batch-optimized scaling manager for Kubernetes
-
-
monorepo-diff-buildkite-plugin
Forked from chronotc/monorepo-diff-buildkite-pluginRun separate pipelines for each folder in your monorepo
Most used topics
Loading…
People
This organization has no public members. You must be a member to see who’s a part of this organization.