Skip to content
Change the repository type filter

All

    Repositories list

    • HighJax

      Public
      Highway driving simulation in JAX for Reinforcement Learning research
      Rust
      MIT License
      0700Updated Mar 27, 2026Mar 27, 2026
    • Testing ranking algorithms to improve social cohesion
      Python
      33210Updated Mar 26, 2025Mar 26, 2025
    • A benchmark environment for fully cooperative human-AI performance.
      Jupyter Notebook
      MIT License
      209953132Updated Mar 22, 2025Mar 22, 2025
    • A prompt injection game to collect data for robust ML research
      Python
      BSD 2-Clause "Simplified" License
      869324Updated Jan 27, 2025Jan 27, 2025
    • imitation

      Public
      Clean PyTorch implementations of imitation and reward learning algorithms
      Python
      MIT License
      3011.7k7619Updated Jan 7, 2025Jan 7, 2025
    • Prosocial Ranking Challenge Perspective Ranker
      Jupyter Notebook
      MIT License
      0101Updated Nov 26, 2024Nov 26, 2024
    • PRC: Testing ranking algorithms to improve social cohesion
      JavaScript
      3000Updated Sep 21, 2024Sep 21, 2024
    • PRC: Civirank submission
      3000Updated Sep 8, 2024Sep 8, 2024
    • Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"
      Jupyter Notebook
      GNU General Public License v3.0
      112710Updated Jun 4, 2024Jun 4, 2024
    • Dataset for the Tensor Trust project
      Jupyter Notebook
      54810Updated Mar 17, 2024Mar 17, 2024
    • Jupyter Notebook
      0140Updated Nov 30, 2023Nov 30, 2023
    • Library to compare and evaluate reward functions
      Python
      Apache License 2.0
      86842Updated Oct 23, 2023Oct 23, 2023
    • seals

      Public
      Benchmark environments for reward modelling and imitation learning algorithms.
      Python
      MIT License
      94661Updated Sep 19, 2023Sep 19, 2023
    • Code for "On the Utility of Learning about Humans for Human-AI Coordination"
      Python
      4611100Updated Apr 17, 2023Apr 17, 2023
    • ray

      Public
      A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, …
      Python
      Apache License 2.0
      7.4k009Updated Mar 4, 2023Mar 4, 2023
    • eirli

      Public
      An Empirical Investigation of Representation Learning for Imitation (EIRLI), NeurIPS'21
      Python
      43723Updated Mar 4, 2023Mar 4, 2023
    • Checking the divisibility of neural networks, and investigating the nature of the pieces networks can be divided into.
      Python
      2602Updated Feb 11, 2023Feb 11, 2023
    • Web application where humans can play Overcooked with AI agents.
      JavaScript
      286086Updated Dec 6, 2022Dec 6, 2022
    • A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
      Python
      MIT License
      591100Updated Nov 30, 2022Nov 30, 2022
    • A simple webpage that can visualize a sgf string encoded as a url fragment.
      CSS
      0000Updated Sep 29, 2022Sep 29, 2022
    • Python
      MIT License
      1300Updated Aug 11, 2022Aug 11, 2022
    • sacred

      Public
      Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
      Python
      MIT License
      391103Updated Jul 24, 2022Jul 24, 2022
    • Supporting code for Assistance Games as a Framework paper
      Python
      MIT License
      1300Updated Jul 11, 2022Jul 11, 2022
    • dmc2gym

      Public
      OpenAI Gym wrapper for the DeepMind Control Suite
      Python
      MIT License
      68200Updated Jun 16, 2022Jun 16, 2022
    • Docker files to help reproduce bug described in https://forums.developer.nvidia.com/t/kernel-oops-null-pointer-dereference-when-closing-cuda-application-katago/…
      Dockerfile
      0000Updated May 25, 2022May 25, 2022
    • Preprocessing reward functions to make them more interpretable
      Python
      0400Updated May 11, 2022May 11, 2022
    • Code for the paper "Emergent Complexity via Multi-agent Competition"
      Python
      157400Updated Apr 19, 2022Apr 19, 2022
    • Find best-response to a fixed policy in multi-agent RL
      Python
      MIT License
      4828880Updated Apr 1, 2022Apr 1, 2022
    • PyTorch version of Stable Baselines, improved implementations of reinforcement learning algorithms.
      Python
      MIT License
      2.1k300Updated Nov 6, 2021Nov 6, 2021
    • Script for automatically creating the reconnaissance email.
      HTML
      1500Updated Nov 2, 2021Nov 2, 2021
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.