Showing 20 open source projects for "deep reinforcement learning"

  • 1
    ViZDoom

    Doom-based AI research platform for reinforcement learning

    ...Access to the depth buffer (3D vision). Automatic labeling of game objects visible in the frame. Access to the list of actors/objects and map geometry. The ViZDoom API is reinforcement learning friendly (also suitable for learning from demonstration, apprenticeship learning, or apprenticeship via inverse reinforcement learning).
    Downloads: 1 This Week
  • 2
    Diplomacy Cicero

    Code for Cicero, an AI agent that plays the game of Diplomacy

    ...It is designed to play the board game Diplomacy by combining open-domain natural language negotiation with strategic planning. The repository includes training code, model checkpoints, and infrastructure for both language modelling (via the ParlAI framework) and reinforcement learning for strategy agents. It supports two variants: Cicero (which handles full “press” negotiation) and Diplodocus (a variant focused on no-press diplomacy) as described in the README. The codebase is implemented primarily in Python with performance-critical components in C++ (via pybind11 bindings) and is configured to run in a high‐GPU cluster environment. ...
    Downloads: 0 This Week
  • 3
    tic tac toe AI

    The simplest AI program for the tic-tac-toe game

    ...My plans for future versions of this program are: v 1.0.1: bug fixes; v 1.1: click interaction (added); v 1.2: reinforcement learning (cached data differs on each computer, unlike v 1.3); v 1.3: cloud reinforcement learning (optional; chosen from settings). ... & more
    Downloads: 1 This Week
  • 4
    Botcraft

    Botcraft is a cross-platform C++ library to create bots

    Botcraft is an advanced Minecraft bot framework that provides a full client implementation in C++, enabling bots to join Minecraft servers, interact with the world, and perform tasks like mining, fighting, and building. It provides a highly programmable interface for automation and AI experimentation in Minecraft. Botcraft is particularly useful for AI researchers, game modders, and automation enthusiasts who want to explore pathfinding, environment...
    Downloads: 0 This Week
  • 5
    Bullet Physics SDK

    Real-time collision detection and multi-physics simulation for VR

    This is the official C++ source code repository of the Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning, etc. We are developing a new differentiable simulator for robotics learning, called Tiny Differentiable Simulator, or TDS. The simulator allows for hybrid simulation with neural networks. It supports different automatic differentiation backends for forward-mode and reverse-mode gradients. TDS can be trained using deep reinforcement learning or gradient-based optimization (for example, L-BFGS). ...
    Downloads: 4 This Week
  • 6
    AirSim

    A simulator for drones, cars and more, built on Unreal Engine

    ...It is developed as an Unreal plugin that can simply be dropped into any Unreal environment. AirSim is developed as a platform for AI research, where deep learning, computer vision, and reinforcement learning algorithms for autonomous vehicles can be tested. For this purpose, AirSim also exposes APIs to retrieve data and control vehicles in a platform-independent way. AirSim fully supports multiple vehicles: you can create several vehicles easily and control them through the APIs.
    Downloads: 29 This Week
  • 7
    Lichess (Lila)

    The forever free, adless and open source chess server

    Lila is the backend codebase powering Lichess, a free, open-source online chess server with millions of users. The architecture handles real-time games, puzzles, tournaments, correspondence chess, streaming, and user profiles, scaling to thousands of simultaneous matches. The system uses functional programming principles (in Scala) to maintain reliability—state transitions, clock management, game rules, and move validation are deterministic, reducing synchronization errors. Lila also...
    Downloads: 1 This Week
  • 8
    Procgen

    Procedurally-Generated Game-Like Gym-Environments

    Procgen (short for Procedural Generation Benchmark) is a suite of 16 procedurally generated, game-like reinforcement learning environments designed to evaluate generalization and sample efficiency in RL agents. Unlike fixed, deterministic environments, Procgen generates new levels (layouts, obstacles, visual variation) each episode, making it impossible for an agent to simply memorize trajectories. The environments are designed to run very quickly (thousands of steps per second on a single core) to facilitate large-scale experiments and make benchmarking efficient. ...
    Downloads: 6 This Week
  • 9
    Euler

    A distributed graph deep learning framework.

    As a general data structure with strong expressive ability, graphs can be used to describe many real-world problems, such as user networks in social scenarios, user and commodity networks in e-commerce scenarios, communication networks in telecom scenarios, transaction networks in financial scenarios, and drug molecule networks in medical scenarios. Data in the fields of text, speech, and images is easier to arrange into a grid-like form in Euclidean space, which suits existing deep learning models. A graph is a data type in non-Euclidean space that existing methods cannot process directly, so it requires a specially designed graph neural network system. Graph-based learning methods such as graph neural networks combine end-to-end learning with inductive reasoning, and are expected to solve a series of problems, such as relational reasoning and interpretability, that deep learning cannot handle.
    Downloads: 0 This Week
  • 10
    Project Malmo

    A platform for Artificial Intelligence experimentation on Minecraft

    ...That learns transferable skills throughout its existence, and applies them to solve new, challenging problems? Project Malmo sets out to address these core research challenges by integrating (deep) reinforcement learning, cognitive science, and many ideas from artificial intelligence. The Malmo platform is a sophisticated AI experimentation platform built on top of Minecraft and designed to support fundamental research in artificial intelligence. It consists of a mod for the Java version of Minecraft and code that helps artificial intelligence agents sense and act within the Minecraft environment. ...
    Downloads: 15 This Week
  • 11
    Neural MMO

    Code for the paper "Neural MMO: A Massively Multiagent Game..."

    Neural MMO is a massively multi-agent simulation environment developed by OpenAI for reinforcement learning research. It provides a persistent, procedurally generated world where thousands of agents can interact, compete, and cooperate in real time. The environment is inspired by Massively Multiplayer Online Role-Playing Games (MMORPGs), featuring resource gathering, combat mechanics, exploration, and survival challenges.
    Downloads: 2 This Week
  • 12
    ELF (Extensive, Lightweight, and Flexible)

    An End-To-End, Lightweight and Flexible Platform for Game Research

    ELF (Extensive, Lightweight, and Flexible) is a high-performance platform for reinforcement learning research that unifies simulation, data collection, and distributed training. A C++ core provides fast environments and concurrent actors, while Python bindings expose simple APIs for agents, replay, and optimization loops. It supports both single-agent and multi-agent settings, with batched stepping and shared-memory queues that keep GPUs saturated during training.
    Downloads: 0 This Week
  • 13
    DarkForestGo

    DarkForest, the Facebook Go engine

    darkforestGo is an early deep-reinforcement-learning Go engine that combined a convolutional policy/value network with Monte Carlo Tree Search (MCTS) to play the full 19×19 game at a strong amateur level. The system couples fast GPU policy inference with CPU or GPU-assisted tree search so priors from the network guide exploration while search refines local tactics.
    Downloads: 0 This Week
  • 14
    Deep Space Pirates
    A space combat simulator game/learning project. All the info in our wiki: https://sourceforge.net/apps/mediawiki/dspirates/index.php?title=Main_Page Our code at: https://deepspacepirates.codeplex.com/SourceControl/latest
    Downloads: 0 This Week
  • 15
    Reinforcement learning and simulation-based search in computer Go
    Downloads: 0 This Week
  • 16
    Brainstormers Public Source Code Release
    The source code of the Brainstormers' 2005 RoboCup champion team was made publicly available at the end of 2005. The release also contains many of our results from applying reinforcement learning in the simulated soccer domain.
    Downloads: 0 This Week
  • 17
    Using reinforcement learning in a neural network to learn Othello
    Downloads: 0 This Week
  • 18
    Cerberus is a first-person shooter game that incorporates machine learning techniques. The game uses neural networks to control fighting behavior and reinforcement learning to select high-level tactics.
    Downloads: 0 This Week
  • 19
    RL Poker is a study project: a Java implementation of an ε-soft on-policy Monte Carlo reinforcement learning algorithm for Texas Hold'em poker, using a feedforward neural network trained with backpropagation. It provides a graphical interface to monitor game rounds.
    Downloads: 0 This Week
  • 20
    General purpose agents using reinforcement learning. Combines radial basis functions, temporal difference learning, planning, uncertainty estimations, and curiosity. Intended to be an out-of-the-box solution for roboticists and game developers.
    Downloads: 0 This Week