Build software better, together

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

reinforcement-learning deep-learning tensorflow deep-reinforcement-learning tf2 mcts alphazero tensorflow2 muzero

Updated Mar 28, 2021
Jupyter Notebook

manyoso / allie

Star

Allie: A UCI compliant chess engine

chess-engine chess neural-network mcts deepmind alphabeta alphazero

Updated Apr 8, 2021
C++

Urinx / ReinforcementLearning

Star

Reinforcing Your Learning of Reinforcement Learning

reinforcement-learning tic-tac-toe space-invaders q-learning doom dqn mcts policy-gradient cartpole gomoku ddpg atari-2600 alphago frozenlake ppo advantage-actor-critic alphago-zero

Updated Jul 14, 2019
Python

AlphaZero implementation for Othello, Connect-Four and Tic-Tac-Toe based on "Mastering the game of Go without human knowledge" and "Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" by DeepMind.

game machine-learning reinforcement-learning deep-learning tensorflow tic-tac-toe connect-four reversi mcts othello tictactoe resnet deepmind connect4 alphago-zero alpha-zero alphazero self-play

Updated Apr 14, 2018
Python

xtrp / jupiter

Star

A Monte-Carlo based AI to beat 2048

ai mcts 2048

Updated Mar 27, 2022
JavaScript

vgarciasc / mcts-viz

Star

Visualization of MCTS algorithm applied to Tic-tac-toe.

visualization mcts tictactoe p5js

Updated Aug 25, 2021
JavaScript

yangboz / godpaper

Star

🐵 An AI chess-board-game framework(by many programming languages) implementations.

docker kubernetes board-game flash deep-neural-networks ai microservice game-engine wiki actionscript deep-reinforcement-learning cnn dnn mcts finite-state-machine deeplearning starling fuzzy-logic-control alphago policytree

Updated Feb 11, 2022
HTML

xuetf / AlphaZero_Gobang

Star

Deep Learning big homework of UCAS

deep-learning pytorch mcts gomoku residual-networks gobang alphazero five-in-a-row

Updated Jan 8, 2019
Python

zhangshun97 / AI_Gomocup

Star

Here are some Python implementations of Gomoku AIs, including MCTS, Minimax and Genetic Alg.

ai genetic-algorithm mcts gomoku minimax

Updated Dec 14, 2018
Python

CGLemon / pyDLGO

Star

基於深度學習的 GTP 圍棋（围棋）引擎，KGS 指引文件以及演算法教學。2022 TCGA 電腦圍棋賽現已開始報名，詳見內部文件。

deep-learning baduk weiqi goban mcts alphago

Updated Mar 29, 2022
Python

OMerkel / UCThello

Star

UCThello - a board game demonstrator (Othello variant) with computer AI using Monte Carlo Tree Search (MCTS) with UCB (Upper Confidence Bounds) applied to trees (UCT in short)

game board-game mobile ai simulation mobile-app artificial-intelligence mcts othello mobile-game entertainment ucb uct monte-carlo-tree-search ai-players upper-confidence-bounds abstract-game perfect-information 2-player-strategy-game

Updated Mar 30, 2018
JavaScript

gorisanson / quoridor-ai

Star

Quoridor AI based on Monte Carlo tree search

ai mcts quoridor monte-carlo-tree-search quoridor-game

Updated Sep 27, 2021
JavaScript

hayoung-kim / mcts-tic-tac-toe

Star

Monte Carlo Tree Search for tic tac toe

tic-tac-toe mcts

Updated Jul 24, 2018
Python

masouduut94 / MCTS-agent-python

Star

Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in MCTS field.