Plaat, 2020 - Google Patents

Self-Play

Plaat, 2020

Document ID: 1087011932165032891
Author: Plaat A
Publication year: 2020
Publication venue: Learning to Play: Reinforcement Learning and Games

External Links

Cited by

Snippet

Self-Play Page 1 Chapter 7 Self-Play This chapter is devoted to AlphaGo-style self-play. Self-play is an intuitively appealing AI method that has long been used by AI researchers in various forms, as we saw at the end of the previous chapter. The 2016 results showed, many years …

Continue reading at link.springer.com (other versions)

230000002787 reinforcement 0 description 70

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G06N5/046—Forward inferencing, production systems
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor

Similar Documents

Publication	Publication Date	Title
Hu et al.	2024	A survey on large language model-based game agents
Hilpisch	2020	Artificial intelligence in finance
US7873587B2 (en)	2011-01-18	Method and system for creating a program to preform a desired task based on programs learned from other tasks
Devezas et al.	2003	Power law behavior and world system evolution: A millennial learning process
Van Otterlo	2009	The logic of adaptive behavior: Knowledge representation and algorithms for adaptive sequential decision making under uncertainty in first-order and relational domains
Liu et al.	2022	On efficient reinforcement learning for full-length game of starcraft ii
Baier et al.	2018	Emulating human play in a leading mobile card game
Wu et al.	2008	Learning to play Go using recursive neural networks
Baum	1998	Manifesto for an evolutionary economics of intelligence
Cox et al.	2025	Predicting the next response: Demonstrating the utility of integrating artificial intelligence-based reinforcement learning with behavior science
Plaat	2020	Self-Play
Hu	2023	Planning with a model: Alphazero
Seify	2020	Single-agent optimization with monte-carlo tree search and deep reinforcement learning
Ruotsalainen	2024	Comparing path-finding algorithms and machine learning model
Iosti et al.	2020	Synthesizing control for a system with black box environment, based on deep learning
Plaat	2022	Two-Agent Self-Play
Araújo	2021	Agentes Com Aprendizagem Automática Para Jogos de Computador
Dobre	2019	Low-resource learning in complex games
West	2020	Self-play deep learning for games: Maximising experiences
Präntare	2017	Simultaneous coalition formation and task assignment in a real-time strategy game
Rizzo	2025	Model-Free Multi-Agent Reinforcement Learning Approach in NeurIPS LuxAI S3 Competition
Yannakakis et al.	2025	AI Methods for Games
Stooke	2020	Advancements in Deep Reinforcement Learning: Algorithms and Implementations
de Oliveira	2021	A Modular Architecture for Model-Based Deep Reinforcement Learning
Ring et al.	2017	Replicating deepmind starcraft ii reinforcement learning benchmark with actor-critic methods