Roig et al., 2020 - Google Patents
Remote reinforcement learning over a noisy channelRoig et al., 2020
- Document ID
- 9693096132567387569
- Author
- Roig J
- Gündüz D
- Publication year
- Publication venue
- GLOBECOM 2020-2020 IEEE Global Communications Conference
External Links
Snippet
A collaborative multi-agent reinforcement learning (RL) problem is considered, where agents communicate over a noisy communication channel towards achieving a common goal. In particular, we consider a remote-controlled version of a single-agent RL problem, in …
- 230000002787 reinforcement 0 title abstract description 11
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L25/00—Baseband systems
- H04L25/02—Details ; Arrangements for supplying electrical power along data transmission lines
- H04L25/03—Shaping networks in transmitter or receiver, e.g. adaptive shaping networks ; Receiver end arrangements for processing baseband signals
- H04L25/03006—Arrangements for removing intersymbol interference
- H04L2025/03592—Adaptation methods
- H04L2025/03598—Algorithms
- H04L2025/03611—Iterative algorithms
- H04L2025/03643—Order recursive
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Tung et al. | Effective communications: A joint learning and communication framework for multi-agent reinforcement learning over noisy channels | |
| Roig et al. | Remote reinforcement learning over a noisy channel | |
| CN113285875B (en) | Space route prediction method based on impulse neural network | |
| US20210141715A1 (en) | System and method for producing test data | |
| Zou et al. | Optimized consensus for blockchain in internet of things networks via reinforcement learning | |
| CN118869145A (en) | A collaborative data transmission method for UAV clusters based on MAPPO and RLNC | |
| Moon et al. | Learning-enabled network-control co-design for energy-efficient industrial internet of things | |
| Chang et al. | Steganography in game actions | |
| US12524673B2 (en) | Multitask distributed learning system and method based on lottery ticket neural network | |
| Hmedoush et al. | DS-IRSA: A deep reinforcement learning and sensing based IRSA | |
| Lent | Resource selection in cognitive networks with spiking neural networks | |
| Draper | Successive structuring of source coding algorithms for data fusion, buffering, and distribution in networks | |
| Hu et al. | Communication learning in multi-agent systems from graph modeling perspective | |
| CN111314015B (en) | Pulse interference decision method based on reinforcement learning | |
| Borisovskaya et al. | Estimation of average delay in systems with unsourced random access and multiple departure | |
| Huang et al. | Multi-agent cooperative games using belief map assisted training | |
| CN117852915A (en) | A DQN-based method for solving elastic adaptive strategies in information systems | |
| Perera et al. | Dynamic spectrum fusion: An adaptive learning approach for hybrid NOMA/OMA in evolving wireless networks | |
| CN116418755A (en) | Punctual information system data scheduling method based on Markov decision | |
| Page et al. | Node cardinality estimation in the internet of things using privileged feature distillation | |
| Mostaani et al. | On learning how to communicate over noisy channels for collaborative tasks | |
| Sahai et al. | Demystifying the witsenhausen counterexample [ask the experts] | |
| Akama et al. | Deep reinforcement learning model design and transmission for network delay compensation in 3d online shooting game | |
| Sharma et al. | Enhanced performance of high-speed MANET utilizing variational relation vector graph neural network optimized with superb fairy wren optimization algorithm | |
| Bowyer et al. | Optimizing synchronization times for tracking a mobile asset in GPS-denied environments using deep Q-learning |