Tsitsiklis et al., 1996 - Google Patents
Feature-based methods for large scale dynamic programmingTsitsiklis et al., 1996
View PDF- Document ID
- 8982188623009033768
- Author
- Tsitsiklis J
- Van Roy B
- Publication year
- Publication venue
- Machine Learning
External Links
Snippet
We develop a methodological framework and present a few different ways in which dynamic programming and compact representations can be combined to solve large scale stochastic control problems. In particular, we develop algorithms that employ two types of feature …
- 238000000605 extraction 0 abstract description 11
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Tsitsiklis et al. | Feature-based methods for large scale dynamic programming | |
Patil et al. | Align-rudder: Learning from few demonstrations by reward redistribution | |
Falster et al. | How functional traits influence plant growth and shade tolerance across the life cycle | |
Volaire et al. | What do you mean “functional” in ecology? Patterns versus processes | |
Hiebeler | Populations on fragmented landscapes with spatially structured heterogeneities: landscape generation and local dispersal | |
Cesa-Bianchi et al. | How to use expert advice | |
Wright et al. | Behavioral game theoretic models: a Bayesian framework for parameter analysis. | |
CN110390561B (en) | User-financial product selection tendency high-speed prediction method and device based on momentum acceleration random gradient decline | |
Pinosky et al. | Hybrid control for combining model-based and model-free reinforcement learning | |
Williams et al. | The influence of evolution on population spread through patchy landscapes | |
Abdullah et al. | Integrated MOPSO algorithms for task scheduling in cloud computing | |
Thomas et al. | Probing for sparse and fast variable selection with model‐based boosting | |
Goldman et al. | Fast and efficient black box optimization using the parameter-less population pyramid | |
US20210019635A1 (en) | Group specific decision tree | |
Zhang et al. | Collaboration of experts: Achieving 80% top-1 accuracy on imagenet with 100m flops | |
Demetçi et al. | Unsupervised integration of single-cell multi-omics datasets with disproportionate cell-type representation | |
Faldor et al. | Synergizing quality-diversity with descriptor-conditioned reinforcement learning | |
Madar et al. | Predictions for the abundance and clustering of H α emitting galaxies | |
Faliszewski et al. | Multiwinner voting in genetic algorithms for solving ill-posed global optimization problems | |
Jaeggi et al. | Multi-objective parallel tabu search | |
Demosthenous et al. | Deep reinforcement learning for improving competitive cycling performance | |
Eguiarte-Morett et al. | Premature convergence in morphology and control co-evolution: a study | |
Wang et al. | Training networks in null space of feature covariance with self-supervision for incremental learning | |
Avni et al. | Bidding games on markov decision processes | |
Lv et al. | A novel grasshopper optimization algorithm based on swarm state difference and its application |