Tsitsiklis et al., 1996 - Google Patents

Feature-based methods for large scale dynamic programming

Tsitsiklis et al., 1996

Document ID: 8982188623009033768
Author: Tsitsiklis J; Van Roy B
Publication year: 1996
Publication venue: Machine Learning

External Links

Cited by

Snippet

We develop a methodological framework and present a few different ways in which dynamic programming and compact representations can be combined to solve large scale stochastic control problems. In particular, we develop algorithms that employ two types of feature …

Continue reading at link.springer.com (PDF) (other versions)

238000000605 extraction 0 abstract description 11

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass

Similar Documents

Publication	Publication Date	Title
Tsitsiklis et al.	1996	Feature-based methods for large scale dynamic programming
Patil et al.	2020	Align-rudder: Learning from few demonstrations by reward redistribution
Falster et al.	2018	How functional traits influence plant growth and shade tolerance across the life cycle
Volaire et al.	2020	What do you mean “functional” in ecology? Patterns versus processes
Hiebeler	2000	Populations on fragmented landscapes with spatially structured heterogeneities: landscape generation and local dispersal
Cesa-Bianchi et al.	1997	How to use expert advice
Wright et al.	2012	Behavioral game theoretic models: a Bayesian framework for parameter analysis.
CN110390561B (en)	2022-04-29	User-financial product selection tendency high-speed prediction method and device based on momentum acceleration random gradient decline
Pinosky et al.	2023	Hybrid control for combining model-based and model-free reinforcement learning
Williams et al.	2016	The influence of evolution on population spread through patchy landscapes
Abdullah et al.	2019	Integrated MOPSO algorithms for task scheduling in cloud computing
Thomas et al.	2017	Probing for sparse and fast variable selection with model‐based boosting
Goldman et al.	2015	Fast and efficient black box optimization using the parameter-less population pyramid
US20210019635A1 (en)	2021-01-21	Group specific decision tree
Zhang et al.	2021	Collaboration of experts: Achieving 80% top-1 accuracy on imagenet with 100m flops
Demetçi et al.	2022	Unsupervised integration of single-cell multi-omics datasets with disproportionate cell-type representation
Faldor et al.	2025	Synergizing quality-diversity with descriptor-conditioned reinforcement learning
Madar et al.	2024	Predictions for the abundance and clustering of H α emitting galaxies
Faliszewski et al.	2016	Multiwinner voting in genetic algorithms for solving ill-posed global optimization problems
Jaeggi et al.	2004	Multi-objective parallel tabu search
Demosthenous et al.	2022	Deep reinforcement learning for improving competitive cycling performance
Eguiarte-Morett et al.	2024	Premature convergence in morphology and control co-evolution: a study
Wang et al.	2024	Training networks in null space of feature covariance with self-supervision for incremental learning
Avni et al.	2019	Bidding games on markov decision processes
Lv et al.	2022	A novel grasshopper optimization algorithm based on swarm state difference and its application