Lee et al., 2025 - Google Patents

Constrained Optimization Formulation of Bellman Optimality Equation for Online Reinforcement Learning

Lee et al., 2025

Document ID: 12631321522424602759
Author: Lee H; Choi K
Publication year: 2025
Publication venue: Authorea Preprints

External Links

Cited by

Snippet

This paper proposes an online reinforcement learning algorithm that directly solves the Bellman optimality equation by casting it as a constrained optimization problem. Unlike policy or value iteration, which incrementally approximate the Bellman (optimality) equation …

Continue reading at www.techrxiv.org (PDF) (other versions)

Classifications

- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only

Similar Documents

Publication	Publication Date	Title
Grandia et al.	2020	Nonlinear model predictive control of robotic systems with control lyapunov functions
Hsu et al.	2023	Isaacs: Iterative soft adversarial actor-critic for safety
Jost et al.	2015	Optimal and suboptimal event-triggering in linear model predictive control
Wang et al.	2021	Chance constraint robust control with control barrier functions
D'Jorge et al.	2020	Stochastic model predictive control for tracking linear systems
Mittal et al.	2020	Neural lyapunov model predictive control
Nair et al.	2022	Stochastic mpc with dual control for autonomous driving with multi-modal interaction-aware predictions
Cisneros et al.	2018	A dissipativity formulation for stability analysis of nonlinear and parameter dependent MPC
Curi et al.	2022	Safe reinforcement learning via confidence-based filters
Hu et al.	2022	Provable sim-to-real transfer in continuous domain with partial observations
Lee et al.	2025	Constrained Optimization Formulation of Bellman Optimality Equation for Online Reinforcement Learning
Beckenbach et al.	2018	Addressing infinite-horizon optimization in MPC via Q-learning
Ławryńczuk	2009	Neural networks in model predictive control
Peydayesh et al.	2021	Neuro‐adaptive distributed output‐feedback containment control for multiagent systems with nonstrict‐feedback nonlinear dynamics and input constraints
Zhang et al.	2021	Marl with general utilities via decentralized shadow reward actor-critic
Patel et al.	2024	Conformal robust control of linear systems
Lu et al.	2024	Mpc-inspired reinforcement learning for verifiable model-free control
Osinenko et al.	2017	Stacked adaptive dynamic programming with unknown system model
Kögel et al.	2012	Cooperative distributed MPC using the alternating direction multiplier method
Wang et al.	2025	Safe Navigation in Uncertain Crowded Environments Using Risk Adaptive CVaR Barrier Functions
Pan et al.	2015	Composite learning from model reference adaptive fuzzy control
Ou et al.	2009	Model predictive control of parabolic PDE systems with dirichlet boundary conditions via galerkin model reduction
Lu et al.	2024	Bridging the gaps: Learning verifiable model-free quadratic programming controllers inspired by model predictive control
Han	2020	Control system based on affine TS fuzzy model with uncertainty
Yu et al.	2025	A Convex Optimization Approach to Model-Free Inverse Optimal Control with Provable Convergence