Li et al., 2020 - Google Patents
Bound Controller for a Quadruped Robot using Pre-Fitting Deep Reinforcement LearningLi et al., 2020
View PDF- Document ID
- 12468100663161665766
- Author
- Li A
- Wang Z
- Wu J
- Zhu Q
- Publication year
- Publication venue
- arXiv
External Links
Snippet
The bound gait is an important gait in quadruped robot locomotion. It can be used to cross obstacles and often serves as transition mode between trot and gallop. However, because of the complexity of the models, the bound gait built by the conventional control method is often …
- 230000002787 reinforcement 0 title abstract description 9
Classifications
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
- G06N3/008—Artificial life, i.e. computers simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. robots replicating pets or humans in their appearance or behavior
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Gangapurwala et al. | Rloc: Terrain-aware legged locomotion using reinforcement learning and optimal control | |
| Kalakrishnan et al. | Learning, planning, and control for quadruped locomotion over challenging terrain | |
| Yang et al. | Cajun: Continuous adaptive jumping using a learned centroidal controller | |
| JP5836565B2 (en) | Robot tracking and balancing system and method for mimicking motion capture data | |
| US20210162589A1 (en) | Systems and methods for learning agile locomotion for multiped robots | |
| CN110764416A (en) | Humanoid robot gait optimization control method based on deep Q network | |
| CN114047697B (en) | Four-foot robot balance inverted pendulum control method based on deep reinforcement learning | |
| Li et al. | Learning agile bipedal motions on a quadrupedal robot | |
| Park et al. | Inverse optimal control for humanoid locomotion | |
| CN118664586B (en) | Humanoid robot gait simulation learning method combined with periodical rewards | |
| Gao et al. | Global-position tracking control of a fully actuated nao bipedal walking robot | |
| CN118502405A (en) | Four-foot robot motion control method and system based on improved reinforcement learning reward function | |
| Yue | Learning locomotion for legged robots based on reinforcement learning: A survey | |
| Chen et al. | Deep reinforcement learning based co-optimization of morphology and gait for small-scale legged robot | |
| Mastrogeorgiou et al. | Slope handling for quadruped robots using deep reinforcement learning and toe trajectory planning | |
| Kuo et al. | Deep-reinforcement-learning-based gait pattern controller on an uneven terrain for humanoid robots | |
| Wang et al. | Reinforcement learning with imitative behaviors for humanoid robots navigation: synchronous planning and control | |
| Li et al. | Bound Controller for a Quadruped Robot using Pre-Fitting Deep Reinforcement Learning | |
| Li et al. | Model-based motion imitation for agile, diverse and generalizable quadupedal locomotion | |
| CN116795123A (en) | A method for quadruped robots to adapt to complex terrain | |
| Bussola et al. | Guided reinforcement learning for omnidirectional 3d jumping in quadruped robots | |
| Tirumala et al. | Gait library synthesis for quadruped robots via augmented random search | |
| Zhang et al. | A hierarchical reinforcement learning approach for adaptive quadruped locomotion of a rat robot | |
| Rossi et al. | Predicted Step Viability: A stability criterion for biped gait | |
| CN117032024A (en) | A motion control method for quadruped robot based on central pattern generator |