Publications about 'steepest descent' |
Articles in journal or book chapters |
Recent work on data-driven control and reinforcement learning has renewed interest in a relatively old field in control theory: model-free optimal control approaches which work directly with a cost function and do not rely upon perfect knowledge of a system model. Instead, an "oracle" returns an estimate of the cost associated to, for example, a proposed linear feedback law to solve a linear-quadratic regulator problem. This estimate, and an estimate of the gradient of the cost, might be obtained by performing experiments on the physical system being controlled. This motivates in turn the analysis of steepest descent algorithms and their associated gradient differential equations. This paper studies the effect of errors in the estimation of the gradient, framed in the language of input to state stability, where the input represents a perturbation from the true gradient. Since one needs to study systems evolving on proper open subsets of Euclidean space, a self-contained review of input to state stability definitions and theorems for systems that evolve on such sets is included. The results are then applied to the study of noisy gradient systems, as well as the associated steepest descent algorithms. |
This paper proposes a simple numerical technique for the steering of arbitrary analytic systems with no drift. It is based on the generation of "nonsingular loops" which allow linearized controllability along suitable trajetories. Once such loops are available, it is possible to employ standard Newton or steepest descent methods, as classically done in numerical control. The theoretical justification of the approach relies on recent results establishing the genericity of nonsingular controls, as well as a simple convergence lemma. |
Conference articles |
This paper proposes a technique for the control of analytic systems with no drift. It is based on the generation of "nonsingular loops" which allow linearized controllability. Once such loops are available, it is possible to employ standard Newton or steepest descent methods. The theoretical justification of the approach relies on results on genericity of nonsingular controls as well as a simple convergence lemma. |
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders.
This document was translated from BibTEX by bibtex2html