H-infinity Optimal Control and Related Minimax Design Problems
A Dynamic Game Approach
Tamer Basar and Pierre Bernhard
Second Edition
Birkhäuser, Boston, 1995
ISBN 0-8176-3814-8 (North America)
ISBN 3-7643-3814-8
The Preface and the Table of Contents of this book can be found below.
Preface
One of the major concentrated activities of the past decade in control theory
has been the
development of the so-called ``H-infinity optimal control theory,'' which
addresses the issue of worst-case controller design for linear plants
subject to unknown additive disturbances and plant uncertainties,
including problems of
disturbance attenuation, model matching, and tracking.
The mathematical symbol ``H-infinity'' stands for the Hardy space of
all complex-valued functions of a complex variable, which are analytic and
bounded in the open right-half complex plane. For a linear
(continuous-time, time-invariant) plant, the H-infinity norm of the
transfer matrix is the maximum of its largest singular value over all
frequencies.
Controller design problems where the H-infinity norm plays an
important role were
initially formulated by George Zames in the early 1980s, in the
context of sensitivity reduction in linear plants, with the design problem
posed as a mathematical optimization problem using an (H-infinity) operator
norm. Thus formulated originally in the frequency domain, the main tools
used during the early phases of research on this class of problems have been
operator and approximation theory, spectral factorization, and (Youla)
parametrization, leading initially to rather complicated (high-dimensional)
optimal or near-optimal (under the H-infinity norm) controllers.
Follow-up work in the mid-1980s has shown, however, that the maximum
McMillan degree of these controllers is in fact in the order of the
McMillan degree of the overall system transfer matrix, and further
work has shown that in a time-domain characterization of these controllers
(generalized) Riccati equations of the type that arises in linear-quadratic
differential games play a key role.
These findings have prompted further accelerated research on the topic,
for more direct time-domain (state-space) derivation of these results
--- a direction which has led also to more general formulations, including
time-varying plants and finite design horizons.
Among different time-domain approaches to this class of worst-case design
problems, the one that uses the framework of dynamic
( differential)
game theory seems to be the most natural. This is so because
the original H-infinity-optimal control problem (in its equivalent
time-domain formulation) is in fact a minimax optimization problem,
and hence a zero-sum game, where the controller can be viewed as
the minimizing
player and disturbance as the maximizing player. Using this framework,
we present in this book a complete theory that encompasses continuous-time
as well as discrete-time systems, finite as well as infinite horizons,
and several different measurement schemes, including
closed-loop perfect-state, delayed perfect-state, sampled-state, closed-loop
imperfect-state, delayed imperfect-state and sampled imperfect-state
information patterns.
We also discuss extensions of the linear theory to nonlinear systems,
and derivation of lower-dimensional controllers for systems with
regularly and singularly perturbed dynamics.
This is the second edition of our 1991 book with the same title, which, besides
featuring a more streamlined presentation of the results included in
the first edition, and at places under more refined conditions,
also contains substantial new material, reflecting new developments
in the field since 1991. Among these are
the nonlinear theory (Section 4.6; Chapters 5 and 6);
connections between H-infinity optimal control and risk-sensitive
stochastic control problems (Section 4.7);
H-infinity filtering for linear and nonlinear systems (Section 7.4); and
robustness considerations in the presence of regular and singular
perturbations (Chapter 8).
We also have included a rather detailed description of the relationship
between frequency- and time-domain approaches to robust controller
design (Section 1.3), and a complete set of results on the existence of value
and characterization of optimal policies in infinite-horizon LQ
differential games (Section 9.2), in addition to the complete set of results for
finite-horizon LQ differential games (involving the study of existence and
nonexistence of conjugate points) (Section 9.1).
As stated in the preface of the first edition,
the theory is now at a stage where it can be easily
incorporated into a second-level graduate course in a control curriculum,
that would follow a basic course in linear control theory covering
LQ and LQG designs. The framework adopted in this book makes such
an ambitious plan
possible, and indeed,
both authors have taught such courses during the last couple of years,
at the University of Illinois, Urbana-Champaign;
Ecole Superieure des Sciences de l'Informatique,
University of Nice; Institut Superieur d'Informatique et d'Automatique,
Ecole des Mines de Paris, Sophia Antipolis; and
Ecole Polytechnique, Paris.
For the most part, the only prerequisite for the book is
a basic knowledge of linear
control theory, at the level of, say, the texts by
Kwakernaak and Sivan (1972) or
Anderson and Moore (1990). No background in differential games, or game
theory in general, is required, as the requisite concepts and results have
been developed in the book at the appropriate level. The book is written
in such a way that makes it
possible to follow the theory for the continuous- and discrete-time
systems independently, so that a reader who is interested only in
discrete-time systems, for example, may skip Chapters 4 and 5 without
any disruption in the development of the theory. On the other hand, for
the benefit of the reader who is interested in both continuous- and
discrete-time results, we have included statements in each relevant
chapter that place the comparisons in proper perspective.
In addition to re-expressing our gratitute to the individuals and institutions
mentioned in the preface of the first edition of the book, we thank here
also many of our colleagues who have provided us with many valuable
comments and suggestions since the publication of the 1991 volume.
The first author would also like to acknowledge the Senior Scientist
Fellowship he received from the French Government during his
1994--95 sabbatical stay at INRIA, Sophia Antipolis, which facilitated
the collaboration of the two authors in the writing of this second
edition; the hospitality he received from INRIA during his stay
and the excellent working environment he found there contributed
immensely toward the completion of this project. The collaborative
research of the two authors on the topic of this book
was conducted under an NSF-INRIA international program, which also is
gratefully acknowledged.
Tamer Basar ----------- Pierre Bernhard
Urbana, Illinois, USA ----- Sophia Antipolis, Valbonne, France
May 2, 1995 ---------------- May 2, 1995
CONTENTS
Preface ..... (p. v)
Chapter 1: A General Introduction to Minimax (H-infinity Optimal)
Designs ..... (p. 1)
- A Brief History ..... (p. 1)
- Discrete- and Continuous-Time Models ..... (p. 5)
- Robust Controller Design ..... (p. 8)
- Robust disturbance rejection ..... (p. 8)
- Norms ..... (p. 12)
- Robust control design ..... (p. 14)
- A Relationship Between H-infinity Optimal Control and LQ Zero-Sum
Dynamic Games (p. 26)
- Organization of the Book ..... (p. 28)
- Conventions, Notation, and Terminology ..... (p. 30)
Chapter 2: Basic Elements of Static and Dynamic Games ..... (p. 33)
- Static Zero-Sum Games ..... (p. 33)
- Discrete-Time Dynamic Games ..... (p. 37)
- Continuous-Time Dynamic Games ..... (p. 44)
Chapter 3: The Discrete-Time Minimax Design Problem with Perfect-State Measurements ..... (p. 49)
- Introduction ..... (p. 49)
- The Soft-Constrained Linear-Quadratic Dynamic Game ..... (p. 51)
- Open-loop information structure for both players ..... (p. 51)
- Closed-loop perfect-state information for both players ..... (p. 53)
- Closed-loop 1-step delay information for both players ..... (p. 60)
- An illustrative example ..... (p. 63)
- Solution to the Disturbance Attenuation Problem ..... (p. 65)
- General closed-loop information ..... (p. 65)
- Illustrative example (continued) ..... (p. 69)
- A least favorable distribution for the disturbance ..... (p. 72)
- Optimum controller under the 1-step delay information pattern ..... (p. 73)
- The full-information controller ..... (p. 76)
- The Infinite-Horizon Case ..... (p. 77)
- More General Classes of Problems ..... (p. 96)
- More general plants and cost functions ..... (p. 96)
- Nonzero initial state ..... (p. 101)
- Extensions to Nonlinear Systems Under Nonquadratic Performance Indices ..... (p. 103)
- Summary of Main Results ..... (p. 104)
Chapter 4: Continuous-Time Systems with Perfect-State Measurements ..... (p. 107)
- Introduction ..... (p. 107)
- The Soft-Constrained Differential Game ..... (p. 109)
- Open-loop information structure for both players ..... (p. 109)
- Closed-loop perfect-state information for both players ..... (p. 115)
- Sampled-data information for both players ..... (p. 118)
- Delayed-state measurements ..... (p. 127)
- The Disturbance Attenuation Problem ..... (p. 129)
- Closed-loop perfect-state information ..... (p. 130)
- Sampled-state measurements ..... (p. 132)
- An illustrative example ..... (p. 135)
- Delayed-state measurements ..... (p. 137)
- The Infinite-Horizon Case ..... (p. 137)
- The soft-constrained differential game ..... (p. 138)
- The disturbance attenuation problem ..... (p. 150)
- Illustrative example (continued) ..... (p. 154)
- More General Classes of Problems ..... (p. 155)
- A more general cost structure ..... (p. 156)
- Unknown nonzero initial state ..... (p. 159)
- Nonlinear Systems and Nonquadratic Performance Indices ..... (p. 160)
- General nonlinear/nonquadratic structures ..... (p. 161)
- Special system dynamics where control and/or disturbance
enter linearly ........ (p. 166)
- Linearization of systems linear in u and w ..... (p. 171)
- Viscosity solutions ..... (p. 174)
- Connections with the Exponentiated Cost Stochastic Control Problem and Stochastic Differential Games ..... (p. 178)
- Risk-sensitive stochastic control problem ..... (p. 179)
- A class of stochastic differential games ..... (p. 182)
- Connections with the nonlinear H-infinity control problem ..... (p. 185)
- Main Points of the Chapter ..... (p. 186)
Chapter 5: The Continuous-Time Problem with Imperfect-State
Measurements ..... (p. 189)
- Formulation of the Problem ..... (p. 189)
- A Certainty Equivalence Principle and Its Application to the Basic
Problem P-sub-gamma ..... (p. 192)
- Sampled-Data Measurements ..... (p. 213)
- The Infinite-Horizon Case ..... (p. 218)
- More General Classes of Problems ..... (p. 228)
- More general outputs ..... (p. 228)
- Delayed measurements ..... (p. 231)
- Nonlinear/nonquadratic problems ..... (p. 236)
- Main Results of the Chapter ..... (p. 239)
Chapter 6: The Discrete-Time Problem with Imperfect-State
Measurements ........ (p. 243)
- The Problem Considered ..... (p. 243)
- A Certainty Equivalence Principle and Its Application to the
Basic Problem P-sub-gamma ..... (p. 247)
- The Infinite-Horizon Case ..... (p. 264)
- More General Classes of Problems ..... (p. 271)
- Cross terms in the performance index ..... (p. 271)
- Delayed measurements ..... (p. 274)
- Instantaneous feedback measurement ..... (p. 277)
- Nonlinear/nonquadratic problems ..... (p. 281)
- Main Points of the Chapter ..... (p. 282)
Chapter 7: Minimax Estimators and Performance Levels ..... (p. 285)
- Introduction ..... (p. 285)
- A Static Minimax Estimation Problem ..... (p. 288)
- Optimum Performance Levels ..... (p. 293)
- Discrete time ..... (p. 293)
- Continuous time ..... (p. 296)
- Continuous-Time H-infinity Filtering ..... (p. 298)
- Summary of Main Results ..... (p. 306)
Chapter 8: Robustness to Regular and Singular Perturbations ..... (p. 309)
- Introduction ..... (p. 309)
- Regular and Singular Perturbations, and Perfect-State Measurements ..... (p. 310)
- Problem formulation and spatial decomposition ..... (p. 310)
- A time-scale decomposition ..... (p. 316)
- Construction and optimality of composite controllers ..... (p. 328)
- Imperfect-State Measurements ..... (p. 331)
- Problem formulation and spatial decomposition ..... (p. 331)
- Time-scale decomposition and optimal slow controller ..... (p. 337)
- Summary of Main Results ..... (p. 345)
Chapter 9: Appendix A: Conjugate Points and Existence of Value
..... (p. 347)
- Conjugate Points in Finite-Horizon Games ..... (p. 347)
- Infinite-Horizon LQ Differential Games ..... (p. 361)
Chapter 10: Appendix B: Danskin's Theorem ..... (p. 383)
Bibliography ..... (p. 391)
List of Corollaries, Definitions, Facts, Lemmas,
Propositions, Remarks, and Theorems ..... (p. 405)
Index ..... (p. 409)