H-infinity Optimal Control and Related Minimax Design Problems

A Dynamic Game Approach

Tamer Basar and Pierre Bernhard

Second Edition
Birkhäuser, Boston, 1995


ISBN 0-8176-3814-8 (North America)
ISBN 3-7643-3814-8


The Preface and the Table of Contents of this book can be found below.

Preface

One of the major concentrated activities of the past decade in control theory has been the development of the so-called ``H-infinity optimal control theory,'' which addresses the issue of worst-case controller design for linear plants subject to unknown additive disturbances and plant uncertainties, including problems of disturbance attenuation, model matching, and tracking. The mathematical symbol ``H-infinity'' stands for the Hardy space of all complex-valued functions of a complex variable, which are analytic and bounded in the open right-half complex plane. For a linear (continuous-time, time-invariant) plant, the H-infinity norm of the transfer matrix is the maximum of its largest singular value over all frequencies.

Controller design problems where the H-infinity norm plays an important role were initially formulated by George Zames in the early 1980s, in the context of sensitivity reduction in linear plants, with the design problem posed as a mathematical optimization problem using an (H-infinity) operator norm. Thus formulated originally in the frequency domain, the main tools used during the early phases of research on this class of problems have been operator and approximation theory, spectral factorization, and (Youla) parametrization, leading initially to rather complicated (high-dimensional) optimal or near-optimal (under the H-infinity norm) controllers. Follow-up work in the mid-1980s has shown, however, that the maximum McMillan degree of these controllers is in fact in the order of the McMillan degree of the overall system transfer matrix, and further work has shown that in a time-domain characterization of these controllers (generalized) Riccati equations of the type that arises in linear-quadratic differential games play a key role. These findings have prompted further accelerated research on the topic, for more direct time-domain (state-space) derivation of these results --- a direction which has led also to more general formulations, including time-varying plants and finite design horizons.

Among different time-domain approaches to this class of worst-case design problems, the one that uses the framework of dynamic ( differential) game theory seems to be the most natural. This is so because the original H-infinity-optimal control problem (in its equivalent time-domain formulation) is in fact a minimax optimization problem, and hence a zero-sum game, where the controller can be viewed as the minimizing player and disturbance as the maximizing player. Using this framework, we present in this book a complete theory that encompasses continuous-time as well as discrete-time systems, finite as well as infinite horizons, and several different measurement schemes, including closed-loop perfect-state, delayed perfect-state, sampled-state, closed-loop imperfect-state, delayed imperfect-state and sampled imperfect-state information patterns. We also discuss extensions of the linear theory to nonlinear systems, and derivation of lower-dimensional controllers for systems with regularly and singularly perturbed dynamics.

This is the second edition of our 1991 book with the same title, which, besides featuring a more streamlined presentation of the results included in the first edition, and at places under more refined conditions, also contains substantial new material, reflecting new developments in the field since 1991. Among these are the nonlinear theory (Section 4.6; Chapters 5 and 6); connections between H-infinity optimal control and risk-sensitive stochastic control problems (Section 4.7); H-infinity filtering for linear and nonlinear systems (Section 7.4); and robustness considerations in the presence of regular and singular perturbations (Chapter 8). We also have included a rather detailed description of the relationship between frequency- and time-domain approaches to robust controller design (Section 1.3), and a complete set of results on the existence of value and characterization of optimal policies in infinite-horizon LQ differential games (Section 9.2), in addition to the complete set of results for finite-horizon LQ differential games (involving the study of existence and nonexistence of conjugate points) (Section 9.1).

As stated in the preface of the first edition, the theory is now at a stage where it can be easily incorporated into a second-level graduate course in a control curriculum, that would follow a basic course in linear control theory covering LQ and LQG designs. The framework adopted in this book makes such an ambitious plan possible, and indeed, both authors have taught such courses during the last couple of years, at the University of Illinois, Urbana-Champaign; Ecole Superieure des Sciences de l'Informatique, University of Nice; Institut Superieur d'Informatique et d'Automatique, Ecole des Mines de Paris, Sophia Antipolis; and Ecole Polytechnique, Paris.

For the most part, the only prerequisite for the book is a basic knowledge of linear control theory, at the level of, say, the texts by Kwakernaak and Sivan (1972) or Anderson and Moore (1990). No background in differential games, or game theory in general, is required, as the requisite concepts and results have been developed in the book at the appropriate level. The book is written in such a way that makes it possible to follow the theory for the continuous- and discrete-time systems independently, so that a reader who is interested only in discrete-time systems, for example, may skip Chapters 4 and 5 without any disruption in the development of the theory. On the other hand, for the benefit of the reader who is interested in both continuous- and discrete-time results, we have included statements in each relevant chapter that place the comparisons in proper perspective.

In addition to re-expressing our gratitute to the individuals and institutions mentioned in the preface of the first edition of the book, we thank here also many of our colleagues who have provided us with many valuable comments and suggestions since the publication of the 1991 volume. The first author would also like to acknowledge the Senior Scientist Fellowship he received from the French Government during his 1994--95 sabbatical stay at INRIA, Sophia Antipolis, which facilitated the collaboration of the two authors in the writing of this second edition; the hospitality he received from INRIA during his stay and the excellent working environment he found there contributed immensely toward the completion of this project. The collaborative research of the two authors on the topic of this book was conducted under an NSF-INRIA international program, which also is gratefully acknowledged.

Tamer Basar ----------- Pierre Bernhard
Urbana, Illinois, USA ----- Sophia Antipolis, Valbonne, France
May 2, 1995 ---------------- May 2, 1995


CONTENTS

Preface ..... (p. v)

Chapter 1: A General Introduction to Minimax (H-infinity Optimal) Designs ..... (p. 1)

  1. A Brief History ..... (p. 1)
  2. Discrete- and Continuous-Time Models ..... (p. 5)
  3. Robust Controller Design ..... (p. 8)
  4. A Relationship Between H-infinity Optimal Control and LQ Zero-Sum Dynamic Games (p. 26)
  5. Organization of the Book ..... (p. 28)
  6. Conventions, Notation, and Terminology ..... (p. 30)
Chapter 2: Basic Elements of Static and Dynamic Games ..... (p. 33)
  1. Static Zero-Sum Games ..... (p. 33)
  2. Discrete-Time Dynamic Games ..... (p. 37)
  3. Continuous-Time Dynamic Games ..... (p. 44)
Chapter 3: The Discrete-Time Minimax Design Problem with Perfect-State Measurements ..... (p. 49)
  1. Introduction ..... (p. 49)
  2. The Soft-Constrained Linear-Quadratic Dynamic Game ..... (p. 51)
  3. Solution to the Disturbance Attenuation Problem ..... (p. 65)
  4. The Infinite-Horizon Case ..... (p. 77)
  5. More General Classes of Problems ..... (p. 96)
  6. Extensions to Nonlinear Systems Under Nonquadratic Performance Indices ..... (p. 103)
  7. Summary of Main Results ..... (p. 104)
Chapter 4: Continuous-Time Systems with Perfect-State Measurements ..... (p. 107)
  1. Introduction ..... (p. 107)
  2. The Soft-Constrained Differential Game ..... (p. 109)
  3. The Disturbance Attenuation Problem ..... (p. 129)
  4. The Infinite-Horizon Case ..... (p. 137)
  5. More General Classes of Problems ..... (p. 155)
  6. Nonlinear Systems and Nonquadratic Performance Indices ..... (p. 160)
  7. Connections with the Exponentiated Cost Stochastic Control Problem and Stochastic Differential Games ..... (p. 178)
  8. Main Points of the Chapter ..... (p. 186)
Chapter 5: The Continuous-Time Problem with Imperfect-State Measurements ..... (p. 189)
  1. Formulation of the Problem ..... (p. 189)
  2. A Certainty Equivalence Principle and Its Application to the Basic Problem P-sub-gamma ..... (p. 192)
  3. Sampled-Data Measurements ..... (p. 213)
  4. The Infinite-Horizon Case ..... (p. 218)
  5. More General Classes of Problems ..... (p. 228)
  6. Main Results of the Chapter ..... (p. 239)
Chapter 6: The Discrete-Time Problem with Imperfect-State Measurements ........ (p. 243)
  1. The Problem Considered ..... (p. 243)
  2. A Certainty Equivalence Principle and Its Application to the Basic Problem P-sub-gamma ..... (p. 247)
  3. The Infinite-Horizon Case ..... (p. 264)
  4. More General Classes of Problems ..... (p. 271)
  5. Main Points of the Chapter ..... (p. 282)
Chapter 7: Minimax Estimators and Performance Levels ..... (p. 285)
  1. Introduction ..... (p. 285)
  2. A Static Minimax Estimation Problem ..... (p. 288)
  3. Optimum Performance Levels ..... (p. 293)
  4. Continuous-Time H-infinity Filtering ..... (p. 298)
  5. Summary of Main Results ..... (p. 306)
Chapter 8: Robustness to Regular and Singular Perturbations ..... (p. 309)
  1. Introduction ..... (p. 309)
  2. Regular and Singular Perturbations, and Perfect-State Measurements ..... (p. 310)
  3. Imperfect-State Measurements ..... (p. 331)
  4. Summary of Main Results ..... (p. 345)
Chapter 9: Appendix A: Conjugate Points and Existence of Value ..... (p. 347)
  1. Conjugate Points in Finite-Horizon Games ..... (p. 347)
  2. Infinite-Horizon LQ Differential Games ..... (p. 361)
Chapter 10: Appendix B: Danskin's Theorem ..... (p. 383)

Bibliography ..... (p. 391)

List of Corollaries, Definitions, Facts, Lemmas, Propositions, Remarks, and Theorems ..... (p. 405)

Index ..... (p. 409)