Applied Mathematics
Vol.06 No.08(2015), Article ID:58051,7 pages
10.4236/am.2015.68119
On the Iterative Solution to H¥ Control Problems
Ivan G. Ivanov1,2, Ivelin G. Ivanov2, Nikolay C. Netov1
1Faculty of Economics and Business Administration, Sofia University “St. Kliment Ohridski”, Sofia, Bulgaria
2Pedagogical College Dobrich, Shoumen University, Shoumen, Bulgaria
Email: i_ivanov@feb.uni-sofia.bg, iwelin.ivanow@gmail.com, nnetoff@feb.uni-sofia.bg
Copyright © 2015 by authors and Scientific Research Publishing Inc.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/



Received 28 May 2015; accepted 14 July 2015; published 17 July 2015
ABSTRACT
This paper addresses the problem for solving a Continuous-time Riccati equation with an indefinite sign of the quadratic term. Such an equation is closely related to the so called full information H¥ control of linear time-invariant system with external disturbance. Recently, a simultaneous policy update algorithm (SPUA) for solving H¥ control problems is proposed by Wu and Luo (Simultaneous policy update algorithms for learning the solution of linear continuous-time H¥ state feedback control, Information Sciences, 222, 472-485, 2013). However, the crucial point of their method is to find an initial point, which ensuring the convergence of the method. We will show one example where Wu and Luo’s method is not effective and it converges to an indefinite solution. Three effective methods for computing the stabilizing solution to the considered equation are investigated. Computer realizations of the presented methods are numerically compared on the computational platforms MATLAB and SCILAB.
Keywords:
Continuous-Time Riccati Equation, Indefinite Sign, Iterative Computation, Stabilizing Solution

1. Introduction
The continuous-time algebraic Riccati equations and their extensions have been investigated extensively in the literature. Recently, the H¥ control problem was solved for linear time-invariant system [1] -[3] and for stoch- astic systems [4] -[7] .
Wu and Luo [8] have commented the iterative solution of the following continuous-time algebraic Riccati equation
(1)
Note that this equation has indefinite quadratic part. Assume there exists a positive semidefinite solution
to (1) with property that real parts of eigenvalues of
are negative. Such type solution is called a stabilizing solution.
The H¥ linear quadratic problems have been introduces by Basar and Bernhard [9] as a two-player zero sum gane. We consider a model for a a two-player zero-sum game, where the control function
is a minimizing player (or a controller player) of the functional
and the disturbance function
is a maximizing player (or a disturbance player), where

The controller player aims to minimize the
and the disturbance player aims to maximize the
under a constrain of the system:
(2)
Knowing the stabilizing solution
to (1) we define the following functions:

The functions
have the property

And thus they form the equilibrium point of the two-player zero-sum game described by (2) and the functional
. This fact is well known in the literature and it can be derived using the Pontryagin’s Maximum Principle for example. Moreover, the stabilizing solution is very important solution to Equation (1).
So, why we need to study the iterative equations for computing the stabilizing equation to (1)? Many re- searchers have investigated Riccati Equation (1) and more specially how to compute his stabilizing solution. Lanzon et al. [1] have proposed two effective methods. The first method constructs two matrix sequences where the first sequence converges to the stabilizing solution. The second method avoids the second matrix sequence and defines one matrix sequence which directly approximates the stabilizing solution. Later, Wu and Luo [8] have studied the same equation and the proposed method in [1] . They have commented that the second Lanzon’s method (it is Algorithm 2 [8] ) is not fully effective and by this reason they have introduced the new method described as Algorithm 4 in their paper [8] . Here, we consider an example where these two algorithms will be compared.
Example 1. Let us we take the following matrix coefficients to (1) (using the MATLAB notations):
;





We execute Algorithm 2 [8] and Algorithm 4 [8] with the initial point
And the solution computed by Algorithm 4 is
Note that the matrix



indefinite and

We compute eigenvalues of



and the matrix



solution with
solution


So, this example gives us the conclusion that the Algorithm 4 described in [8] compute only a solution to (1) and this solution is not always positive definite and this solution is not always stabilizing.
In this reason we confirm that the Lanzon’s method [1] is an effective method for computing the stabilizing solution. His main essential feature is that the iterative process includes two iterative loops-the out loop and the inner loop. We extend the ideas described by Lanzon et al. [1] and Feng and Anderson [6] to propose iterative methods where one matrix sequence is constructed. Here we introduce additional two iterative methods which lead directly to the stabilizing solution. Our contribution is to apply two computational schemes for realization the first iterative equation. Moreover, the second iterative equation is a new method for computing the stabi- lizing solution to (1). We present a few examples for testing the introduced recurrence equations on the MATLAB and SCILAB computational platforms.
We write




2. Iterative Methods for Stabilizing Solution to (1)
The first method is the Lanzon’s method [1] and Algorithm 2 from [8] . We present the main theorem with pro-
perties for constructing two matrix sequences of positive semidefinite matrices
sequences are constructed as follows. We take

We find


where
The matrices

Theorem 1 Assume there exist symmetric matrices






defined by (3), (4) are satisfied for
1)

2)
3) the matrix


4)
Proof. The proof follows the proof of Theorem 3 from [1] .
Theorem 1 presents sufficient conditions for the equation


Further on, we consider an alternative iteration process where one matrix sequence is constructed. Consider the behaviour of the controller player (u(t)). Assume the controller player knows the matrix
find

And the functional

The corresponding Riccati equation regarding to


where
Based on recurrence Equation (5) we derive the following new iteration:

with

with


The notation “(6) + care” means that the iteration (6) is solved as a Ricacti equation with unknown matrix






Thus, Equation (6) can be considered as a new iteration formula. This equation constructs a new matrix
sequence

Equation (6) is obtained from (4) when we substitute


the matrix pair


finite. This is enough to start iterative process (7) with
Further on, we extend the idea for constructing the matrix sequence
in (4) we obtain:
Next, we extricate the term

manipulations. We derive

We apply the following implementation for the latest recurrence equation:

Our thoughts and algebraic manipulations for deriving recurrence Equation (8) show that it is equivalent to the main iterative process (3)-(4). Thus iteration (9) constructs a new matrix sequence which converges to the stabilizing solution of (1). In order to execute iteration (9) we apply the following algorithm:
1) We take

2) We compute


3) For

a) Compute

b) Find


c) Algorithm stops when the inequality

4) The stabilizing solution is
3. Numerical Experiments
We carry out experiments for solving a continuous-time algebraic Riccati equation with an indefinite quadratic
term (1). We construct two matrix sequences


sequence is computed using the iterative process (3)-(4). Iteration (4) is a Riccati equation and


per one iteration.
Moreover, we have carried out experiments in the open source software SCILAB
http://www.scilab.org/scilab/about. It provides a computing environment for scientific applications. There are functions for solving linear and nonlinear matrix equations. We apply the “ricc” function for solving a con- tinuous Riccati equation and “lyap” function for a linear Lyapunov equation.
Our experiments are executed in MATLAB on a 2.20 GHz Intel (R) Core (TM) i7-4702MQ CPU computer. We use two variables tolR and tol for small positive numbers to control the accuracy of computations. We
denote


is satisfied for some


For our purpose we have executed hundred runs of each value of n for two family of examples. The tables report the maximal number It of iterations for which the inequality





Example 2. We consider a family of examples in case

Results from experiments in Example 2 are given in Table 1 with

Example 3. We consider a family of examples in case

Results from experiments for Example 3 are given in Table 2 with

The application of all iterative methods shows that they achieve the same accuracy for different number of iterations. Our conclusions based on experiments are:
1) The execution the iterations (3), (4) and “(6) + care” takes almost the same CPU time (see the corresponding
Table 1. Example 2. Results from 100 runs for each value of n.
Table 2. Example 3. Results from 100 runs for each value of n.
columns of the tables). Note that the procedure care in these iterations have to be applied;
2) Iterations based on the solution of Lyapunov equations faster than the iterations based on the solution of Riccati equations;
3) The new iteration (9) is fastest than other iterative methods;
4) Comparing the MATLAB Execution and the SCILAB Execution we note the MATLAB implementations of the considered iterative methods are faster than the same executed in the SCILAB environment. However, the SCILAB implementations achieve the same accuracy and based on the fact it is an open source software we deduce the SCILAB is an useful tool for education to master and PhD students.
The conclusions are indicated by implemented numerical simulations.
4. Conclusion
We have studied two iterative processes for finding the stabilizing solution to generalized Riccati Equations (2). We have made numerical experiments for computing this solution and we have compared the considered methods numerically. We have compared the results from the experiments in regard of number of iterations and CPU time for executing. Our numerical experiments confirm the effectiveness of proposed new method (9). It is introduced here and moreover numerical experiments show its efficiency.
Acknowledgements
The present research paper was supported in a part by the EEA Scholarship Programme BG09 Project Grant D03-91 under the European Economic Area Financial Mechanism.
Cite this paper
Ivan G.Ivanov,Ivelin G.Ivanov,Nikolay C.Netov, (2015) On the Iterative Solution to H∞ Control Problems. Applied Mathematics,06,1263-1270. doi: 10.4236/am.2015.68119
References
- 1. Lanzon, A., Feng, Y., Anderson, B. and Rotkowitz, M. (2008) Computing the Positive Stabilizing Solution to Algebraic Riccati Equations with an Indefinite Quadratic Term via a Recursive Method. IEEE Transactions on Automatic Control, 53, 2280-2291.
http://dx.doi.org/10.1109/TAC.2008.2006108 - 2. Praveen, P. and Bhasin, S. (2013) Online Partially Model-Free Solution of Two-Player Zero Sum Differential Games, Preprints of the 10th IFAC International Symposium on Dynamics and Control of Process Systems. The International Federation of Automatic Control.
- 3. Vrabie, D. and Lewis, F. (2011) Adaptive Dynamic Programming for Online Solution of a Zero-Sum Differential Game. Journal of Control Theory and Applications, 9, 353-360.
http://dx.doi.org/10.1007/s11768-011-0166-4 - 4. Dragan, V., Freiling, G., Morozan, T. and Stoica, A.-M. (2008) Iterative Algorithms for Stabilizing Solutions of Game Theoretic Riccati Equations of Stochastic Control. Proceedings of the 18th International Symposium on Mathematical Theory of Networks & Systems.
http://scholar.lib.vt.edu/MTNS/Papers/078.pdf - 5. Dragan, V. and Ivanov, I. (2011) Computation of the Stabilizing Solution of Game Theoretic Riccati Equation Arising in Stochastic H∞ Control Problems. Numerical Algorithms, 57, 357-375.
http://dx.doi.org/10.1007/s11075-010-9432-7 - 6. Feng, Y.T. and Anderson, B.D.O. (2010) An Iterative Algorithm to Solve State-Perturbed Stochastic Algebraic Riccati Equations in LQ Zero-Sum Games. Systems & Control Letters, 59, 50-56.
http://dx.doi.org/10.1016/j.sysconle.2009.11.006 - 7. Zhu, H.-N., Zhang, C.-K. and Bin, N. (2012) Infinite Horizon LQ Zero-Sum Stochastic Differential Games with Markovian Jumps. Applied Mathematics, 3, 1321-1326.
http://dx.doi.org/10.4236/am.2012.330188 - 8. Wu, H.-N. and Luo, B. (2013) Simultaneous Policy Update Algorithms for Learning the Solution of Linear Continuous-Time H∞ State Feedback Control. Information Sciences, 222, 472-485.
http://dx.doi.org/10.1016/j.ins.2012.08.012 - 9. Basar, T. and Bernhard, P. (1995) H∞ Optimal Control and Related Minimax Design Problems: A Dynamic Game Approach. Systems & Control: Foundations and Applications Series, Birkhauser, Boston, 1995.
- 10. Lancaster, P. and Rodman, L. (1995) Algebraic Riccati Equations. Clarendon Press, Oxford.

















