Paper Menu >>
Journal Menu >>
Journal of Mathematical Finance, 2011, 1, 8-13 doi:10.4236/jmf.2011.11002 Published Online May 2011 (http://www.SciRP.org/journal/jmf) Copyright © 2011 SciRes. JMF Legendre App roximation for Solving a Class of Nonlinear Optimal Control Problems Emran Tohidi, Omid Reza Navid Samadi, Mohammad Hadi Farahi Department of Applied Mathematics, School of Mathematical Sciences, Ferdowsi University of Mashhad, Mashhad, Iran E-mail: {emran.tohidi, om_na92}@stu-mail.um.ac.ir, farahi@math.um.ac.ir Received March 24, 2011; revised May 16, 2011; accepted M ay 19, 2011 Abstract This paper introduces a numerical technique for solving a class of optimal control problems containing non- linear dynamical system and functional of state variables. This numerical method consists of two major parts. In the first part, using linear combination property of intervals, we convert the nonlinear dynamical system into an equivalent linear system. And in the second part, which we are dealing with a linear dynamical sys- tem, using Legendre expansions for approximating both the state and associated control together with discre- tizing the constraints over the Chebyshev-Gauss-Lobatto points, the optimal control problem is transformed into a corresponding NLP problem which is diretly solved. The proposed idea is illustrated by several nu- merical examples. Keywords: Optimal Control, Legendre Polynomials, Linear Combination Property of Intervals, Chebyshev-Gauss-Lobatto Points, Nonlinear Programming 1. Introduction Optimal control theory is widely applied in aerospace, engineering, economics and other areas of science and has received considerable attention of researchers. Dur- ing the past two decades, enormous effort has been spent on the development of computational methods for gene- rating solutions of optimal control problems [1-8]. Al- though many computational methods have been devel- oped and proposed, modification of the existing methods and development of new methods should yet be explored to obtain accurate solutions successfully. The approaches to numerical solutions of optimal con- trol problems may be divided into two major classes: the indirect methods and the direct methods. The indirect methods are based on the pontryagin maximum principle and require the numerical solution of boundary value problems that result from the necessary conditions of optimal control [9]. For many practical optimization problems, these boundary value problems are quite dif- ficult to solve. In fact, the manner in which pontryagin maximum principle is used differs so significantly from one type of problem to another that no standard solution procedure can be devised. Therefore, one has to devise direct computational algorithms to solve optimal control problems. Direct optimization methods transcribe the (infinite- dimensional) continuous problem to a finite-dimensional nonlinear programming problem (NLP) through some parametrization of the state and/or control variables. In the direct methods, initial guesses have to be provided only for physically intuitive quantities such as the states and possibly controls. However, continuous advances in NLP algorithms and software have made these the me- thods of choice in many applications [10]. In this paper, we present a direct approach that based upon linear combination property of intervals and Le- gendre polynomials approximations together with the Chebyshev-Gauss-Lobatto (CGL) points as the colloca- tion nodes to determine the optimal trajectories of high order nonlinear ( possibly discontinuous ) dynamic sys- tems. The most important reason of CGL points consid- eration instead of Legendre-Gauss-Lobatto (LGL) points is that CGL’s have a closed-form formula, but, LGL's have no analytic forms. Our method consists of two ma- jor parts. In the first part, using linear combination prop- erty of intervals, we transform nonlinear dynamical sys- tem into a corresponding linear system. And in the second part, using general ideas of Razzaghi [11] (i.e., E. TOHIDI ET AL. Copyright © 2011 SciRes. JMF 9 Legendre method for linear systems), and applying the CGL points as collocation nodes for discretizing the lat- ter linear dynamical system and inequality state con- straints, the optimal control problem is converted into an NLP problem, which its parameters are the unknown Legendre coefficients. We also apply high-order Gauss- -lobatto quadrature rules [6] for approximating the inte- gral involved in the performance index in the discretiza- tion procedure. The advantages of recasting the optimal control problem as an NLP are: 1) the proposed method eliminates the requirement of solving a (2PBVP); 2) state and control inequality are easier to handle. Numerical examples are given to demonstrate the ap- plicability of the proposed technique. Moreover, a com- parison is made with optimal solutions obtained by the presented approach and a collocation method [12]. 2. Preliminaries 2.1. Properties of the Shifted Legendre Polynomials The Legendre polynomials which are orthogonal in the interval [−1,1] satisfy the following recurrence relation 11 21 =, 1 11 iii ii PxxPx Pxi ii (2.1.1) with 0=1Px and 1=Px x . In order to use these polynomials on the interval 0, h, one can apply the change of variables 2 =1 t xh in (2.1.1). Therefore, the shifted Legendre polynomials are constructed as follows 2 ˆ=1 ,. ii t PtPt oh h (2.1.2) The orthogonal property of shifted Legendre polyno- mials is given by 0 0 ˆˆd= = 21 h ij ij PtPt thij i (2.1.3) A function, f t, which is absolutely integrable within 0th may be expressed in terms of a shifted Legendre series as =0 ˆ =ii i f tfPt (2.1.4) where 0 21 ˆ =d. h ii i f ftPt t h (2.1.5) If we assume that the derivative of f t in Equation (2.1.4) is described by =0 ˆ =ii i f tgPt (2.1.6) the relationship between the coefficients i f in (2.1.4) and i g in (2.1.6) can be obtained as follows [11] 11 232122123 =0 =1,2, ii i higi giif i (2.1.7) Further, the product of two shifted Legendre poly- nomials ˆi Pt and ˆj Pt can be approximated by 0 ˆˆ ˆ = N ijijn n n PtPt Pt (2.1.8) where 0 21 ˆˆ ˆ =d, =0,1,,. h ijnij n nPtPtPtt nN h (2.1.9) 2.2. Linear Combination Property of Intervals This property states that every uniform continuous func- tion with a compact and connected domain can be writ- ten as a convex linear combination of its maximum and minimum. In other words, if and are the maxi- mum and minimum of the uniform continuous function H x, one can write =1, 01.Hx (2.2.1) 3. Problem Statement Consider the following nonlinear system =, x tAtxtHtut (3.1) with known initial and final conditions 0 0= x x, =h x hx , where x t and ut are 1n and 1q state and control vectors respectively, nn A tR and ut U where U is a compact and connected subset of q R. It is assumed that =nq and , H tut is a smooth or non-smooth continuous function over 0, hU . Moreover, there exists a pair of state and control variables , x tut such that satisfy (3.1) and two point boundary conditions 0 0= x x and =h x hx. The problem is to find the optimal control ut and the corresponding state trajectory x t, 0th , satisfy- ing Equation (3.1) while minimizing the cost functional 0 =,d. h J ftxt t (3.2) Two special cases of , f txt in (3.2) are ,= T f txtctxt and E. TOHIDI ET AL. Copyright © 2011 SciRes. JMF 10 1 ,= 2 T f txtxtqtxt . Also, with the assumption of enough smoothness one can consider the following inequality state constraint ,0.txt (3.3) 4. Linearization o f th e Dy na mi ca l S ys te m Since :0, n H hU R is a continuous function and 0, hU is a compact and connected subset of 1n R , then ,: H tutu U is a closed set in n R clearly. Thus, ,: i H tutu U for =1,2, ,in is closed in R. Now, suppose that the lower and upper bounds of the ,: i H tutu U are i g t and i wt respec- tively. Therefore, ,, 0,. ii i g tHtut wtth (4.1) In other words =,:, 0,, iui g tMinHtutuUth (4.2) =,:, 0,. iui wtMaxHtutu Uth (4.3) Using linear combination property of intervals, that explained briefly in Section 2, , i H tut can be ex- pressed as a convex linear combination of its minimum i g t and maximum i wt as follows ,= 1 , iiiii ii i H tutt wttgt ttgt (4.4) where = iii twtgt and 0,1 it . Note that according to Equation (4.4), it is the associated control variable. Now, the main problem with the assumption of ,= T f txtctxt is transformed into the following optimal control problem 0 min d hT ctxtt (4.5) Subject to =, 0,1xtAtxttt gtt (4.6) ,0,txt (4.7) 0 0=, =. h x xxh x (4.8) For solving the above-mentioned problem one can apply the Legendre polynomials together with Cheby- shev-Gauss-Lobatto (CGL) points (as collocation nodes). In the next section, the state variable x t and associated control variable t are expanded in terms of Legendre polynomials with unknown coefficients. Then, using CGL points as the collocation nodes the latter problem is converted to an NLP problem which its parameters are the unknown coefficients of the state and associated control. 5. Discretization A discretization of the interval 01 0=< <<= N s ssh is chosen, where =, =0,1,, 22 ii hh s ti N (5.1) with π =cos ii tN . Trivially, si’s are shifted CGL points in the interval 0, h. We use the following expansions to approximate both x t and associated control t =0 ˆ == , N Nii i x txt aPt (5.2) =0 ˆ == , N Nii i ttbPt (5.3) where ˆi Pt’s are the ith order shifted Legendre polynomials. To find the Legendre expansion coeffi- cients i c of the derivative N x t such that =0 =0 ˆˆ == NN Nii ii ii x taPtcPt (5.4) we use the recurrence relation (2.1.7). Using CGL points for discretizing dynamical system (4.6) together with the inequality state constraints (4.7) and boundedness of associated control t , the opti- mal control problem (4.5) - (4.8) is changed into the following NLP problem 01 min ,,, N La aa (5.5) Subject to =, =0,, NNN iiiiii x sAsxss sgs iN (5.6) ,0, 01, =0,, NN ii i s xss iN (5.7) 00 =0 =0 =1=, ==. NN i NN iih ii x saxxhax (5.8) TT 12 01 ,,,=,,, , NN aaaMcc c (5.9) where 01 ,,, N La aa is a linear objective function, 0 ˆˆ 0== 1 i ii PPs and ˆˆ ==1 iiN Ph Ps. Note that the constraints of (5.9) arise from the fol- lowing relations 11 =, =1,2,,. 221 23 ii i cc h aiN ii (5.10) E. TOHIDI ET AL. Copyright © 2011 SciRes. JMF 11 where 1==0 NN cc . Hint. After obtaining optimal state * x t and asso- ciated control *t , for evaluating optimal control * ut we use the following equation ** ,= . H tu tttgt (5.11) 6. Illustrative Examples In this section, we conduct two numerical examples to illustrate the effectiveness of the proposed method. We use the method that stated in Sections 4 and 5 to transform the main problems into the equivalent NLP problems, and comparisons of our solutions with a col- location method solutions [12] are presented. All the problems are programmed in MAPLE 12 and run on a PC with 1.8 GHz and 1 GB RAM. Example 6.1 We first consider a problem containing non-smooth function , H tut which indirect approa- ches (base upon pontryagin maximum principle) can not dealing with this case in a proper way. The problem is to find the control ut and the state x t which mini- mize 1 0 =sin2πed t J txtt (6.1) subject to 3sin 2π 52 =e t xtt ttxtut (6.2) with 1,1ut , 0=0.9x and 1=0.4x. Here 3sin 2π ,= e t Htut ut, and according to the (4.2) and (4.3) we have sin 2π =min,:1,1 = et u gtHtut u and =max,:1,1 =0 u wtHtut u , thus sin 2π ==e. t twtgt In Figures 1 and 2 we plot the optimized state x t and control ut for =11N. Also, the numerical results compared with the colloca- tion method [12] are listed in Table 1. From Table 1. one can see that our method achieves good result with a relatively smaller of nodes than [12]. Example 6.2 Find the control ut and the state x t, which minimize 1 0 1 =e2 d 2 t J txt t (6.3) subject to =ln 3xttxtut t (6.4) with 1,1ut , 0=0x and 1=0.8x. Here ,=ln 3Htutut t , and according to the (4.2) and (4.3) we have Figure 1. Optimal state x*(.) of example 6.1. Figure 2. Optimal control u*(.) of example 6.1. Table 1. Comparison of J* between methods. NLegendre Method Collocation Method [12] 6−0.0331943700 −0.0354240398 8−0.0346216260 −0.0356414811 10 −0.0370224908 −0.0368552070 11 −0.0387730644 −0.0377178920 =min,:1,1 =ln2 u gtHtut ut and =max,:1,1 =ln4 u wtHtut ut , thus 4 ==ln4ln2=ln 2 t twtgt ttt In Figures 3 and 4 we plot the optimized state x t and control ut for =12N. Also, the numerical results compared with the collocation method [12] are listed in Table 2. From Table 2. we see that the performance E. TOHIDI ET AL. Copyright © 2011 SciRes. JMF 12 Figure 3. Optimal state x*(.) of example 6.2. Figure 4. Optimal control u*(.) of example 6.2. Table 2. Comparison of J* between methods. N Legendre Method Collocation Method [12] 7 −0.1820295698 −0.1815318828 9 −0.1825806940 −0.1821287778 11 −0.1826754624 −0.1823052407 12 −0.1828354838 −0.1826714837 index got by our approach are better than those obtained by the method in [12]. 7. Conclusions The aim of the present work is the determination of the optimal control and state vectors by a direct method of solution based upon linear combination property of in- tervals and shifted Legendre series expansions together with the CGL points as collocation nodes respectively. The method is based upon reducing a nonlinear optimal control problem to an NLP. The unity of the weight function of orthogonality for shifted Legendre series and the simplicity of the discretization are merits that make the approach very attractive. Moreover, only a small number of shifted Legendre series is needed to obtain a very satisfactory solution. The given numerical examples supports this claim. 8. References [1] R. R. Bless, D. H. Hoges and H. Seywald, “Finite Ele- ment Method for the Solution of State-Constrained Op- timal Control Problems,” Journal of Guidance, Control, and Dynamics, Vol. 18, No. 5, 1995, pp. 1036-1043. doi:10.2514/3.21502 [2] R. Bulrisch and D. Kraft, “Computational Optimal Con- trol,” Birkhauser, Boston, 1994. [3] G. Elnagar, M. A. Kazemi and M. Razzaghi, “The Pseu- dospectral Legendre Method for Discretizing Optimal Control Problems,” IEEE Transactions on Automatic C on- trol, Vol. 40, No. 10, 1995, pp. 1793-1796. doi:10.1109/9.467672 [4] G. N. Elnagar and M. A. Kazemi, “Pseudospectral Che- byshev Optimal Control of Constrained Nonlinear Dy- namical Systems,” Computational Optimization and Ap- plications, Vol. 11, No. 2, 1998, pp. 195-217. doi:10.1109/9.467672 [5] W. W. Hager, “Multiplier Methods for Nonlinear Optim- al Control Problems,” SIAM Journal on Numerical Anal, Vol. 27, No. 4, 1990, pp. 1061-1080. doi:10.1137/0727063 [6] A. L. Herman and B. A. Conway, “Direct Trajectory Optimization Using Collocation Based on High-Order Gauss-Lobatoo Quadrature Rules,” Journal of Guidance, Control, and Dynamics, and Dynamics, Vol. 19, No. 3, 1996, pp. 592-599. doi:10.2514/3.21662 [7] M. Ross and F. Fahroo, “Legendre Pseudospectral Ap- proximations of Optimal Control Problems,” Lecture Note s in Control and Information Sciences, Vol. 295, No. 1, 2003, pp. 327-342. [8] J. Vlassenbroeck and R. V. Dooren, “A Chebyshev Tech- nique for Solving Nonlinear Optimal Control Problems,” IEEE Transactions on Automatic Control, Vol. 33, No. 4, 1988, pp. 333-340. 10.1109/9.192187 [9] H. Seywald and R. R. Kumar, “Finite Difference Scheme for Automatic Costate Calculation,” Journal of Guidance, Control, and Dynamics, and Dynamics, Vol. 19, No. 1, 1996, pp. 231-239. doi:10.2514/3.21603 [10] K. Schittkowskki, “NLPQL: A Fortran Subroutine for E. TOHIDI ET AL. Copyright © 2011 SciRes. JMF 13 Solving Constrained Nonlinear Programming Problems,” Operations Research Annals, Vol. 5, No. 2, 1985, pp. 385-400. [11] M. Razzaghi and G. Elnagar, “A Legendre Technique For Solving Time-Varing Linear Quadratic Optimal Control Problems,” Journal of the Franklin Institute, Vol. 330, No. 3, 1993, pp. 453-463. doi:10.1016/0016-0032(93)90092-9 [12] O. V. Stryk, “Numerical Solution of Optimal Control Problems by Direct Collocation,” In: R. Bulrisch, A. Miele, J. Stoer and K. H. Well, Eds., Optional Control of Variations, Optimal Control Theory and Numerical Me- thods, International Series of Numerical Mathematics, Birkhäuser Verlag, Basel, 1993, pp. 129-143. |