Applied Mathematics, 2010, 1, 8-17
doi:10.4236/am.2010.11002 Published Online May 2010 (http://www.SciRP.org/journal/am)

A Modified Limited SQP Method for Constrained Optimization*

Gonglin Yuan1, Sha Lu2, Zengxin Wei1
1 Department of Mathematics and Information Science, Guangxi University, Nanning, China
2 School of Mathematics Science, Guangxi Teacher's Education University, Nanning, China
E-mail: glyuan@gxu.edu.cn
Received December 23, 2009; revised February 24, 2010; accepted March 10, 2010

Abstract

In this paper, a modified variant of the limited memory SQP method is presented for constrained optimization. The method exploits not only gradient information but also function value information. Moreover, it requires no more function or derivative evaluations than the standard SQP method, and hardly more storage or arithmetic operations. Under suitable conditions, global convergence is established.

Keywords: Constrained Optimization, Limited Memory Method, SQP Method, Global Convergence

* This work is supported by Chinese NSF grant 10761001, the Scientific Research Foundation of Guangxi University (Grant No. X081082), and Guangxi SF grant 0991028.

1. Introduction

Consider the constrained optimization problem

\[ \min f(x) \quad \text{s.t.}\quad h_i(x)=0,\ i\in E,\qquad g_j(x)\ge 0,\ j\in I, \tag{1} \]

where $f,h_i,g_j:R^n\to R$ are twice continuously differentiable, $E=\{1,2,\dots,m\}$, $I=\{m+1,m+2,\dots,m+l\}$, and $l>0$ is an integer. Let the Lagrangian function be defined by

\[ L(x,\lambda,\mu)=f(x)-\lambda^T g(x)-\mu^T h(x), \tag{2} \]

where $\lambda$ and $\mu$ are multipliers. Obviously, the Lagrangian function $L$ is twice continuously differentiable. Let $S$ be the feasible set of problem (1). We define $I^*$ to be the set of subscripts of the inequality constraints that are active at $x$, i.e., $I^*=\{i\mid i\in I\ \text{and}\ g_i(x)=0\}$.

It is well known that SQP methods for solving twice continuously differentiable nonlinear programming problems are essentially Newton-type methods for finding Kuhn-Tucker points of such problems. In recent years, SQP methods have been much studied [1-8]. Powell [5] gave the BFGS-Newton-SQP method for nonlinearly constrained optimization; he gave sufficient conditions under which the SQP method yields a 2-step Q-superlinear convergence rate (assuming convergence), but did not show that his modified BFGS method satisfies these conditions. Coleman and Conn [3,7] gave a locally convergent quasi-Newton-SQP method for equality constrained nonlinear programming problems, and its local 2-step Q-superlinear convergence was established. Sun [6] proposed a quasi-Newton-SQP method for general LC1 constrained problems and presented sufficient conditions for local convergence and for superlinear convergence, but did not prove whether the modified BFGS-quasi-Newton-SQP method satisfies those conditions. We note that the BFGS update exploits only gradient information, while the available function value information of the Lagrangian function (2) is neglected.

If the feasible set is the whole space $R^n$, then problem (1) is called an unconstrained optimization problem (UNP). There are many methods [9-13] for the UNP, among which the BFGS method is one of the most effective quasi-Newton methods. The normal BFGS update likewise exploits only gradient information, while the available function value information is neglected. In recent years, many modified BFGS methods (see [14-19]) have been proposed for the UNP.
In particular, many efficient attempts have been made to modify the usual quasi-Newton methods using both gradient and function value information (e.g. [19,20]). Lately, in order to get higher order accuracy in approximating the second-order curvature of the objective function, Wei, Yu, Yuan, and Lian [18] proposed a new BFGS-type method for the UNP, and the reported numerical results show that its average performance is better than that of the standard BFGS method. The superlinear convergence of this modified method has been established for uniformly convex functions, and its global convergence was established by Wei, Li, and Qi [20]. Motivated by these ideas, Yuan and Wei [21] presented a modified BFGS method which ensures that the update matrices are positive definite for general convex functions; moreover, its global convergence is proved for general convex functions.

The limited memory BFGS (L-BFGS) method (see [22]) is an adaptation of the BFGS method for large-scale problems. The implementation is almost identical to that of the standard BFGS method; the only difference is that the inverse Hessian approximation is not formed explicitly, but is defined by a small number of BFGS updates. It often provides a fast rate of linear convergence and requires minimal storage.

Inspired by the modified method of [21], we combine this technique with the limited memory technique and give a limited memory SQP method for constrained optimization. The global convergence of the proposed method is established for general convex functions. The major contribution of this paper is an extension of the method of [21] for the UNP to constrained optimization problems. Unlike the standard SQP method, a distinguishing feature of our proposed method is that a triple $\{s_i,\tilde y_i,\tilde A_i\}$ is stored, where $s_i=x_{i+1}-x_i$, $\tilde y_i=\nabla_x L(z_{i+1})-\nabla_x L(z_i)+\max\{\tilde A_i,0\}s_i$, $z_{i+1}=(x_{i+1},\lambda_{i+1},\mu_{i+1})$, and $z_i=(x_i,\lambda_i,\mu_i)$; here $\lambda_i$ and $\mu_i$ are the multipliers associated with the Lagrangian function at $x_i$, $\lambda_{i+1}$ and $\mu_{i+1}$ are those associated with the Lagrangian function at $x_{i+1}$, and $\tilde A_i$ is a scalar determined by Lagrangian function values. Compared with the standard SQP method, the presented method requires no more function or derivative evaluations, and hardly more storage or arithmetic operations.

This paper is organized as follows. In the next section, we briefly review the modified BFGS update and the L-BFGS update for the UNP. In Section 3, we describe the modified limited memory SQP algorithm for (1). The global convergence is established in Section 4. In the last section, we give a conclusion. Throughout this paper, $\|\cdot\|$ denotes the Euclidean norm of a vector or matrix.

2. Modified BFGS Update and the L-BFGS Update for the UNP

We state the modified BFGS update and the L-BFGS update for the UNP in the following subsections.

2.1. Modified BFGS Update

Quasi-Newton methods for solving the UNP need to update the iterate matrix $B_k$.
Traditionally, $\{B_k\}$ satisfies the following quasi-Newton equation:

\[ B_{k+1}s_k=y_k, \tag{3} \]

where $s_k=x_{k+1}-x_k$ and $y_k=\nabla f(x_{k+1})-\nabla f(x_k)$. The most famous update of $B_k$ is the BFGS formula

\[ B_{k+1}=B_k-\frac{B_k s_k s_k^T B_k}{s_k^T B_k s_k}+\frac{y_k y_k^T}{y_k^T s_k}. \tag{4} \]

Let $H_k$ be the inverse of $B_k$; then the inverse update formula of (4) is

\[ H_{k+1}=H_k-\frac{s_k y_k^T H_k+H_k y_k s_k^T}{y_k^T s_k}+\Big(1+\frac{y_k^T H_k y_k}{y_k^T s_k}\Big)\frac{s_k s_k^T}{y_k^T s_k}, \tag{5} \]

which is the dual form of the DFP update formula in the sense that $H_k\leftrightarrow B_k$, $H_{k+1}\leftrightarrow B_{k+1}$, and $s_k\leftrightarrow y_k$. It has been shown that the BFGS method is, from the computational point of view, the most effective of the quasi-Newton methods. The convergence of these methods and its characterizations for convex minimization have been studied in [23-27], and great efforts have been made to find a quasi-Newton method which not only possesses global convergence but is also computationally superior to the BFGS method [15-17,20,28-31]. For general functions, it is now known that the BFGS method may fail for non-convex functions with inexact line search [32], and Mascarenhas [33] showed the nonconvergence of the standard BFGS method even with exact line search. In order to obtain global convergence of the BFGS method without a convexity assumption on the objective function, Li and Fukushima [15,16] made a slight modification to the standard BFGS method. We state their work [15] briefly: they proposed a new quasi-Newton equation $B_{k+1}s_k=y_k^1$, where $y_k^1=y_k+t_k\|\nabla f(x_k)\|s_k$ and $t_k>0$ is determined by

\[ t_k=1+\max\Big\{-\frac{y_k^T s_k}{\|s_k\|^2},\,0\Big\}. \]

Under appropriate conditions, these two methods [15,16] are globally and superlinearly convergent for nonconvex minimization problems.

In order to get a better approximation of the Hessian matrix of the objective function, Wei, Yu, Yuan, and Lian (see [18]) also proposed a new quasi-Newton equation

\[ B_{k+1}^2 s_k=y_k^2,\qquad y_k^2=y_k+A_k s_k, \]

where

\[ A_k=\frac{2[f(x_k)-f(x_{k+1})]+[\nabla f(x_{k+1})+\nabla f(x_k)]^T s_k}{\|s_k\|^2}. \]

Then the new BFGS update formula is

\[ B_{k+1}^2=B_k^2-\frac{B_k^2 s_k s_k^T B_k^2}{s_k^T B_k^2 s_k}+\frac{y_k^2 (y_k^2)^T}{(y_k^2)^T s_k}. \tag{6} \]

Note that this quasi-Newton formula (6) contains both gradient and function value information at the current and the previous step. This modified BFGS update formula differs from the standard BFGS update, and a higher order approximation of $\nabla^2 f(x)$ can be obtained (see [18,20]).

It is well known that positive definiteness of the matrices $B_k$ is very important for convergence [24,25]. It is not difficult to see that the condition $(y_k^2)^T s_k>0$ ensures that the update matrix $B_{k+1}^2$ from (6) inherits the positive definiteness of $B_k^2$. However, this condition can be guaranteed only when the objective function is uniformly convex. If $f$ is a general convex function, then $(y_k^2)^T s_k$ and $y_k^T s_k$ may equal 0, and in this case the positive definiteness of the update matrix cannot be ensured. We conclude that, for general convex functions, the positive definiteness of the update matrices $B_k$ generated by (4) and (6) cannot be guaranteed.
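To make (6) concrete, here is a minimal NumPy sketch (our illustration, not code from [18]; all names are ours) that computes $A_k$ from two iterates of a toy convex function and performs one update of the form (6):

```python
import numpy as np

def bfgs_update_6(B, x_prev, x_new, f, grad):
    """One update of form (6): uses y2 = y + A*s, where the scalar A
    blends function values into the quasi-Newton equation."""
    s = x_new - x_prev
    y = grad(x_new) - grad(x_prev)
    # A_k = (2[f(x_k) - f(x_{k+1})] + [g(x_{k+1}) + g(x_k)]^T s_k) / ||s_k||^2
    A = (2.0 * (f(x_prev) - f(x_new))
         + (grad(x_new) + grad(x_prev)) @ s) / (s @ s)
    y2 = y + A * s
    Bs = B @ s
    return B - np.outer(Bs, Bs) / (s @ Bs) + np.outer(y2, y2) / (y2 @ s)

# a toy uniformly convex function, for which y2^T s > 0 is guaranteed
f = lambda x: 0.5 * x @ x + 0.25 * (x @ x) ** 2
grad = lambda x: x + (x @ x) * x
B = np.eye(2)
B = bfgs_update_6(B, np.array([1.0, 0.5]), np.array([0.6, 0.2]), f, grad)
print(np.linalg.eigvalsh(B))  # eigenvalues stay positive here
```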
In order to obtain positive definiteness of the update matrix for general convex functions, based on the definitions of $y_k^2$ and $A_k$, Yuan and Wei [21] gave a modified BFGS update, namely

\[ B_{k+1}^3=B_k^3-\frac{B_k^3 s_k s_k^T B_k^3}{s_k^T B_k^3 s_k}+\frac{y_k^3 (y_k^3)^T}{(y_k^3)^T s_k}, \tag{7} \]

where $y_k^3=y_k+\bar A_k s_k$ and $\bar A_k=\max\{A_k,0\}$. The corresponding quasi-Newton equation is

\[ B_{k+1}^3 s_k=y_k^3, \tag{8} \]

and the condition $(y_k^3)^T s_k>0$ can be ensured for a general convex function $f$ (see [21] for details). Therefore, the update matrix $B_{k+1}^3$ from (7) inherits the positive definiteness of $B_k^3$ for general convex functions.

2.2. Limited Memory BFGS-Type Method

The limited memory BFGS (L-BFGS) method (see [22]) is an adaptation of the BFGS method for large-scale problems. In the L-BFGS method, the matrix $H_k$ is obtained by updating a basic matrix $H_0$ $\tilde m$ times ($\tilde m>0$) using the BFGS formula with the data of the previous $\tilde m$ iterations. The standard BFGS correction (5) has the form

\[ H_{k+1}=V_k^T H_k V_k+\rho_k s_k s_k^T, \tag{9} \]

where $\rho_k=1/(y_k^T s_k)$, $V_k=I-\rho_k y_k s_k^T$, and $I$ is the unit matrix. Thus, $H_{k+1}$ in the L-BFGS method has the following form:

\[
\begin{aligned}
H_{k+1}&=V_k^T H_k V_k+\rho_k s_k s_k^T\\
&=V_k^T\big[V_{k-1}^T H_{k-1} V_{k-1}+\rho_{k-1}s_{k-1}s_{k-1}^T\big]V_k+\rho_k s_k s_k^T\\
&=\cdots\\
&=\big[V_k^T\cdots V_{k-\tilde m+1}^T\big]H_{k-\tilde m+1}\big[V_{k-\tilde m+1}\cdots V_k\big]\\
&\quad+\rho_{k-\tilde m+1}\big[V_k^T\cdots V_{k-\tilde m+2}^T\big]s_{k-\tilde m+1}s_{k-\tilde m+1}^T\big[V_{k-\tilde m+2}\cdots V_k\big]\\
&\quad+\cdots+\rho_k s_k s_k^T.
\end{aligned}\tag{10}
\]
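In practice the product form (10) is not multiplied out; the matrix-vector product $H_{k+1}v$ is computed directly from the stored pairs. The following sketch (ours) uses the standard two-loop recursion, which is equivalent to expanding (10) under the assumption $H_{k-\tilde m+1}=\gamma I$:

```python
import numpy as np

def lbfgs_apply(v, pairs, gamma=1.0):
    """Compute H_{k+1} @ v, with H_{k+1} defined by (10).

    pairs: list of the last m~ corrections (s_i, y_i), oldest first.
    gamma: scaling of the basic matrix H_{k-m~+1} = gamma * I (assumption).
    """
    q = v.copy()
    alphas = []
    for s, y in reversed(pairs):                      # newest to oldest
        rho = 1.0 / (y @ s)
        a = rho * (s @ q)
        alphas.append(a)
        q -= a * y
    r = gamma * q                                     # r = H_0 q
    for (s, y), a in zip(pairs, reversed(alphas)):    # oldest to newest
        rho = 1.0 / (y @ s)
        b = rho * (y @ r)
        r += (a - b) * s
    return r                                          # r = H_{k+1} v
```

Only the $\tilde m$ pairs $(s_i,y_i)$ are stored, which is what makes the method suitable for large-scale problems.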
3. Modified SQP Method

In this section, we state the normal SQP method and the modified limited memory SQP method, respectively.

3.1. Normal SQP Method

The first-order Kuhn-Tucker conditions of (1) are

\[ \nabla f(x)-\nabla g(x)\lambda-\nabla h(x)\mu=0,\qquad g_j(x)\ge 0,\ \lambda_j\ge 0,\ \lambda_j g_j(x)=0\ \text{for }j\in I,\qquad h(x)=0. \tag{11} \]

The system (11) can be represented by the following system:

\[ H(z)=0, \tag{12} \]

where $z=(x,\lambda,\mu)$ and $H:R^{n+l+m}\to R^{n+l+m}$ is defined by

\[ H(z)=\begin{pmatrix}\nabla f(x)-\nabla g(x)\lambda-\nabla h(x)\mu\\ \min\{g(x),\lambda\}\\ h(x)\end{pmatrix}, \tag{13} \]

the minimum being taken componentwise. Since $\nabla f$, $\nabla g$, and $\nabla h$ are continuous, $H(z)$ is locally Lipschitz continuous, and for every $d\in R^{n+l+m}$ the directional derivative $H'(z;d)$ of $H$ exists. Denote the index sets

\[ \gamma(z)=\{i\mid\lambda_i>g_i(x)\} \tag{14} \]

and

\[ \beta(z)=\{i\mid\lambda_i\le g_i(x)\}. \tag{15} \]

Under the complementarity condition, $\gamma(z)$ is clearly an index set of strongly active inequality constraints, and $\beta(z)$ is an index set of weakly active and inactive inequality constraints. In terms of these sets, the directional derivative along the direction $d=(d_x,d_\lambda,d_\mu)$ is given by

\[ H'(z;d)=\begin{pmatrix}Gd\\ \big(\nabla g_i(x)^T d_x\big)_{i\in\gamma(z)}\\ \big(\min\{\nabla g_i(x)^T d_x,\,d_{\lambda_i}\}\big)_{i\in\beta(z)}\\ \nabla h(x)^T d_x\end{pmatrix}, \tag{16} \]

where $G$ is the matrix whose elements are the partial derivatives of $\nabla_x L(z)$ with respect to $x$, $\lambda$, and $\mu$, respectively. If $\min\{\nabla g_i(x)^T d_x,\,d_{\lambda_i}\}=d_{\lambda_i}$ holds for $i\in\beta(z)$, then $H'(z;\cdot)$ is represented by the matrix

\[ W(z)=\begin{pmatrix}V&-\nabla g_{\gamma}(x)&-\nabla g_{\beta}(x)&-\nabla h(x)\\ \nabla g_{\gamma}(x)^T&0&0&0\\ 0&0&I&0\\ \nabla h(x)^T&0&0&0\end{pmatrix}. \tag{17} \]

By (33) in [6], we know that the system

\[ W_k d_k=-H(z_k), \tag{18} \]

where $d_k=(d_x^k,d_\lambda^k,d_\mu^k)$ and $W_k=W(z_k)$, defines the Kuhn-Tucker conditions of problem (1), and it also defines the Kuhn-Tucker conditions of the following quadratic programming problem $QP(z_k,V_k)$:

\[
\begin{aligned}
\min\ &\nabla f(x_k)^T s+\tfrac12 s^T V_k s\\
\text{s.t. }&g_i(x_k)+\nabla g_i(x_k)^T s=0,\quad i\in\gamma(z_k),\\
&g_i(x_k)+\nabla g_i(x_k)^T s\ge 0,\quad i\in\beta(z_k),\\
&h(x_k)+\nabla h(x_k)^T s=0,
\end{aligned}\tag{19}
\]

where $s=x-x_k$ and $V_k=\nabla^2_{xx}L(z_k)$.

Generally, suppose that $B_k^1$ is an estimate of $V_k$; $B_k^1$ can be updated by the BFGS quasi-Newton formula

\[ B_{k+1}^1=B_k^1-\frac{B_k^1 s_k s_k^T B_k^1}{s_k^T B_k^1 s_k}+\frac{y_k y_k^T}{y_k^T s_k}, \tag{20} \]

where $s_k=x_{k+1}-x_k$, $y_k=\nabla_x L(z_{k+1})-\nabla_x L(z_k)$, $z_{k+1}=(x_{k+1},\lambda_{k+1},\mu_{k+1})$, and $z_k=(x_k,\lambda_k,\mu_k)$; $\lambda_k$ and $\mu_k$ are the multipliers associated with the Lagrangian function at $x_k$, while $\lambda_{k+1}$ and $\mu_{k+1}$ are those associated with the Lagrangian function at $x_{k+1}$. In particular, when we use the update formula (20) in (19), the above quadratic programming problem can be written as $QP(z_k,B_k^1)$:

\[
\begin{aligned}
\min\ &\nabla f(x_k)^T s+\tfrac12 s^T B_k^1 s\\
\text{s.t. }&g_i(x_k)+\nabla g_i(x_k)^T s=0,\quad i\in\gamma(z_k),\\
&g_i(x_k)+\nabla g_i(x_k)^T s\ge 0,\quad i\in\beta(z_k),\\
&h(x_k)+\nabla h(x_k)^T s=0.
\end{aligned}\tag{21}
\]

Suppose that $(s,\lambda,\mu)$ is a Kuhn-Tucker triple of the subproblem $QP(z_k,B_k^1)$; then obviously $s=0$ if $(x_k,\lambda,\mu)$ is a Kuhn-Tucker triple of (1).

3.2. Modified Limited Memory SQP Method

The normal limited memory BFGS formula of the quasi-Newton-SQP method with $H_k$ for constrained optimization is defined, as in (10), by

\[ H_{k+1}=\big[V_k^T\cdots V_{k-\tilde m+1}^T\big]H_{k-\tilde m+1}\big[V_{k-\tilde m+1}\cdots V_k\big]+\rho_{k-\tilde m+1}\big[V_k^T\cdots V_{k-\tilde m+2}^T\big]s_{k-\tilde m+1}s_{k-\tilde m+1}^T\big[V_{k-\tilde m+2}\cdots V_k\big]+\cdots+\rho_k s_k s_k^T, \tag{22} \]

where $\rho_k=1/(y_k^T s_k)$, $V_k=I-\rho_k y_k s_k^T$, and $I$ is the unit matrix. To maintain the positive definiteness of the limited memory BFGS matrix, some researchers suggested discarding the correction $\{s_k,y_k\}$ if $y_k^T s_k>0$ does not hold (e.g. [34]). Another technique was proposed by Powell [35], in which $y_k$ is replaced by

\[ y_k^*=\begin{cases}y_k,&\text{if }y_k^T s_k\ge 0.2\,s_k^T B_k s_k,\\ \theta_k y_k+(1-\theta_k)B_k s_k,&\text{otherwise},\end{cases}
\qquad\text{where }\theta_k=\frac{0.8\,s_k^T B_k s_k}{s_k^T B_k s_k-y_k^T s_k} \]

and $B_k=H_k^{-1}$ in (22). However, if the Lagrangian function $L(x,\lambda,\mu)$ is a general convex function, then $y_k^T s_k$ may equal 0, and the positive definiteness of the update matrix $H_k$ of (22) cannot be ensured.

Does there exist a limited memory SQP method which ensures that the update matrices are positive definite for a general convex Lagrangian function $L(x,\lambda,\mu)$? This paper gives a positive answer. Let

\[ \tilde A_k=\frac{2[L(z_k)-L(z_{k+1})]+[\nabla_x L(z_{k+1})+\nabla_x L(z_k)]^T s_k}{\|s_k\|^2}. \]

Considering the discussion of the above section, we examine $\tilde A_k$ for a general convex Lagrangian function in the following cases to state our motivation.

Case i: If $\tilde A_k\ge 0$, we have

\[ \tilde y_k^T s_k=(y_k+\tilde A_k s_k)^T s_k=y_k^T s_k+\tilde A_k\|s_k\|^2\ge y_k^T s_k\ge 0. \tag{23} \]

Case ii: If $\tilde A_k<0$, we get, using the convexity of $L$,

\[ 0>\tilde A_k=\frac{2[L(z_k)-L(z_{k+1})]+[\nabla_x L(z_{k+1})+\nabla_x L(z_k)]^T s_k}{\|s_k\|^2}\ge\frac{-2\nabla_x L(z_{k+1})^T s_k+[\nabla_x L(z_{k+1})+\nabla_x L(z_k)]^T s_k}{\|s_k\|^2}=-\frac{y_k^T s_k}{\|s_k\|^2}, \tag{24} \]

which means that $y_k^T s_k>0$ holds. Then we present our modified limited memory SQP formula

\[ H_{k+1}=\big[\tilde V_k^T\cdots \tilde V_{k-\tilde m+1}^T\big]H_{k-\tilde m+1}\big[\tilde V_{k-\tilde m+1}\cdots \tilde V_k\big]+\tilde\rho_{k-\tilde m+1}\big[\tilde V_k^T\cdots \tilde V_{k-\tilde m+2}^T\big]s_{k-\tilde m+1}s_{k-\tilde m+1}^T\big[\tilde V_{k-\tilde m+2}\cdots \tilde V_k\big]+\cdots+\tilde\rho_k s_k s_k^T, \tag{25} \]

where $\tilde\rho_k=1/(\tilde y_k^T s_k)$, $\tilde V_k=I-\tilde\rho_k\tilde y_k s_k^T$, and $\tilde y_k=y_k+\max\{\tilde A_k,0\}s_k$. It is not difficult to see that the modified limited memory SQP formula (25) contains both the gradient and the function value information of the Lagrangian function at the current and the previous step whenever $\tilde A_k>0$ holds.
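A minimal sketch (ours; `L` and `grad_x_L` are placeholder callables for the Lagrangian and its $x$-gradient) of how the correction pair used in (25) can be assembled from two consecutive primal-dual iterates:

```python
import numpy as np

def modified_pair(z_prev, z_new, L, grad_x_L):
    """Return (s_k, y~_k) for formula (25), z = (x, lam, mu).

    y~_k = y_k + max{A~_k, 0} s_k, where A~_k blends Lagrangian
    values into the correction, as in Section 3.2.
    """
    x_prev, x_new = z_prev[0], z_new[0]
    s = x_new - x_prev
    y = grad_x_L(z_new) - grad_x_L(z_prev)
    A = (2.0 * (L(z_prev) - L(z_new))
         + (grad_x_L(z_new) + grad_x_L(z_prev)) @ s) / (s @ s)
    # For convex L, y~^T s >= y^T s >= 0 (cases i and ii above),
    # which is what keeps the update (25) positive definite.
    return s, y + max(A, 0.0) * s

# demo with a convex stand-in: this L depends only on x
L = lambda z: float(np.sum(np.exp(z[0])))
gL = lambda z: np.exp(z[0])
s, y_t = modified_pair((np.array([0.0, 1.0]), None, None),
                       (np.array([0.2, 0.7]), None, None), L, gL)
print(y_t @ s > 0)  # True
```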
Let $\tilde B_k$ be the inverse of $\tilde H_k$. More generally, suppose that $\tilde B_k$ is an estimate of $V_k$. Then the above quadratic programming problem (19) can be written as $QP(z_k,\tilde B_k)$:

\[
\begin{aligned}
\min\ &\nabla f(x_k)^T s+\tfrac12 s^T \tilde B_k s\\
\text{s.t. }&g_i(x_k)+\nabla g_i(x_k)^T s=0,\quad i\in\gamma(z_k),\\
&g_i(x_k)+\nabla g_i(x_k)^T s\ge 0,\quad i\in\beta(z_k),\\
&h(x_k)+\nabla h(x_k)^T s=0.
\end{aligned}\tag{26}
\]

Suppose that $(s,\lambda,\mu)$ is a Kuhn-Tucker triple of the subproblem $QP(z_k,\tilde B_k)$; then obviously $s=0$ if $(x_k,\lambda,\mu)$ is a Kuhn-Tucker triple of (1).

Now we state our algorithm as follows.

Modified limited memory SQP algorithm 1 for (1) (M-L-SQP-A1)

Step 0: Start with an initial point $z_0=(x_0,\lambda_0,\mu_0)$ and an estimate $\tilde H_0$ of $V_0^{-1}=[\nabla^2_{xx}L(z_0)]^{-1}$, where $\tilde H_0$ is a symmetric and positive definite matrix; choose constants $0<\delta<\sigma<1$ and a positive integer $m_0$. Set $k:=0$.

Step 1: For given $z_k$ and $\tilde H_k$, solve the subproblem

\[
\begin{aligned}
\min\ &\nabla f(x_k)^T s+\tfrac12 s^T \tilde H_k^{-1} s\\
\text{s.t. }&g_i(x_k)+\nabla g_i(x_k)^T s=0,\quad i\in\gamma(z_k),\\
&g_i(x_k)+\nabla g_i(x_k)^T s\ge 0,\quad i\in\beta(z_k),\\
&h(x_k)+\nabla h(x_k)^T s=0,
\end{aligned}\tag{27}
\]

and obtain the unique optimal solution $d_k$.

Step 2: Choose $\alpha_k$ by the modified weak Wolfe-Powell (MWWP) step-size rule

\[ L(z_k+\alpha_k d_k)\le L(z_k)+\delta\alpha_k\nabla_x L(z_k)^T d_k \tag{28} \]

and

\[ \nabla_x L(z_k+\alpha_k d_k)^T d_k\ge\sigma\nabla_x L(z_k)^T d_k; \tag{29} \]

then let $x_{k+1}=x_k+\alpha_k d_k$.

Step 3: If $z_{k+1}$ satisfies a prescribed termination criterion for (12), stop; otherwise, go to Step 4.

Step 4: Let $\tilde m=\min\{k+1,m_0\}$. Update $\tilde H_0$ $\tilde m$ times by formula (25) to get $\tilde H_{k+1}$.

Step 5: Set $k:=k+1$ and go to Step 1.

Clearly, the above algorithm is as simple as the standard limited memory SQP method from the storage and cost point of view at each iteration.

In the following, we assume that the algorithm updates $\tilde B_k$, the inverse of $\tilde H_k$. M-L-SQP-A1 with the Hessian approximation $\tilde B_k$ can be stated as follows.

Modified limited memory SQP algorithm 2 for (1) (M-L-SQP-A2)

Step 0: Start with an initial point $z_0=(x_0,\lambda_0,\mu_0)$ and an estimate $\tilde B_0$ of $V_0=\nabla^2_{xx}L(z_0)$, where $\tilde B_0$ is a symmetric and positive definite matrix; choose constants $0<\delta<\sigma<1$ and a positive integer $m_0$. Set $k:=0$.

Step 1: For given $z_k$ and $\tilde B_k$, solve the subproblem $QP(z_k,\tilde B_k)$ and obtain the unique optimal solution $d_k$.

Step 2: Let $\tilde m=\min\{k+1,m_0\}$. Update $\tilde B_k$ with the triples $\{s_i,y_i,\tilde A_i\}_{i=k-\tilde m+1}^{k}$, i.e., for $l=k-\tilde m+1,\dots,k$, compute

\[ B_k^{l+1}=B_k^l-\frac{B_k^l s_l s_l^T B_k^l}{s_l^T B_k^l s_l}+\frac{\tilde y_l\tilde y_l^T}{\tilde y_l^T s_l}, \tag{30} \]

where $s_l=x_{l+1}-x_l$, $\tilde y_l=y_l+\max\{\tilde A_l,0\}s_l$, and $B_k^{k-\tilde m+1}=\tilde B_0$ for all $k$; the new approximation is $\tilde B_{k+1}=B_k^{k+1}$.

Note that M-L-SQP-A1 and M-L-SQP-A2 are mathematically equivalent. In the next section, we will establish the global convergence of M-L-SQP-A2.
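Step 2 of M-L-SQP-A2 can be sketched as follows (our illustration, assuming dense linear algebra and triples built as in Section 3.2):

```python
import numpy as np

def rebuild_B(B0, triples):
    """Step 2 of M-L-SQP-A2: rebuild the Hessian approximation from B~_0.

    triples: the last m~ pairs (s_l, y~_l), oldest first, with
    y~_l = y_l + max{A~_l, 0} s_l already formed.
    """
    B = B0.copy()
    for s, y_tilde in triples:             # l = k - m~ + 1, ..., k
        Bs = B @ s
        B = (B - np.outer(Bs, Bs) / (s @ Bs)
               + np.outer(y_tilde, y_tilde) / (y_tilde @ s))
    return B                               # this is B~_{k+1}
```

Because every pass restarts from $\tilde B_0$, only the $\tilde m$ most recent triples need to be kept, which is the point of the limited memory variant.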
4. Convergence Analysis of M-L-SQP-A2

Let $x^*$ be a local optimal solution and $z^*=(x^*,\lambda^*,\mu^*)$ be the corresponding Kuhn-Tucker triple of problem (1). In order to get the global convergence of M-L-SQP-A2, the following assumptions are needed.

Assumption A. 1) $f$, $h_i$, and $g_i$ are twice continuously differentiable functions for all $x\in S$, and $S$ is bounded.

2) The gradients $\{\nabla h_i(x)\ (i\in E)\}\cup\{\nabla g_j(x)\ (j\in I)\}$ are positively linearly independent.

3) (Strict complementarity) $\lambda_j^*>0$ for $j\in I^*$.

4) $s^T V^* s>0$ for all $s\neq 0$ with $\nabla h_i(x^*)^T s=0$, $i\in E$, and $\nabla g_j(x^*)^T s=0$, $j\in I^*$, where $V^*=\nabla^2_{xx}L(z^*)$.

5) $\{z_k\}$ converges to $z^*$, where $\nabla_x L(z^*)=0$.

6) The Lagrangian function $L(z)$ is convex for all $z\in S$.

Assumption A implies that there exists a constant $H>0$ such that

\[ \|V\|\le H,\qquad z\in S. \tag{31} \]

Due to the strict complementarity Assumption A 3), in a neighborhood of $z^*$ the subproblem (26) is equivalent to the following equality constrained quadratic programming problem:

\[
\begin{aligned}
\min\ &\nabla f(x_k)^T s+\tfrac12 s^T \tilde B_k s\\
\text{s.t. }&g(x_k)+\nabla g(x_k)^T s=0,\\
&h(x_k)+\nabla h(x_k)^T s=0.
\end{aligned}\tag{32}
\]

Without loss of generality for the local convergence analysis, we may assume that only active constraints are present in (1). Then (18) becomes the following system, with $\tilde B_k$ instead of $V_k$:

\[ \begin{pmatrix}\tilde B_k&-\nabla g(x_k)&-\nabla h(x_k)\\ \nabla g(x_k)^T&0&0\\ \nabla h(x_k)^T&0&0\end{pmatrix}\begin{pmatrix}d_x^k\\ d_\lambda^k\\ d_\mu^k\end{pmatrix}=-\begin{pmatrix}\nabla_x L(z_k)\\ g(x_k)\\ h(x_k)\end{pmatrix}=-H(z_k). \tag{33} \]

In the case of considering only active constraints, we can write

\[ W_k=\begin{pmatrix}V_k&-\nabla g(x_k)&-\nabla h(x_k)\\ \nabla g(x_k)^T&0&0\\ \nabla h(x_k)^T&0&0\end{pmatrix} \tag{34} \]

and

\[ D_k=\begin{pmatrix}\tilde B_k&-\nabla g(x_k)&-\nabla h(x_k)\\ \nabla g(x_k)^T&0&0\\ \nabla h(x_k)^T&0&0\end{pmatrix}; \tag{35} \]

when $\tilde B_k$ is close to $V_k$, $D_k$ is close to $W_k$.

Lemma 4.1 Let Assumption A hold. Then there exists a positive number $M_1$ such that

\[ \frac{\|\tilde y_k\|^2}{\tilde y_k^T s_k}\le M_1,\qquad k=0,1,2,\dots. \]

Proof. By Assumption A, there exists a positive number $M_0$ such that (see [36])

\[ \frac{\|y_k\|^2}{y_k^T s_k}\le M_0,\qquad k\ge 0. \tag{36} \]

Since the function $L(z)$ is convex, we have $L(z_k)-L(z_{k+1})\ge-\nabla_x L(z_{k+1})^T s_k$ and $L(z_{k+1})-L(z_k)\ge\nabla_x L(z_k)^T s_k$; these two inequalities, together with the definition of $\tilde A_k$, imply that

\[ |\tilde A_k|\le\frac{|y_k^T s_k|}{\|s_k\|^2}. \tag{37} \]

Using the definition of $\tilde y_k$, we get

\[ \tilde y_k^T s_k=y_k^T s_k+\max\{\tilde A_k,0\}\|s_k\|^2\ge y_k^T s_k \tag{38} \]

and

\[ \|\tilde y_k\|\le\|y_k\|+\max\{\tilde A_k,0\}\|s_k\|\le 2\|y_k\|, \tag{39} \]

where the second inequality of (39) follows from (37). Combining (38), (39), and (36), we obtain

\[ \frac{\|\tilde y_k\|^2}{\tilde y_k^T s_k}\le\frac{4\|y_k\|^2}{y_k^T s_k}\le 4M_0. \]

Let $M_1=4M_0$; we get the conclusion of this lemma. The proof is complete.

Lemma 4.2 Let $\tilde B_k$ be generated by (30). Then we have

\[ \det(\tilde B_{k+1})=\det(\tilde B_0)\prod_{l=k-\tilde m+1}^{k}\frac{\tilde y_l^T s_l}{s_l^T B_k^l s_l}, \tag{40} \]

where $\det(\tilde B_k)$ denotes the determinant of $\tilde B_k$.

Proof. To begin with, we take the determinant on both sides of a BFGS-type update of the form (20):

\[
\begin{aligned}
\det(B_{k+1})&=\det\Big(B_k\Big(I-\frac{s_k s_k^T B_k}{s_k^T B_k s_k}+\frac{B_k^{-1}y_k y_k^T}{y_k^T s_k}\Big)\Big)\\
&=\det(B_k)\Big[\Big(1-\frac{s_k^T B_k s_k}{s_k^T B_k s_k}\Big)\Big(1+\frac{y_k^T B_k^{-1}y_k}{y_k^T s_k}\Big)+\frac{(s_k^T y_k)^2}{(s_k^T B_k s_k)(y_k^T s_k)}\Big]\\
&=\det(B_k)\frac{y_k^T s_k}{s_k^T B_k s_k},
\end{aligned}
\]

where the second equality follows from the formula (see, e.g., [37], Lemma 7.6)

\[ \det(I+u_1u_2^T+u_3u_4^T)=(1+u_1^Tu_2)(1+u_3^Tu_4)-(u_1^Tu_4)(u_2^Tu_3). \]

Therefore, there is also a simple expression for the determinant of (30):

\[ \det(B_k^{l+1})=\det(B_k^l)\frac{\tilde y_l^T s_l}{s_l^T B_k^l s_l}; \]

applying it repeatedly for $l=k-\tilde m+1,\dots,k$, together with $B_k^{k-\tilde m+1}=\tilde B_0$, yields (40). Then we complete the proof.

Lemma 4.3 Let Assumption A hold. Then there exists a positive constant $\eta_1$ such that

\[ \|s_k\|\ge\eta_1\varepsilon_k,\qquad\text{where }\varepsilon_k=\frac{-\nabla_x L(z_k)^T d_k}{\|d_k\|}. \]

Proof. By Assumption A and (31), we have

\[ [\nabla_x L(z_{k+1})-\nabla_x L(z_k)]^T d_k=\alpha_k d_k^T\Big(\int_0^1 V(z_k+t\alpha_k d_k)\,dt\Big)d_k\le\alpha_k H\|d_k\|^2. \]

On the other hand, using (29), we get

\[ [\nabla_x L(z_{k+1})-\nabla_x L(z_k)]^T d_k\ge-(1-\sigma)\nabla_x L(z_k)^T d_k=(1-\sigma)\varepsilon_k\|d_k\|. \]

Therefore $\|s_k\|=\alpha_k\|d_k\|\ge\frac{1-\sigma}{H}\varepsilon_k$; let $\eta_1=\frac{1-\sigma}{H}$. The proof is complete.

Using Assumption A, it is not difficult to get the following lemma.

Lemma 4.4 Let Assumption A hold. Then the sequence $\{L(z_k)\}$ monotonically decreases, and $z_k\in S$ for all $k\ge 0$. Moreover,

\[ \sum_{k=0}^{\infty}\alpha_k\big(-\nabla_x L(z_k)^T d_k\big)<+\infty. \]
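Both the bound of Lemma 4.1 and the determinant identity behind (40) in Lemma 4.2 are easy to check numerically; the following sketch (ours, using a smooth convex stand-in for the Lagrangian) does so:

```python
import numpy as np

rng = np.random.default_rng(1)
L = lambda x: np.sum(np.exp(x))          # convex stand-in for L(z)
g = lambda x: np.exp(x)                  # its gradient

B, x0 = np.eye(3), rng.normal(size=3)
for _ in range(20):
    x1 = x0 + 0.1 * rng.normal(size=3)
    s, y = x1 - x0, g(x1) - g(x0)
    A = (2 * (L(x0) - L(x1)) + (g(x1) + g(x0)) @ s) / (s @ s)
    y_t = y + max(A, 0.0) * s
    # Lemma 4.1: ||y~||^2 / (y~^T s) <= 4 ||y||^2 / (y^T s)
    assert (y_t @ y_t) / (y_t @ s) <= 4 * (y @ y) / (y @ s) + 1e-9
    Bs = B @ s
    B_new = B - np.outer(Bs, Bs) / (s @ Bs) + np.outer(y_t, y_t) / (y_t @ s)
    # Lemma 4.2: det(B+) = det(B) * (y~^T s) / (s^T B s)
    assert np.isclose(np.linalg.det(B_new),
                      np.linalg.det(B) * (y_t @ s) / (s @ Bs))
    B, x0 = B_new, x1
print("all checks passed")
```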
Similar to Lemma 2.6 in [38], it is not difficult to get the following lemma; we also give the proof process.

Lemma 4.5 If the sequence of nonnegative numbers $m_k$ ($k=0,1,2,\dots$) satisfies

\[ \prod_{j=0}^{k}m_j\ge c_1^{k+1},\qquad k=0,1,2,\dots, \tag{41} \]

for some constant $c_1>0$, then $\limsup_{k\to\infty}m_k>0$.

Proof. We obtain this result by contradiction. Assume that $\limsup_{k\to\infty}m_k=0$. Then, for any $\varepsilon_1$ with $0<\varepsilon_1<c_1$, there exists $k_1>0$ such that $m_k<\varepsilon_1$ for all $k\ge k_1$. Hence, for all $k>k_1$,

\[ c_1^{k+1}\le\prod_{j=0}^{k}m_j=\Big(\prod_{j=0}^{k_1-1}m_j\Big)\prod_{j=k_1}^{k}m_j\le\Big(\prod_{j=0}^{k_1-1}m_j\Big)\varepsilon_1^{\,k-k_1+1}, \]

so that

\[ \Big(\frac{c_1}{\varepsilon_1}\Big)^{k+1}\le\Big(\prod_{j=0}^{k_1-1}m_j\Big)\varepsilon_1^{-k_1}. \]

Since $c_1/\varepsilon_1>1$, the left-hand side tends to $+\infty$ as $k\to\infty$, while the right-hand side is a constant, which is a contradiction. Thus $\limsup_{k\to\infty}m_k>0$.
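A tiny numerical illustration (ours) of Lemma 4.5: for $m_j=1/(j+1)$, which tends to zero, condition (41) must eventually fail for any $c_1>0$.

```python
import math

c1 = 0.1
for k in range(60):
    prod = 1.0 / math.factorial(k + 1)   # prod_{j=0}^{k} 1/(j+1)
    if prod < c1 ** (k + 1):
        print("condition (41) first fails at k =", k)  # k = 24 here
        break
```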
Lemma 4.6 Let $\{x_k\}$ be generated by M-L-SQP-A2 and let Assumption A hold. If $\liminf_{k\to\infty}\|\nabla_x L(z_k)\|>0$, then there exists a constant $\varepsilon^*>0$ such that

\[ \prod_{j=0}^{k}\varepsilon_j\ge(\varepsilon^*)^{k+1}\qquad\text{for all }k\ge 0. \]

Proof. Assume that $\liminf_{k\to\infty}\|\nabla_x L(z_k)\|>0$, i.e., there exists a constant $c_2>0$ such that

\[ \|\nabla_x L(z_k)\|\ge c_2,\qquad k=0,1,2,\dots. \tag{42} \]

First we prove that the update matrix $\tilde B_{k+1}$ is always generated by the update formula (30), i.e., that $\tilde B_{k+1}$ inherits the positive definiteness of $\tilde B_k$, or equivalently that $\tilde y_k^T s_k>0$ always holds. For $k=0$, this conclusion holds at hand. For all $k\ge 1$, assume that $\tilde B_k$ is positive definite; we deduce that $\tilde y_k^T s_k>0$ from the following three cases.

Case 1. $\tilde A_k>0$. By the definition of $\tilde y_k$ and Assumption A, we have

\[ \tilde y_k^T s_k=y_k^T s_k+\max\{\tilde A_k,0\}\|s_k\|^2>0. \]

Case 2. $\tilde A_k<0$. By the definition of $\tilde y_k$, (24), and Assumption A, we get $\tilde y_k^T s_k=y_k^T s_k>0$.

Case 3. $\tilde A_k=0$. By the definition of $\tilde y_k$, (29), Assumption A, $d_k=-\tilde B_k^{-1}\nabla_x L(z_k)$, and the positive definiteness of $\tilde B_k$, we obtain

\[ \tilde y_k^T s_k=y_k^T s_k\ge-(1-\sigma)\alpha_k\nabla_x L(z_k)^T d_k=(1-\sigma)\alpha_k d_k^T\tilde B_k d_k>0. \]

So we have $\tilde y_k^T s_k>0$, and $\tilde B_{k+1}$ is generated by the update formula (30). Thus, the update matrix $\tilde B_{k+1}$ is always generated by the update formula (30).

Taking the trace on both sides of (30), we get

\[ \mathrm{Tr}(\tilde B_{k+1})=\mathrm{Tr}\big(B_k^{k-\tilde m+1}\big)-\sum_{l=k-\tilde m+1}^{k}\frac{\|B_k^l s_l\|^2}{s_l^T B_k^l s_l}+\sum_{l=k-\tilde m+1}^{k}\frac{\|\tilde y_l\|^2}{\tilde y_l^T s_l}, \tag{43} \]

where $\mathrm{Tr}(\tilde B_k)$ denotes the trace of $\tilde B_k$. Repeating this trace operation, we have

\[ \mathrm{Tr}(\tilde B_{k+1})\le\mathrm{Tr}(\tilde B_0)-\sum_{l=0}^{k}\frac{\|\tilde B_l s_l\|^2}{s_l^T\tilde B_l s_l}+\sum_{l=0}^{k}\frac{\|\tilde y_l\|^2}{\tilde y_l^T s_l}. \tag{44} \]

Combining (42), (44), $d_j=-\tilde B_j^{-1}\nabla_x L(z_j)=-\tilde H_j\nabla_x L(z_j)$, and Lemma 4.1, we obtain

\[ \mathrm{Tr}(\tilde B_{k+1})\le\mathrm{Tr}(\tilde B_0)-\sum_{j=0}^{k}\frac{c_2^2}{\nabla_x L(z_j)^T\tilde H_j\nabla_x L(z_j)}+(k+1)M_1. \tag{45} \]

Since $\tilde B_{k+1}$ is positive definite, $\mathrm{Tr}(\tilde B_{k+1})>0$. By (45), we obtain

\[ \sum_{j=0}^{k}\frac{c_2^2}{\nabla_x L(z_j)^T\tilde H_j\nabla_x L(z_j)}\le\mathrm{Tr}(\tilde B_0)+(k+1)M_1 \tag{46} \]

and

\[ \mathrm{Tr}(\tilde B_{k+1})\le\mathrm{Tr}(\tilde B_0)+(k+1)M_1. \tag{47} \]

By the geometric-arithmetic mean inequality, we get from (46)

\[ \prod_{j=0}^{k}\big(\nabla_x L(z_j)^T\tilde H_j\nabla_x L(z_j)\big)\ge\Big[\frac{(k+1)c_2^2}{\mathrm{Tr}(\tilde B_0)+(k+1)M_1}\Big]^{k+1}. \tag{48} \]

Using Lemma 4.2, (30), and (38), we have

\[ \det(\tilde B_{k+1})=\det(\tilde B_0)\prod_{l=k-\tilde m+1}^{k}\frac{\tilde y_l^T s_l}{s_l^T B_k^l s_l}\ge\det(\tilde B_0)\prod_{j=0}^{k}\frac{1}{\alpha_j}. \]

This implies

\[ \prod_{j=0}^{k}\alpha_j\ge\frac{\det(\tilde B_0)}{\det(\tilde B_{k+1})}. \tag{49} \]

By the geometric-arithmetic mean inequality again, we get

\[ \det(\tilde B_{k+1})\le\Big[\frac{\mathrm{Tr}(\tilde B_{k+1})}{n}\Big]^{n}. \tag{50} \]

Using (47), (49), and (50), we obtain

\[ \prod_{j=0}^{k}\alpha_j\ge\frac{n^n\det(\tilde B_0)}{[\mathrm{Tr}(\tilde B_0)+(k+1)M_1]^{n}}\ge c_3^{k+1}, \tag{51} \]

where

\[ c_3=\frac{1}{\exp(n)}\min\Big\{\frac{n^n\det(\tilde B_0)}{[\mathrm{Tr}(\tilde B_0)+M_1]^{n}},\,1\Big\}. \]

Let

\[ \cos\theta_j=\frac{-\nabla_x L(z_j)^T d_j}{\|\nabla_x L(z_j)\|\,\|d_j\|}. \]

Multiplying (48) by (51), for all $k\ge 0$ we get

\[ \prod_{j=0}^{k}\|s_j\|\,\|\nabla_x L(z_j)\|\cos\theta_j\ge c_3^{k+1}\Big[\frac{(k+1)c_2^2}{\mathrm{Tr}(\tilde B_0)+(k+1)M_1}\Big]^{k+1}\ge\Big[\frac{c_3c_2^2}{\mathrm{Tr}(\tilde B_0)+M_1}\Big]^{k+1}. \tag{52} \]

According to Lemma 4.4 and Assumption A, we know that there exists a constant $M_2>0$ such that

\[ \|s_k\|=\|x_{k+1}-x_k\|\le\|x_{k+1}\|+\|x_k\|\le 2M_2. \tag{53} \]

Combining the definition of $\varepsilon_k$ with (53), and noting that $\varepsilon_j=\|\nabla_x L(z_j)\|\cos\theta_j$, we get, for all $k\ge 0$,

\[ \prod_{j=0}^{k}\varepsilon_j\ge\Big[\frac{c_3c_2^2}{2M_2\big(\mathrm{Tr}(\tilde B_0)+M_1\big)}\Big]^{k+1}=(\varepsilon^*)^{k+1},\qquad\varepsilon^*=\frac{c_3c_2^2}{2M_2\big(\mathrm{Tr}(\tilde B_0)+M_1\big)}. \]

The proof is complete.

Now we establish the global convergence theorem for M-L-SQP-A2.

Theorem 4.1 Let Assumption A hold and let the sequence $\{z_k\}$ be generated by M-L-SQP-A2. Then we have

\[ \liminf_{k\to\infty}\|\nabla_x L(z_k)\|=0. \tag{54} \]

Proof. By Lemma 4.3 and (28), we get

\[ L(z_{k+1})\le L(z_k)-\delta\|s_k\|\varepsilon_k\le L(z_k)-\delta\eta_1\varepsilon_k^2. \tag{55} \]

By (55) and Lemma 4.4, we have $\sum_{k=0}^{\infty}\varepsilon_k^2<\infty$, which implies that

\[ \lim_{k\to\infty}\varepsilon_k=0. \tag{56} \]

Therefore, relation (54) follows from (56), Lemma 4.5, and Lemma 4.6 directly: if (54) failed, Lemmas 4.6 and 4.5 would give $\limsup_{k\to\infty}\varepsilon_k>0$, contradicting (56).

5. Conclusion

For further research, we should study the properties of the modified limited memory SQP method under weaker conditions. Moreover, numerical experiments on practical constrained problems should be carried out in the future.

6. References

[1] P. T. Boggs, J. W. Tolle and P. Wang, "On the Local Convergence of Quasi-Newton Methods for Constrained Optimization," SIAM Journal on Control and Optimization, Vol. 20, No. 2, 1982, pp. 161-171.

[2] F. H. Clarke, "Optimization and Nonsmooth Analysis," Wiley, New York, 1983.

[3] T. F. Coleman and A. R. Conn, "Nonlinear Programming via an Exact Penalty Function: Asymptotic Analysis," Mathematical Programming, Vol. 24, No. 1, 1982, pp. 123-136.

[4] M. Fukushima, "A Successive Quadratic Programming Algorithm with Global and Superlinear Convergence Properties," Mathematical Programming, Vol. 35, No. 3, 1986, pp. 253-264.

[5] M. J. D. Powell, "The Convergence of Variable Metric Methods for Nonlinearly Constrained Optimization Calculations," In: O. L. Mangasarian, R. R. Meyer and S. M. Robinson, Eds., Nonlinear Programming 3, Academic Press, New York, 1978, pp. 27-63.

[6] W. Sun, "Newton's Method and Quasi-Newton-SQP Method for General LC1 Constrained Optimization," Applied Mathematics and Computation, Vol. 92, No. 1, 1998, pp. 69-84.

[7] T. F. Coleman and A. R. Conn, "On the Local Convergence of a Quasi-Newton Method for the Nonlinear Programming Problem," SIAM Journal on Numerical Analysis, Vol. 21, No. 4, 1984, pp. 755-769.

[8] Y. Yuan and W. Sun, "Theory and Methods of Optimization," Science Press of China, Beijing, 1999.

[9] G. Yuan, "Modified Nonlinear Conjugate Gradient Methods with Sufficient Descent Property for Large-Scale Optimization Problems," Optimization Letters, Vol. 3, No. 1, 2009, pp. 11-21.

[10] G. L. Yuan and X. W. Lu, "A New Line Search Method with Trust Region for Unconstrained Optimization," Communications on Applied Nonlinear Analysis, Vol. 15, No. 1, 2008, pp. 35-49.

[11] G. L. Yuan and X. W. Lu, "A Modified PRP Conjugate Gradient Method," Annals of Operations Research, Vol. 166, No. 1, 2009, pp. 73-90.

[12] G. Yuan, X. Lu and Z. Wei, "A Conjugate Gradient Method with Descent Direction for Unconstrained Optimization," Journal of Computational and Applied Mathematics, Vol. 233, No. 2, 2009, pp. 519-530.

[13] G. L. Yuan and Z. X. Wei, "New Line Search Methods for Unconstrained Optimization," Journal of the Korean Statistical Society, Vol. 38, No. 1, 2009, pp. 29-39.
[14] W. C. Davidon, "Variable Metric Method for Minimization," SIAM Journal on Optimization, Vol. 1, No. 1, 1991, pp. 1-17.

[15] D. H. Li and M. Fukushima, "A Modified BFGS Method and Its Global Convergence in Nonconvex Minimization," Journal of Computational and Applied Mathematics, Vol. 129, No. 1-2, 2001, pp. 15-35.

[16] D. H. Li and M. Fukushima, "On the Global Convergence of the BFGS Method for Nonconvex Unconstrained Optimization Problems," SIAM Journal on Optimization, Vol. 11, No. 4, 2000, pp. 1054-1064.

[17] M. J. D. Powell, "A New Algorithm for Unconstrained Optimization," In: J. B. Rosen, O. L. Mangasarian and K. Ritter, Eds., Nonlinear Programming, Academic Press, New York, 1970.

[18] Z. Wei, G. Yu, G. Yuan and Z. Lian, "The Superlinear Convergence of a Modified BFGS-Type Method for Unconstrained Optimization," Computational Optimization and Applications, Vol. 29, 2004, pp. 315-332.

[19] J. Z. Zhang, N. Y. Deng and L. H. Chen, "New Quasi-Newton Equation and Related Methods for Unconstrained Optimization," Journal of Optimization Theory and Applications, Vol. 102, No. 1, 1999, pp. 147-167.

[20] Z. Wei, G. Li and L. Qi, "New Quasi-Newton Methods for Unconstrained Optimization Problems," Applied Mathematics and Computation, Vol. 175, No. 2, 2006, pp. 1156-1188.

[21] G. L. Yuan and Z. X. Wei, "Convergence Analysis of a Modified BFGS Method on Convex Minimizations," Computational Optimization and Applications, doi:10.1007/s10589-008-9219-0.

[22] R. H. Byrd, J. Nocedal and R. B. Schnabel, "Representations of Quasi-Newton Matrices and Their Use in Limited Memory Methods," Mathematical Programming, Vol. 63, No. 1-3, 1994, pp. 129-156.

[23] C. G. Broyden, J. E. Dennis and J. J. Moré, "On the Local and Superlinear Convergence of Quasi-Newton Methods," IMA Journal of Applied Mathematics, Vol. 12, No. 3, 1973, pp. 223-246.

[24] R. H. Byrd and J. Nocedal, "A Tool for the Analysis of Quasi-Newton Methods with Application to Unconstrained Minimization," SIAM Journal on Numerical Analysis, Vol. 26, No. 3, 1989, pp. 727-739.

[25] R. H. Byrd, J. Nocedal and Y. Yuan, "Global Convergence of a Class of Quasi-Newton Methods on Convex Problems," SIAM Journal on Numerical Analysis, Vol. 24, No. 5, 1987, pp. 1171-1189.

[26] J. E. Dennis and J. J. Moré, "A Characterization of Superlinear Convergence and Its Application to Quasi-Newton Methods," Mathematics of Computation, Vol. 28, No. 126, 1974, pp. 549-560.

[27] A. Griewank and Ph. L. Toint, "Local Convergence Analysis for Partitioned Quasi-Newton Updates," Numerische Mathematik, Vol. 39, No. 3, 1982, pp. 429-448.

[28] A. Perry, "A Class of Conjugate Gradient Algorithms with a Two-Step Variable Metric Memory," Discussion Paper No. 269, Center for Mathematical Studies in Economics and Management Science, Northwestern University, 1977.

[29] D. F. Shanno, "On the Convergence of a New Conjugate Gradient Algorithm," SIAM Journal on Numerical Analysis, Vol. 15, No. 6, 1978, pp. 1247-1257.

[30] Z. Wei, L. Qi and X. Chen, "An SQP-Type Method and Its Application in Stochastic Programming," Journal of Optimization Theory and Applications, Vol. 116, No. 1, 2003, pp. 205-228.

[31] G. L. Yuan and Z. X. Wei, "The Superlinear Convergence Analysis of a Nonmonotone BFGS Algorithm on Convex Objective Functions," Acta Mathematica Sinica, Vol. 24, No. 1, 2008, pp. 35-42.
[32] Y. Dai, "Convergence Properties of the BFGS Algorithm," SIAM Journal on Optimization, Vol. 13, No. 3, 2003, pp. 693-701.

[33] W. F. Mascarenhas, "The BFGS Method with Exact Line Searches Fails for Non-Convex Objective Functions," Mathematical Programming, Vol. 99, No. 1, 2004, pp. 49-61.

[34] R. H. Byrd, P. Lu, J. Nocedal and C. Zhu, "A Limited Memory Algorithm for Bound Constrained Optimization," SIAM Journal on Scientific Computing, Vol. 16, No. 5, 1995, pp. 1190-1208.

[35] M. J. D. Powell, "A Fast Algorithm for Nonlinearly Constrained Optimization Calculations," Lecture Notes in Mathematics, Vol. 630, Springer, Berlin, pp. 144-157.

[36] M. J. D. Powell, "Some Properties of the Variable Metric Algorithm," In: F. A. Lootsma, Ed., Numerical Methods for Nonlinear Optimization, Academic Press, London, 1972.

[37] J. E. Dennis and J. J. Moré, "Quasi-Newton Methods, Motivation and Theory," SIAM Review, Vol. 19, No. 1, 1977, pp. 46-89.

[38] J. Y. Han and G. H. Liu, "Global Convergence Analysis of a New Nonmonotone BFGS Algorithm on Convex Objective Functions," Computational Optimization and Applications, Vol. 7, No. 3, 1997, pp. 277-289.