The Cardinality Constraint-Based Optimization problem is investigated in this note. In portfolio optimization problem, the cardinality constraint allows one to invest in assets out of a universe of N assets for a prespecified value of K. It is generally agreed that choosing a “small” value of K forces the implementation of diversification in small portfolios. However, the question of how small must be K has remained unanswered. In the present work, using a comparative approach we show computationally that optimal portfolio selection with a relatively small or large number of assets, K, may produce similar results with differentiated reliabilities.
Although the portfolio optimization problem has been studied using various analytical and numerical techniques for more than half a century, recent development of computer based methods has opened new horizon to research in computation finance. The problem of portfolio optimization has been rendered to be complex for direct solving by traditional numerical approaches when constraints that model investors sentiments and frictions are included in the mathematical model. An optimal stock portfolio investment strategy should show the investor how much to invest in each asset in a given portfolio. The decision variable of stock portfolio optimization is the weight of the asset in the portfolio. Once an optimal weight is obtained, the expected return and risk can be easily calculated. The solution to the stock portfolio optimization problem now lies in graphically obtaining the efficient frontier, which is a risk-return trade-off curve. Each point on the efficient frontier gives the minimum level of risk to take for an expected return or, alternatively, the maximum return one can expect for a given level of risk. Hence a rational investor would usually choose portfolios that occur on the efficient frontier, since these represent “optimal” portfolios.
The Markowitz approach for the solution of the Portfolio Selection Problem assumed a perfect market, ignoring transaction costs, taxes and permitting trading of securities at any proportions. Under these assumptions, the mathematical model is reduced to a quadratic optimization which can be directly solved by classical numerical methods [
We finally introduce the Conditional Value-at-Risk (CVaR) into the constraints and define the cardinality constraints to allow the investors to invest partially in smaller portfolios. The Cardinality Constraint, which is our main focus in this work, is adopted when the investor can only invest in K assets out of the universe of N assets, for a prespecified value of K. Choosing a small value of K forces the implementation of diversification in small portfolios. A cardinality constrained optimization portfolio may then be viewed as a significant research topic in computational finance since the inclusion of such constraints turns the problem mathematically speaking in a mixed integer quadratic programming problem rendering it to be complex for direct solving by numerical methods. However, in some complex cases, a Genetic Algorithm (GA) approach may be required.
In this paper, we discuss the solution of complex cardinality constraint-based optimization problem but we also include all basic constraints (as part of the general constraints on portfolios over time), and implement a GA approach for investigating the solution of such a problem [
We present in this section the mathematical foundation, propositions and theorems, and previous analytical results which support and validate our computational approach.
This subsection presents an overview of the portfolio problem with respect to previous references [
X = { x ∈ R N , x 1 + x 2 + ⋯ + x N = 1 ; x j ≥ 0 , j = 1 , 2 , ⋯ , N } (1)
In some applications one may introduce short selling; that is, allowing some x j to be negative. Other restrictions may limit the exposure to particular assets in given groups, by imposing restrictions on upper bounds of the x j s or their particular subset sum. One can also limit the absolute differences between the proportion x j ( t ) of our assets and some reference proportion x j ¯ (which may represent the existing benchmark portfolio). One of the main difficulties of the Portfolio Optimization Problem remains the type of constraint being applied to the problem. In the next subsection we will present the concept of a dominance constrained portfolio and show how it simplifies the portfolio problem to some extent.
We begin by assuming that the benchmark random return rate Y having a finite expected value is available. It may have the form Y = R ( x ¯ ) for some benchmark portfolio x ¯ , which may be some expected average of our current portfolio (see [
Find min f
Subject to R ( x ) ≥ 2 Y , x ∈ X
Here ≥ 2 denotes dominance and f : X → R is a concave continuous function. In particular, we may use f ( x ) = E ( R ( x ) ) . Now we present some standard theorems and propositions which shall clarify our concepts.
Theorem 1. Let P be a randomly constructed portfolio and P 1 the benchmark portfolio. The second order stochastic dominance constraint is equivalent to the continuum of VaR constraints on portfolios. That is, C V a R α ( P ) ≥ C V a R α ( P 1 ) for all α ∈ ( 0,1 ] , where CVaR α is the conditional Value-at-Risk at level α .
Proof (by analogy to [
The function h ( α , P ) is a curve defined on the portfolio P such that
h ( α , P ( x ) ) ≥ s u p κ { α κ − E [ ( κ − P ( x ) ) + ] } = s u p κ { α κ − g ( P ( x ) ; κ ) } (2)
Observe that h ( • , P ( x ) ) is the conjugate of the function ( P 0 ( x ) ; n ) such that g ( P ( x ) ; κ ) ≤ g ( P 0 ( x ) ) for all κ ∈ R (also notice that g and h are dual functions by the definition). This implies that h ( α , P ( x ) ) ≥ h ( α , P 0 ( x ) ) , but, since g ( P ( x ) ) is continuous by duality of the conjugate, we conclude that the initial capital allowed for risk exposure at level α ≠ 0 is given by the benchmark outcome P 1 , and finally − C V a R α ( P 1 ) = T α , where T α is the tail conditional expectation equivalent or the expected shortfall at fixed level α . ,
Remark 1. If the portfolio is discrete, the stochastic dominance constraint can be replaced by infinitely many inequalities.
Proposition 1. Assume that the portfolio P 1 ( x ) defined as a linear function on assets has a discrete distribution with realisation { p i } , i = 1 , ⋯ , N . Then, the following equality holds:
E [ ( p i − P ( x ) ) + ] ≤ E [ ( p i − P 1 ( x ) ) + ] (3)
where ( p i − P ( x ) ) + ≡ m a x ( 0, p i − P ( x ) ) , recalling that P denotes our portfolio.
Remark 2. The above proposition does not imply that the continuum of CVaR constraints can be replaced by finitely many constraints of the form C V a R α ( P ( x ) ) ≥ C V a R ( P 1 ( x ) ) .
Proposition 2. Let P = P ( x ) be the payoff or value of a portfolio at some future time and 0 ≤ α ≤ 1 . If the underlying distribution of the portfolio P is a continuous distribution then the Expected Shortfall E S α is reduced to the Tail Conditional Expectation T C E α
T C E α ( P ) = E S α ( P ) = E [ − P | P ≤ − V a R α ( P ) ] (4)
Let us now focus on extreme values of the portfolio. Basically a portfolio may have heavy tail distribution partly due to violent market movements. These large market movements, far from being accepted as simple outliers, focus the attention of all investor or players, since their magnitude may be such that they compose a fraction of the portfolio return aggregate over long period of time. These observations have motivated numerous theoretical and computational efforts to understand the intermittent role of various assets in the portfolio and model adequately the tail of distribution of their returns. Such studies are relevant for risk management purposes and also necessary for calculation of average risk (of loss in portfolios) which may be required to determine regulatory capital requirements [
m N ( x , Δ t ) = min { x ( t + k Δ t ) ; k = 1 , ⋯ , N }
M N ( x , Δ t ) = max { x ( t + k Δ t ) ; k = 1 , ⋯ , N }
In terms of risk management m n ( x , Δ t ) represents the worst relative loss over time horizon Δ t of an investor holding a portfolio P ( x ( t ) ) . The question of how the properties of returns affect the probability distribution of m n ( x , Δ t ) and M n ( x , Δ t ) simultaneously over time seems to be quite attractive. Hence, if we knew the stochastic process generating the returns, we could easily evaluate the distribution of the extremes, but this is unfortunately not the case and that is where extreme value theory is needed. However we will not discuss the extreme value theory in this paper.
The reliability of a portfolio P is a quantity that serves as a good measure to compare the goodness of the fit in the portfolio. For a given level of expected return, with σ pred and σ real to be the corresponding predicted and realised risks, the reliability ℜ of a portfolio is given by
ℜ = | σ pred − σ real σ real | (5)
A portfolio is more reliable when ℜ is small. In our setup the predicted risk is computed by our GA, whereas the realised risk uses the usual formula of variance. Here we use only a positive definite reliability to simplify the comparison between optimal reliability gains over generation time. In the next section we shall formulate the cardinality constraints based optimization problem.
To bring the element of time into play, we now expand our approach from vectors to matrices. Let N be the number of assets in the universe, μ i = E ( R j ) the expected return of the asset i, and σ i j the covariance between the returns of assets i and j in the historical data. Let w = ( w i j ) denote the weight matrix with elements representing the proportion of capital to be invested in asset j at time t = i , and K i be the number of assets in which the investor decides to invest their capital at time i. Let w j denote the vector given by column j of w (that is, the weights over all time intervals for asset j, where i denotes the ith entry of w j ), x 0 the N × N matrix of initial prices of assets over all time intervals, P = ( P i j ) the N × N matrix of portfolios (rows) over time where P i j = x 0 , i j w i j . Using the above notation, the expected stock portfolio return vector and risk matrix are then given respectively by
R ¯ = ∑ j = 1 N w j μ j and r = ∑ i = 1 N ∑ j = 1 N ( w i , w j ) σ i j (6)
where
μ j = 1 N ∑ i = j n P j (7)
is the arithmetic mean of column j of the portfolio matrix, ( w i , w j ) is the scalar product of w i and w j .
We also define a risk aversion parameter λ ∈ [ 0,1 ] to present what is known as the weighted formulation of the portfolio optimization problem. Observation of λ reveals that when it is close to zero, the weights shift toward stocks yielding high returns and when close to one, the weights shift toward combinations of stocks yielding low volatility in the efficient set. Define the cost function
f = λ ∑ i = 1 N ∑ j = 1 N ( w i , w j ) σ i j − ( 1 − λ ) ∑ j = 1 N w j μ j (8)
Now let us formulate our modified problem
a) Find
m i n w i , w j f (9)
b) Subject to:
i) ∑ j = 1 N w i j = 1 for all i = 1 , ⋯ , N (Basic constraint)
ii) ν j ≤ w i j ≤ δ j , 0 ≤ ν j < δ j ≤ 1 for all i , j (Boundary constraint)
iii) Pr ( l P ( t ) ≤ 0.01 ) ≤ 0.01 (Probability constraint)
iv) #Math_108# where Z i j = { 1 ; if w i j > ν j 0 ; otherwise (Cardinality constraint)
l P ( t ) is the loss in portfolio P at time t and condition (i) requires that each row of the weight matrix sums to 1, with condition (ii) requiring two vectors ν = ( ν j ) , δ = ( δ j ) consisting of (possibly distinct) boundaries. Condition (iii) stipulates that the loss of any new portfolio created from an old one should be less than 1% to be acceptable. In this paper we will not go into detail on the GA implementation, rather focusing on the analysis of its outputs. For the interested reader, the algorithm may be found in [
We apply our method to the historical data of eight years worth of assets (
In the mathematical interpretation, this data is an 8 by 8 matrix where columns represent assets and rows are the values of the asset. The values are normalised beforehand for stability reasons. Fixing a given year (row), then by multiplying each asset by the corresponding weight of the above matrix data in and summing the results, we obtain the portfolio value at the given year (row). Below we give the results obtained for the values K = 3 , 4 , 7 , 8 .
Comments in Figures 1-4 show the results obtained with three experiments (that is, three different initial weight matrices were generated at random, subject to the given conditions) on which the GA was run five times for each of the four different values of K. Each table exhibits the following: Best cost value attained; Gain from minimally reliable portfolio or row; Mean and standard deviation (SD) of each experiment, Total % Gain in Matrix portfolio value from a randomly produced initial weight matrix w 0 ; Maximum Mean Portfolio Value in Best fit matrix over all generations, and Minimum Standard deviation of Portfolio Value in the best fit matrix over all generations obtained by the GA. It can be seen that when the investor decides on diversification in the portfolio constructed with K = 3 and K = 7 assets, the expected portfolio value is higher than in the portfolio constructed with K = 4 and K = 8 assets.
The proposed comparative analysis shows that high return on a portfolio may be
Year | CAC 40 (Close) | FTSE 100 (Close) | S & P 500 (Close) | DJW 5000 (Close) | NASDAQ Comp. (Close) | BARCLAY S 7610 vr treas (Close) | MCSI EAFE Close | Gold London PM Fix |
---|---|---|---|---|---|---|---|---|
2003 | 2937.88 | 3567.40 | 855.70 | 8125.10 | 1320.91 | 85.49 | 94.75 | 343.80 |
2004 | 3638.44 | 4390.70 | 1131.13 | 11,029.20 | 2066.15 | 85.56 | 138.32 | 416.25 |
2005 | 3913.69 | 4852.30 | 1181.27 | 11,642.60 | 2962.41 | 85.69 | 157.20 | 427.75 |
2006 | 4947.99 | 5760.30 | 1280.08 | 12,953.60 | 2305.82 | 83.32 | 62.86 | 530.00 |
2007 | 5608.31 | 6203.10 | 1438.24 | 14,489.70 | 2463.93 | 82.11 | 74.24 | 639.75 |
2008 | 4869.79 | 5879.80 | 1378.55 | 13,896.70 | 2389.86 | 89.93 | 72.34 | 846.75 |
2009 | 2973.92 | 4149.60 | 825.88 | 8335.60 | 1476.42 | 94.70 | 38.70 | 874.50 |
2010 | 3739.46 | 5188.50 | 1073.87 | 11,099.40 | 2147.35 | 90.70 | 52.48 | 1121.50 |
K | 3 | 4 | 7 | 8 |
---|---|---|---|---|
Best Cost | −3.408 | −3.467 | −3.564 | −3.564 |
Max (Normalized) Mean PV | 0.485 | 0.478 | 0.485 | 0.483 |
Min (Normalized) SD PV | 0.208 | 0.193 | 0.066 | 0.066 |
Min Rel | 0.216 | 0.211 | 0.178 | 0.292 |
Min Rel % Gain | 156.114 | 51.143 | 223.592 | 147.173 |
Total % Gain | 86.517 | 87.052 | 116.160 | 120.608 |
Best ES | 0.485 | 0.478 | 0.484 | 0.483 |
achieved with both a relatively low ( K = 3 ) and a relatively high ( K = 7 ) number of assets and differentiated risks (which have, respectively, a relatively high and low risk). We recall that we choose to measure the risk of a portfolio by the normalised standard deviation in its value over the given number of trials. We observed that the “mean” portfolio for K = 3 has a high mean portfolio value with high risk, and an investor who hopes to increase his or her portfolio value in an aggressive manner may chose such an allocation procedure. This approach will then challenge the Markowitz belief that the only optimal portfolio is the one with higher expected mean portfolio value and smaller risk or standard deviation. But generally speaking, most nonaggressive investors will tend to follow the Markowitz theory when constructing an investment strategy.
Our results have again highlighted the possibility of constructing an optimal portfolio with both low or relatively high risk (which is almost twice the lowest risk; see
In this paper a comparative study of the Constrained Portfolio Optimization Problem with cardinality constraint is investigated. The experimental studies have been undertaken on widely used indexes form the Period of January 2003 to January 2010. Our conclusions are as follows:
1) The diversification in asset numbers less than the total number of assets (that is, for K < N ) may increase the expected portfolio return with different reliabilities. We found that the “mean” portfolios for K = 3 and K = 7 both had high mean portfolio values, but the K = 7 portfolio had the best reliability, meaning that the investor will choose either K = 3 or K = 7 depending on how reliable it is likely to be.
2) Important information such as the gain from minimally reliable portfolio or row, mean and standard deviation (SD) of each experiment, total percentage gain in matrix portfolio value, and the maximum mean and minimum standard deviation in portfolio value may play an important role in optimal portfolio selection and management.
3) The “mean” portfolio for K = 7 has the highest “most reliable row” gain percentage, indicating that the preference of investors may be higher on such a portfolio compared to that of K = 3 , even though they have identical mean portfolio values.
Future directions of this research include: investigating new concepts for diversification in large portfolios and comparing with results of diversification in small portfolios, therefore building a theory that could link those two approaches.
We are grateful to all colleagues for discussions and suggestions which helped us to improve the presentation of this paper. We thank the Waseda University and University Paris 1, for the generous research support during the preparation of this work. This work was also supported by the Grant-in-Aid for Scientific Research, both of which are part of the MEXT (Ministry of Education, Culture and Sports, Sciences and Technology of Japan). We finally thank Professor Yuichi Sakumura for useful comments and the SAKURA Research & Consulting for bringing up such an interesting research topic.
Jimbo, H.C., Ngongo, I.S., Andjiga, N.G., Suzuki, T. and Onana, C.A. (2017) Portfolio Optimization under Cardinality Constraints: A Comparative Study. Open Journal of Statistics, 7, 731-742. https://doi.org/10.4236/ojs.2017.74051