Fuzzy regression analysis is an important regression analysis method to predict uncertain information in the real world. In this paper, the input data are crisp with randomness; the output data are trapezoid fuzzy number, and three different risk preferences and chaos optimization algorithm are introduced to establish fuzzy regression model. On the basis of the principle of the minimum total spread between the observed and the estimated values, risk-neutral, risk-averse, and risk-seeking fuzzy regression model are developed to obtain the parameters of fuzzy linear regression model. Chaos optimization algorithm is used to determine the digital characteristic of random variables. The mean absolute percentage error and variance of errors are adopted to compare the modeling results. A stock rating case is used to evaluate the fuzzy regression models. The comparisons with five existing methods show that our proposed method has satisfactory performance.
Fuzzy set theory was introduced by Zedah in 1964. On the basis, fuzzy set theory has been developed rapidly and applied in many fields, such as control system, artificial intelligence, and pattern recognition, etc. In the field of fuzzy regression analysis, two main methods are proposed by Tanaka and Diamond. The first approach; Tanaka [
The latter scholars have developed new studies based on the two mentioned above. Tanaka and Watada [
The previous models were studied to solve fuzziness, but some data not only have fuzziness but also have randomness. Only few previous studies have examined both fuzziness and randomness in empirical research. In this paper, according to the actual meaning of the random variables, the appropriate numerical characteristics are selected. Introduced chaos optimization algorithm (COA) determines the numerical characteristics of the random variable. Three mathematical programming models are proposed, called risk-neutral, risk-averse and risk-seeking. Based on this, the fuzzy regression coefficient of different risk models can be obtained, and model are determined.
Definition 2.1 [
Definition 2.2 [
Definition 2.3 [
μ A ( x ) = { L ( a − x α ) x ≤ a , α > 0 R ( x − a β ) x > a , β > 0
where a ∈ R is the center of the function, α , β are the left and right spread of the function, respectively. This fuzzy number is referred to as L − R fuzzy number. The shape of function satisfies 1) L ( x ) = L ( − x ) , R ( x ) = R ( − x ) , 2) L ( 0 ) = 1 , R ( 0 ) = 1 , and 3) R ( x ) are non-increasing on [ 0, ∞ ) .
R ˜ L R is a set of all L − R fuzzy numbers, L − R fuzzy number A ˜ has the left and right spreads from a real number whose membership is 1. The spreads represent the fuzziness of the fuzzy number could be symmetric or non-symmetric. If the spreads are null, there is no fuzziness of the number, and it is a real number. The choice of the L and R functions is dependent upon the subjective judgement.
Proposition 2.1 ( α , a , β ) L R , ( γ , b , δ ) ∈ R ˜ L R , k ∈ R .
1) ( α , a , β ) L R + ( γ , b , δ ) L R = ( α + γ , a + b , β + δ ) L R
2) k ⋅ ( α , a , β ) L R = { ( k α , k a , k β ) L R , k ≥ 0 ( − k β , k a , − k α ) R L , k < 0
− ( α , a , β ) L R = ( β , − a , α ) R L , especially.
Definition 2.4 A ˜ ∈ R ˜ , if the membership function of A ˜ is defined as
μ A ( x ) = { x − a + l l , a − l ≤ x ≤ a 1 , a ≤ x ≤ b b + r − x r , b ≤ x ≤ b + r 0 , else
A ˜ = ( l , a , b , r ) T is trapezoidal fuzzy number. If the fuzzy number is symmetric fuzzy numbers, l = r . The fuzzy number is triangular fuzzy number when a = b.
The set of trapezoid fuzzy numbers is denoted by R ˜ T , and R ˜ T ⊆ R ˜ L R ⊆ R ˜ .
Proposition 2.2 ( l 1 , a 1 , b 1 , r 1 ) , ( l 2 , a 2 , b 2 , r 2 ) ∈ R ˜ T , k ∈ R , then
1) ( l 1 , a 1 , b 1 , r 1 ) T + ( l 2 , a 2 , b 2 , r 2 ) T = ( l 1 + l 2 , a 1 + a 2 , b 1 + b 2 , r 1 + r 2 ) T
2) k ⋅ ( l 1 , a 1 , b 1 , r 1 ) T = { ( k l 1 , k a 1 , k b 1 , k r 1 ) T , k ≥ 0 ( − k r 1 , k b 1 , k a 1 , − k l 1 ) T , k < 0
The fuzzy linear regression model can be state as
Y ˜ i = A ˜ 0 + A ˜ 1 x i 1 + ⋯ + A ˜ k x i k = A ˜ x i , i = 1 , ⋯ , n (1)
where Y ˜ i = ( l y i , a y i , b y i , r y i ) T is fuzzy output value of the ith observation, x i j is crisp value of the jth independent variable in the ith observation, j = 1 , ⋯ , k , k is the number of independent variables; and x i 0 = 1 . A ˜ j = ( l j , a j , b j , r j ) is the fuzzy coefficients. Y ¯ ˜ i is the observed value.
As the characteristics of the sample directly reflect the overall population, the sample's numerical characteristics can be used to evaluate the overall population. The common numerical characteristics are expectation, variance, etc. Select the corresponding numerical characteristics according to the actual needs, determine the numerical characteristics to be determined using COA.
The chaos optimization algorithm proposed by Li [
The processes of applying COA in this study are described as follows.
First, the number of iterations of COA is defined. Each chaos variable represents numerical characteristics to be estimated, and the number of elements in a chaos variable is equal to the number of parameters to be determined. The chaos variable is initialized in which the values are selected randomly in the range [ 0,1 ] . The ranges of parameters [ a , b ] are initialized, in which a and b are the lower and upper limits of the optimization variable, respectively.
Second, the iteration number is set as m = 1 . Based on the initialized chaos variable, the logistic model used in COA in Equation (2), and logistic mapping can generate chaos variables through iteration.
c m = f ( c m − 1 ) = μ c m − 1 ( 1 − c m − 1 ) (2)
where μ is control parameter; c m ∈ [ 0,1 ] is the mth iterations value of the chaos variable c; and c 0 is the initialized chaos variable.
The linear mapping for converting chaos variables into optimization variables is formulated as follows:
q m = a + ( b − a ) c m (3)
where q m is the optimization variable and the value of q m is the parameter settings to be determined. Based on the iteration, the chaos variable traverse between [ 0,1 ] , and the corresponding optimization variables traverse in the corresponding range [ a , b ] . In this case, the optimal solution can be identified in the area of feasible solutions. Based on the values of q m , the numerical characteristics can be obtained. The model can be developed based on numerical characteristics and fuzzy coefficients by which the predicted output of Y ˜ i = ( l y i , a y i , b y i , r y i ) can be obtained. The predicted crisp output of Y ˜ i is denoted as y ^ i , which is the centroid of Y ˜ i .
Third, the mean absolute percentage error (MAPE) is defined as the average of percentage errors, which is scale-independent and is a popular measure for evaluating accuracy [
M A P E = 1 n ∑ i = 1 n | y ^ i − y i | y i ⋅ 100 (4)
where y ^ i is the centroid of Y ˜ i , y i is the centroid of Y ¯ ˜ i . The value of MAPE and q m in the first iteration are recorded as the best fitness value f v * = M A P E 1 and the best solution q * = q 1 , respectively.
Fourth, the iteration continues by m + 1 → m . The chaos variable and optimization variable are updated by Equation (2) and Equation (3), respectively. The MAPE in the m + 1 iteration, M A P E m + 1 is obtained using Equation (4). The process is as follows:
( f v * = M A P E m + 1 , q * = q m + 1 M A P E ≤ f v * f v * = M A P E m , q * = q m else (5)
Considering the random variable, the model in Equation (1) can be rewritten as follows:
( l y i , a y i , b y i , r y i ) = ( l 0 , a 0 , b 0 , r 0 ) + ( l 1 , a 1 , b 1 , r 1 ) x ′ i 1 + ⋯ + ( l k , a k , b k , r k ) x ′ i k (6)
a y i = ∑ j = 0 , x i j ≥ 0 k a j x ′ i j + ∑ j = 1 , x i j < 0 k b j ( − x ′ i j ) (7)
b y i = ∑ j = 0 , x i j ≥ 0 k b j x ′ i j + ∑ j = 1 , x i j < 0 k a j ( − x ′ i j ) (8)
l y i = ∑ j = 0 , x i j ≥ 0 k l j x ′ i j + ∑ j = 1 , x i j < 0 k r j ( − x ′ i j ) (9)
r y i = ∑ j = 0 , x i j ≥ 0 k r j x ′ i j + ∑ j = 1 , x i j < 0 k l j ( − x ′ i j ) (10)
where x ′ i j is a certain numerical characteristic of x i j , i = 1 , ⋯ , n ; j = 1 , ⋯ , k .
In this section, three different risk preference are introduced, called risk-neutral, risk-averse and risk-seeking problems, to determine the fuzzy regression model.
Dobois and Prade [
P o s ( A ˜ = B ˜ ) = sup x ∈ R { min { A ˜ ( x ) , B ˜ ( x ) } } (11)
N e s ( A ˜ ⊆ B ˜ ) = inf x ∈ R { max { 1 − A ˜ ( x ) , B ˜ ( x ) } } (12)
where Pos and Nes are short for Possibility and Necessity. Observe that index Equation (11) means the degree of possibility that A ˜ is equal to B ˜ and index Equation (12) means the degree of necessity that A ˜ includes B ˜ . Let Y ˜ i and Y ¯ ˜ i represent the estimate value and the observed value, respectively. The degree of fitness of the risk-neutral, risk-averse, risk-seeking denoted by f i R N , f i R A , f i R S , respectively and defined as, f i R N = P o s ( Y ¯ ˜ i = Y ˜ i ) , f i R A = N e s ( Y ¯ ˜ i ⊆ Y ˜ i ) , f i R S = N e s ( Y ¯ ˜ i ⊇ Y ˜ i ) . By considering Y ˜ 1 , Y ˜ 2 , ⋯ , Y ˜ m , the risk-neutral,risk-averse and risk-seeking degree of fitness the estimate model, denoted by f R N , f R A , f R S , respectively, are defined by f R N = min i { f i R N } , f R A = min i { f i R A } , f R S = min i { f i R S } . From the properties of the two indices Equation (11) and Equation (12), f R N ≥ h , f R A ≥ h and f R S ≥ h can be transformed to the usual inequalities for some fixed h ∈ [ 0,1 ] .
Proposition 4.1 [
Y ˜ i , L ( h ) ≤ Y ¯ ˜ i i , R ( h ) , Y ˜ i , R ( h ) ≥ Y ¯ ˜ i i , L ( h ) , i = 1 , ⋯ , n
2) f R N ≥ h if and only if
Y ˜ i , L ( h ) ≤ Y ¯ ˜ i , R ( 1 − h ) , Y ˜ i , R ( h ) ≥ Y ¯ ˜ i , L ( 1 − h ) , i = 1 , ⋯ , n
3) f R S ≥ h if and only if
Y ˜ i , L ( 1 − h ) ≤ Y ¯ ˜ i , R ( h ) , Y ˜ i , R ( 1 − h ) ≥ Y ¯ ˜ i , L ( h ) , i = 1 , ⋯ , n
where Y ˜ i , L ( h ) , Y ˜ i , R ( h ) , Y ¯ ˜ i , L ( h ) , Y ¯ ˜ i , R ( h ) are the left and right end point of fuzzy estimated output and observed output in the h level, respectively. Proposition 4.1 can be understood as
1) f R N ≥ h if and only if
Y ¯ ˜ i ( h ) ∩ Y ˜ i ( h ) ≠ ∅ , i = 1 , ⋯ , n
2) f R A ≥ h if and only if
Y ¯ ˜ i ( 1 − h ) ⊆ Y ˜ i ( h ) , i = 1 , ⋯ , n
3) f R S ≥ h if and only if
Y ˜ i ( 1 − h ) ⊆ Y ¯ ˜ i ( h ) , i = 1 , ⋯ , n
Corollary 4.1 If max { f R A , f R S } ≥ h , 1 2 ≤ h ≤ 1 , then f R N ≥ h .
The objective function of the model is to minimize the difference between the total spread of observed and estimated values it is given by
J = ∑ i = 1 n { ( Y ˜ i , R ( 0 ) − Y ¯ ˜ i , R ( 0 ) ) 2 + ( Y ˜ i , L ( 0 ) − Y ¯ ˜ i , L ( 0 ) ) 2 } (13)
Considering the above assumption, the problem is to obtain fuzzy parameters of risk-neutral, risk-averse and risk-seeking model in order to minimize J in Equation (13) subject to f R N ≥ h , f R A ≥ h and f R S ≥ h , respectively. Consequently, the mathematical models lead to the following three quadratic programming problems:
RNP(h) (Risk―neutral problem)
min : J = ∑ i = 1 n { ( Y ˜ i , R ( 0 ) − Y ¯ ˜ i , R ( 0 ) ) 2 + ( Y ˜ i , L ( 0 ) − Y ¯ ˜ i , L ( 0 ) ) 2 } (14)
s .t . Y ˜ i , L ( h ) ≤ Y ¯ ˜ i , R ( h ) , Y ˜ i , R ( h ) ≥ Y ¯ ˜ i , L ( h ) Y ˜ i , R ( 0 ) − Y ˜ i , L ( 0 ) ≥ 0 , i = 1 , ⋯ , n (15)
RNP(h) (Risk―averse problem)
min : J = ∑ i = 1 n { ( Y ˜ i , R ( 0 ) − Y ¯ ˜ i , R ( 0 ) ) 2 + ( Y ˜ i , L ( 0 ) − Y ¯ ˜ i , L ( 0 ) ) 2 } (16)
s .t . Y ˜ i , L ( h ) ≤ Y ¯ ˜ i , R ( 1 − h ) , Y ˜ i , R ( h ) ≥ Y ¯ ˜ i , R ( 1 − h ) Y ˜ i , R ( 0 ) − Y ˜ i , L ( 0 ) ≥ 0 , i = 1 , ⋯ , n (17)
RNP(h) (Risk―seeking problem)
min : J = ∑ i = 1 n { ( Y ˜ i , R ( 0 ) − Y ¯ ˜ i i , R ( 0 ) ) 2 + ( Y ˜ i , L ( 0 ) − Y ¯ ˜ i i , L ( 0 ) ) 2 } (18)
s .t . Y ˜ i , R ( 1 − h ) ≤ Y ¯ ˜ i , R ( h ) , Y ˜ i , L ( h ) ≥ Y ¯ ˜ i , L ( h ) Y ˜ i , R ( 0 ) − Y ˜ i , L ( 0 ) ≥ 0, i = 1, ⋯ , n (19)
The algorithm of the proposed model is summarized below.
Step 1: The parameters are initialized, including the number of iterations, initial value of numerical characteristics, initialized chaos variables, and ranges of parameters.
Step 2: The structure of the proposed model is generated using Equation (8). The number of terms in the model is 1 + k , where k is the number of independent variables.
Step 3: The iteration begins from m = 1 . The chaos variables c m are generated based on the logistic model in Equation (2) and transformed into optimization variables q m using Equation (3).
Step 4: The interval of a random variable is defined based on the experimental data, and the corresponding numerical characteristics is selected. The numerical characteristic value of random variables are then generated based on the values of q m . The random variables are substituted by their corresponding numerical characteristics, and the probabilistic terms of the proposed models are generated.
Step 5: The fuzzy coefficient of each term of the proposed model is determined by solving the quadratic programming problems shown in Equations (14) to (19).
Step 6: Predicted output y ^ i is calculated with the developed proposed models. MAPE between estimate value y ^ i and actual value y i for all data sets can then be obtained using Equation (4) as the fitness value of the iteration m.
Step 7: The iteration is continued by m + 1 → m and stops after the number of iterations reaches the predefined value. The values of MAPE are obtained for each iteration and compared. The solution with the smallest fitness value is selected based on step 3 in Section 3.2. A model with the smallest error is then generated.
A stocking rating case was conducted to evaluate the effectiveness of the proposed approach. Securities analysts collect data to study macro economy and company, then forecast the earning and stocking value of the company being tracked. And provide five investment rating, which are buying, increasing, neutral, reducing, and selling for investors. Stock rating coefficient: 1.00 - 1.09 buying; 1.10 - 2.09 increasing; 2.10 - 3.09 neutral; 3.10 - 4.09 decreasing; 4.10 - 5.00 selling. Introducing fuzzy number to deal with the interval of different ratings. Therefore, the stock rating can be expressed as trapezoid fuzzy number as follows:
“ buying ” = ( 0.00 , 1.00 , 1.09 , 0.01 ) T
“ increasing ” = ( 0.01 , 1.10 , 2.09 , 0.01 ) T
“ neutral ” = ( 0.01 , 2.10 , 3.09 , 0.01 ) T
“ decreasing ” = ( 0.01 , 3.10 , 4.09 , 0.01 ) T
“ selling ” = ( 0.01,4.10,5.00,0.00 ) T
There are many factors that affect stock rating. However, based on the three principles of comprehensiveness, comparability and feasibility, we choose the following indicators to form an evaluation index system: profitability (x1), operational capacity (x2), short-term debt paying ability (x3) and volatility of the stock (x4).
This case study the relationship between stock rating (y) and profitability (x1), operational capacity(x2), short-term debt paying ability (x3) and volatility of the stock (x4). In the stock rating forecast, the input is the crisp value and the output is the trapezoid fuzzy number. The value of stock price volatility is the standard deviation of the closing price of the stock from Sep 18th, 2017 to Oct 18th.
The sample date collected are shown in
After the data collection, based on the initialized stock volatility, fuzzy coefficient of different risks can be obtained by Equations (14) to (19). Generating new stock volatility by COA, Equation (4) are considered as the objective function. In the end, stock volatility based on different risks are shown in
ID | x1 | x2 | x3 | x4 | ID | x1 | x2 | x3 | x4 |
---|---|---|---|---|---|---|---|---|---|
1 | 29.98 | 0.98 | 0.37 | 0.12 | 14 | 43.12 | 0.84 | 2.21 | 0.08 |
2 | 22.88 | 1.01 | 0.86 | 0.08 | 15 | 37.49 | 2.22 | 1.50 | 1.03 |
3 | 48.57 | 1.50 | 0.49 | 0.54 | 16 | 1.50 | 0.63 | 2.12 | 1.19 |
4 | 26.20 | 2.53 | 0.22 | 0.95 | 17 | 10.05 | 0.73 | 1.89 | 0.03 |
5 | 149.13 | 1.07 | 0.64 | 0.11 | 18 | 40.87 | 0.65 | 3.32 | 0.24 |
6 | 99.59 | 1.75 | 0.96 | 1.15 | 19 | 9.63 | 0.35 | 3.20 | 0.13 |
7 | 76.79 | 0.91 | 1.22 | 0.56 | 20 | 3.08 | 0.43 | 0.72 | 0.04 |
8 | 1.72 | 2.02 | 0.06 | 0.07 | 21 | 23.10 | 1.35 | 1.78 | 1.31 |
9 | 6.46 | 0.56 | 0.23 | 0.12 | 22 | 20.97 | 1.25 | 1.43 | 0.39 |
10 | 65.67 | 0.17 | 0.11 | 0.60 | 23 | 21.99 | 1.07 | 0.67 | 0.11 |
11 | 58.65 | 0.91 | 0.09 | 0.19 | 24 | 19.87 | 2.54 | 0.23 | 0.22 |
12 | 197.61 | 2.64 | 0.43 | 0.25 | 25 | 35.34 | 2.22 | 1.37 | 0.32 |
13 | 95.72 | 1.33 | 0.79 | 0.33 |
ID | Rating | ID | Rating | ID | Rating |
---|---|---|---|---|---|
1 | (0.00 1.00 1.09 0.01) | 10 | (0.00 1.00 1.09 0.01) | 19 | (0.01 3.10 4.09 0.01) |
2 | (0.01 2.10 3.09 0.01) | 11 | (0.00 1.00 1.09 0.01) | 20 | (0.01 3.10 4.09 0.01) |
3 | (0.00 1.00 1.09 0.01) | 12 | (0.00 1.00 1.09 0.01) | 21 | (0.01 3.10 4.09 0.01) |
4 | (0.00 1.00 1.09 0.01) | 13 | (0.01 1.10 2.09 0.01) | 22 | (0.01 2.10 3.09 0.01) |
5 | (0.00 1.00 1.09 0.01) | 14 | (0.01 1.10 2.09 0.01) | 23 | (0.01 3.10 4.09 0.01) |
6 | (0.01 1.10 2.09 0.01) | 15 | (0.00 1.00 1.09 0.01) | 24 | (0.01 1.10 2.02 0.01) |
7 | (0.01 1.10 2.09 0.01) | 16 | (0.01 3.10 4.09 0.01) | 25 | (0.01 3.10 4.09 0.01) |
8 | (0.00 1.00 1.09 0.01) | 17 | (0.01 2.10 3.09 0.01) |
V o E = 1 n − 1 ∑ i = 1 n ( | y ^ i − y i | y i − M A P E ) 2 (20)
The same survey data was utilized to develop models based on SR, TH, DM, KC, JL. For SR, the centroid of fuzzy number are considered as fuzzy number. The models are produced by different methods as follows in
In this paper, we developed three mathematical programming models, called risk-neutral, risk-averse, and risk-seeking, to studied fuzzy linear regression
ID | Confidence Interval | ID | Confidence Interval | ID | Confidence Interval |
---|---|---|---|---|---|
1 | [0.0940 0.2010] | 10 | [0.4539 0.9276] | 19 | [0.0734 0.1034] |
2 | [0.0577 0.1179] | 11 | [0.1431 0.2925] | 20 | [0.0276 0.0564] |
3 | [0.4031 0.8238] | 12 | [0.1924 0.3932] | 21 | [0.9000 1.8391] |
4 | [0.7527 1.4829] | 13 | [0.2474 0.5057] | 22 | [0.2895 0.5916] |
5 | [0.0758 0.1548] | 14 | [0.0651 0.1330] | 23 | [0.0865 0.1767] |
6 | [0.8851 1.8087] | 15 | [0.6015 1.2292] | 24 | [0.1550 0.3167] |
7 | [0.4293 0.8772] | 16 | [0.9011 1.8414] | 25 | [0.2229 0.4555] |
8 | [0.0553 0.1131] | 17 | [0.0293 0.0294] | ||
9 | [0.0825 0.1686] | 18 | [0.1644 0.2844] |
ID | Variance | ID | Variance | ID | Variance | ID | Variance |
---|---|---|---|---|---|---|---|
1 | 0.1737 | 8 | 0.0603 | 15 | 0.7842 | 22 | 0.5819 |
2 | 0.0996 | 9 | 0.1086 | 16 | 0.9935 | 23 | 0.1307 |
3 | 0.6695 | 10 | 0.8233 | 17 | 0.0293 | 24 | 0.2934 |
4 | 0.9967 | 11 | 0.1974 | 18 | 0.1844 | 25 | 0.2609 |
5 | 0.0813 | 12 | 0.2061 | 19 | 0.0934 | ||
6 | 1.3950 | 13 | 0.4853 | 20 | 0.0292 | ||
7 | 0.6433 | 14 | 0.0799 | 21 | 1.2637 |
ID | Variance | ID | Variance | ID | Variance | ID | Variance |
---|---|---|---|---|---|---|---|
1 | 0.1977 | 8 | 0.1024 | 15 | 0.7519 | 22 | 0.3215 |
2 | 0.1045 | 9 | 0.1090 | 16 | 1.5833 | 23 | 0.1449 |
3 | 0.4042 | 10 | 0.5274 | 17 | 0.0293 | 24 | 0.2303 |
4 | 1.1250 | 11 | 0.1826 | 18 | 0.1844 | 25 | 0.3179 |
5 | 0.1009 | 12 | 0.2223 | 19 | 0.0934 | ||
6 | 1.1709 | 13 | 0.2794 | 20 | 0.0347 | ||
7 | 0.8414 | 14 | 0.1223 | 21 | 1.4748 |
analysis, and meanwhile considered the randomness caused by independent variables in the modeling. In the proposed approach, the numerical characteristics of random variables is decided by COA. Fuzzy regression analysis is then conducted to determine the fuzzy coefficients for all the terms of the
Approaches | Developed models |
---|---|
SR | y ˜ SR = 1.8487 − 0.0062 x 1 − 0.0557 x 2 + 0.5399 x 3 − 0.0385 x 4 |
TH | y ˜ TH = ( 1.9286 × 10 − 8 , − 0.0417,1.0065,6.5089 × 10 − 9 ) + ( 2.0434 × 10 − 8 , − 0.0254,1.0415,2.1557 × 10 − 8 ) x 1 + ( 1.8247 × 10 − 8 , − 0.1749,0.8416,9.7891 × 10 − 9 ) x 2 + ( 1.9099 × 10 − 8 ,0.0139,0.9926,6.8315 × 10 − 9 ) x 3 + ( 2.4991 × 10 − 8 ,0.3568,0.6446,4.3585 × 10 − 8 ) x 4 |
DM | y ˜ DM = ( 0.0044 , 1.8221 , 2.3881 , 0.0083 ) + ( 1.3791 × 10 − 5 , 0.0063 , 0.0068 , 1.0424 × 10 − 5 ) x 1 + ( 0.0010 , 0.0064 , 0.0069 , 0.0006 ) x 2 + ( 1.9002 × 10 − 6 , − 0.420391 , − 0.420390 , 1.9229 × 10 − 6 ) x 3 + ( 0.0016 , − 0.0632 , − 0.0148 , 0.0012 ) x 4 |
KC2 | y ˜ KC = 0.6673 − 0.1242 x 1 + 0.1517 x 2 + 0.8083 x 3 + 0.5304 x 4 + ( 0.2784,0.2852,0.9872,0.9972 ) |
JL | y ˜ JL = ( 0.00016 , 0.9514 , 0.9537 , 0.0016 ) + ( 3.4895 × 10 − 6 , − 0.0474 , − 0.0471 , 3.4895 × 10 − 6 ) x 1 + ( 0.0001 , 0.5746 , 0.5760 , 0.0001 ) x 2 + ( 0.0001 , 0.2136 , 0.2155 , 0.0001 ) x 3 + ( 0.0004 , 0.7414 , 0.7454 , 0.0001 ) x 4 |
RNP | y ˜ RNP = ( 0.3066 , 1.6750 , 2.1384 , 1.0213 ) + ( 6.8889 × 10 − 10 , − 0.007399967 , − 0.007399966 , 6.8890 × 10 − 10 ) x 1 + ( 0.0106 , − 0.1169 , − 0.1067 , 0.0106 ) x 2 + ( 2.7728 × 10 − 7 , 0.3528503 , 0.3528505 , 2.6929 × 10 − 7 ) x 3 + ( 6.6698 × 10 − 8 , − 0.30650488 , − 0.30650481 , 6.7342 × 10 − 8 ) x 4 |
RAP | y ˜ RAP = ( 4.0794 × 10 − 7 , 1.2521 , 1.9009 , 2.0041 ) + ( 9.8956 × 10 − 9 , − 0.01046018 , − 0.01046014 , 3.4631 × 10 − 8 ) x 1 + ( 3.1431 × 10 − 7 , 0.0188 , 0.1190 , 0.1158 ) x 2 + ( 3.5577 × 10 − 7 , 0.2115 , 0.2508 , 0.0335 ) x 3 + ( 1.1538 × 10 − 6 , − 0.76236 , − 0.76235 , 2.4335 × 10 − 6 ) x 4 |
Validation error | SR | TH | DM | KC | JL | RNP | RAP | RSP |
---|---|---|---|---|---|---|---|---|
MAPE | 35 | 1623 | 73 | 40 | 135 | 32 | 37 | *3 |
VoE | 121 | 1,845,500 | 5223 | 1669 | 18,542 | 1120 | 1369 | * |
1h = 0 for all approaches.
2In the KC model, (0:2784; 0:2852; 0:9872; 0:9972) is all coordinate points.
3RSP is infeasible for each 0 ≤ h ≤1.
proposed models. The generated model can address the fuzziness caused by human subjective judgement and the randomness caused by independent variables. The results from the stock rating case indicates that the proposed models have better performance than the others methods.
The authors declare no conflicts of interest regarding the publication of this paper.
Gao, N.N. and Lu, Q.J. (2018) Probabilistic Fuzzy Regression Approach from the Point of View Risk. Journal of Data Analysis and Information Processing, 6, 156-167. https://doi.org/10.4236/jdaip.2018.64010