OJSOpen Journal of Statistics2161-718XScientific Research Publishing10.4236/ojs.2016.63038OJS-67321ArticlesPhysics&Mathematics An Efficient Class of Estimators for the Finite Population Mean in Ranked Set Sampling LakhkarKhan1JavidShabbir2Department of Statistics Government College Toru, Khyber Pukhtunkhwa, PakistanDepartment of Statistics, Quaid-i-Azam University, Islamabad, Pakistan08062016060342643514 February 2016accepted 11 June 14 June 2016© Copyright 2014 by authors and Scientific Research Publishing Inc. 2014This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/

In this paper, we propose a class of estimators for estimating the finite population mean of the study variable under Ranked Set Sampling (RSS) when population mean of the auxiliary variable is known. The bias and Mean Squared Error (MSE) of the proposed class of estimators are obtained to first degree of approximation. It is identified that the proposed class of estimators is more efficient as compared to  estimator and several other estimators. A simulation study is carried out to judge the performances of the estimators.

Ranked Set Sampling Auxiliary Variable Bias Mean Squared Error Relative Efficiency
1. Introduction

The problem of estimation in the finite population mean has been widely considered by many authors in different sampling designs. In application, there may be a situation when the variable of interest cannot be measured easily or is very expensive to do so, but it can be ranked easily at no cost or at very little cost. In view of this situation,  introduced the Ranked Set Sampling (RSS) procedure.  proved the mathematical theory that the sample mean under RSS was an unbiased estimator of the finite population mean and more precise than the sample mean estimator under simple random sampling (SRS).

The auxiliary information plays an important role in increasing efficiency of the estimator.  suggested an estimator for population ratio in RSS and showed that it had less variance as compared to usual ratio estimator in simple random sampling (SRS).

In RSS, perfect ranking of elements was considered by  and  for estimation of population mean. In some situations, ranking may not be perfect. According to  , the sample mean in RSS is an unbiased estimator of the population mean regardless of errors in ranking of the elements. In  , the ranking of elements was done on basis of the auxiliary variable instead of judgment.  suggested an estimator for population mean and ranking of the elements was observed on basis of the auxiliary variable.  had suggested a class of Hartley-Ross type unbiased estimators in RSS.  had also proposed unbiased estimators in RSS and stratified ranked set sampling.

In this paper, we suggest a class of estimators for the population mean, using known population mean of the auxiliary variable in RSS. It is shown that the proposed class of estimators outperforms as compared to the  ,  and several other estimators. Also some special cases of the proposed class are considered in Table A1 (Appendix).

2. Ranked Set Sampling Procedure

In ranked set sampling (RSS), we select m random samples, each of size m units from the population, and rank the units within each sample with respect to a variable of interest. In order to facilitate the ranking, the design parameter m, is chosen to be small. From the first sample the unit having the lowest rank is selected, from the second sample the unit having second lowest rank is selected and the process is continued until from the last sample the unit having the highest rank is selected. In this way, we obtain m measured units, one from each sample. The cycle may be repeated r times until units have been measured. These units form the RSS data.

Suppose that the variable of interest Y is difficult to measure and to rank, but there is the auxiliary variable X, which is correlated with Y. The variable X may be used to obtain the rank of Y. To perform the sampling procedure, m bivariate random samples, each of size m units are drawn from the population then each sample is ranked with respect to one of the variables Y or X. Here, we assume that the perfect ranking is done on basis of the auxiliary variable X while the ranking of Y is with error. An actual measurement from the first sample is then taken of the unit with the smallest rank of X, together with the variable Y associated with the smallest rank of X. From the second sample of size m the Y associated with the second smallest rank of X is measured. The process is continued until from the mth sample, the Y associated with the highest rank of X is measured. The cycle is repeated r times until bivariate units have been measured out of the total selected units.

3. Some Existing Estimators and Notations

We consider a situation when rank the elements on the auxiliary variable. Let be the ith judgment ordering in the ith set for the study variable Y based on the ith order statistics of the ith set of the auxiliary variable X at the jth cycle. Based on RSS, the sample mean estimator of the population mean, is given by

where.

To obtain the bias and of estimators, we define:

such that

,

and

, , ,

where

, , ,. and are the

coefficients of variation of Y and X respectively. It also be noted that the values of and are the means of ith order statistics from some specific distributions (see  ).

The variance of under RSS scheme, is given by

 proposed an estimator of the population ratio under RSS as:

When population mean () of the auxiliary variable (X) is known, and the variables Y and X are positively correlated,  proposed the ratio estimator for population mean () based on RSS as

The bias and MSE of, up to the first degree of approximation, are given by

and

When population mean () of the auxiliary variable (X) is known, and the variables Y and X are negatively correlated, then the product estimator based on RSS is defined as:

The bias and MSE of, up to the first degree of approximation, are given by

and

 suggested an estimator under RSS and is defined as:

where is suitably chosen constant.

The minimum bias and MSE of at optimum value of i.e.

are given by

and

The difference-type estimator for population mean () based on RSS, is given by

where d is a constant.

The minimum variance of at optimum value of d i.e.

is given by

Following  ,  suggested a class of estimators of the population mean (), based on RSS as:

where is a suitably chosen constant, a and b are either real numbers or functions of known parameters of the auxiliary variable X, g is a scalar which takes value of 1 (for generating ratio-type estimators) and (for generating product-type estimators) and are constants whose sum need not be unity.

The bias of, is given by

The MSE of, to first degree of approximation, is given by

where

We discuss two cases.

Case 1: Sum of weights is unity (i.e.).

Solving (17), the optimum value of, is given by

Substituting in (17), we get the minimum MSE of, given by

Case 2: Sum of weights is flexible (i.e.).

Solving (17), the optimum values of and are given by

and

Substituting the optimum values of and in (17), we get

4. Proposed Class of Estimators

Following  and  , we propose a class of estimators of the population mean (), under RSS as

where is a suitably chosen constant, a and b are either real numbers or the functions of known parameters of the auxiliary variable X and are constants whose sum need not be unity. From (20) we can generate a large number of estimators for the different values of the constants (Table A1 in Appendix). The proposed estimator can be written in terms of and as

where.

Solving (21), we have

Taking expectation of both sides of above equation, we get bias of, given by

Squaring both sides of Equation (22) and ignoring higher order terms of e’s, we have

Taking expectation of both sides of above equation, we obtain the MSE of as given by

where

We discuss two cases.

Case 1: Sum of weights is unity (i.e.).

The optimum value of, is given by

Thus, the minimum MSE of, is given by

Case 2: Sum of weights is flexible (i.e.).

For, the MSE of in Equation (24) is minimized for

and

Substituting the optimum values of and in (24), we get

Note: It is difficult to make the theoretical comparison due to complexity, therefore we adopt the numerical study.

5. Simulation Study

We use the same data set as earlier used by  , and perform some simulation study to investigate the per- formances of the estimators.

Population (source:  ).

Y = Number of acres devoted to farms during 1992 (ACRES92).

X = Number of large farms during 1992 (LARGEF92).

We set and to select a sample of units from the population of size. To compute the values of, and by simulation, we explain our simulation methodology as follow.

Here, and can be written as

and

where

To find the possible values of the ratio for, we generate and calculate, , , , and. It means that when the first smallest value is selected from the ranked set sample, the expected ratio of that value to the population mean could be close to 0.25, and when the second smallest value is selected the ratio of that value to the population mean could be close to 0.50, and when the third smallest value is selected the expected ratio of that value to the population mean will close to 1. Similarly, the expected ratio of the fourth and fifth values could be close to 1.25 and 1.75 respectively. In each case we weighted error term with a small number 0.08 to make sure that the ratio remains positive. In other words, it means that we are generating. Thus, the possible values of the ratio are expected to remain close to those we are considering here. Similarly, for the possible values of the ratio, we consider , , , , and, where. Here we weighted with a small number 0.05 because it may be less risky to rank the auxiliary variable X than the study variable Y. Thus the values of, , and are obtained through this simulation and are represented in the last three columns of Table 1.

PREs of proposed class of estimators through simulation
abg
−1.5−1.5−10.1140.6103.2160.9161.4153.2164.50.005730.005740.00573
−1.5−1.5−10.5139.3103.2159.9160.5163.8164.40.005900.006040.00596
−1.5−1.5−10.9148.1103.4167.1167.5165.5171.80.004620.004040.00431
−1.5−1.510.1144.5103.3164.1164.8157.3167.80.005160.004850.00499
−1.5−1.510.5132.4103.0154.5156.8157.5158.70.006890.007640.00725
−1.5−1.510.9144.6103.3164.2168.6163.4168.80.005140.004820.00497
−1.50−10.1136.9103.1158.0158.6148.0161.50.006250.006580.00641
−1.50−10.5137.6103.1158.6159.2162.0163.00.006150.006420.00628
−1.50−10.9142.5103.3162.4162.9162.7167.00.005460.005300.00538
−1.5010.1130.1103.0152.8153.7141.0156.10.005200.008160.00766
−1.5010.5140.9103.2161.2163.5164.9165.70.005680.005670.00567
−1.5010.9137.2103.1158.3162.7159.5162.90.006200.006510.00635
−1.51.5−10.1140.8103.2161.1161.6150.5164.80.005690.005700.00569
−1.51.5−10.5140.3103.2160.7161.2164.1165.30.005760.005820.00578
−1.51.5−10.9135.2103.1156.7157.3158.7161.10.006490.006970.00673
−1.51.510.1138.2103.2159.1159.9147.8162.70.006050.006290.00616
−1.51.510.5139.2103.2159.8162.2163.0164.30.005920.006020.00598
−1.51.510.9143.3103.3163.1168.0163.9168.20.005330.005130.00522
1.5−1.5−10.1133.4103.1155.4156.0142.9158.80.006720.007430.00706
1.5−1.5−10.5140.8103.2161.1161.6164.5165.70.005690.005780.00576
1.5−1.5−10.9140.3103.2160.8161.3162.0165.40.005750.005780.00576
1.5−1.510.1142.3103.2162.4163.1152.1166.10.005460.005400.00541
1.5−1.510.5145.3103.3164.7167.1168.7169.50.005040.004670.00484
1.5−1.510.9139.1103.2159.9164.3161.1164.60.005920.006050.00598
1.50−10.1133.4103.0155.4156.0144.4158.80.006720.007430.00658
1.50−10.5140.8103.2161.1161.6164.8165.60.005690.005680.00566
1.50−10.9145.9103.3165.2165.6164.8169.60.004960.004530.00473
1.5010.1142.3103.3162.4163.1153.6166.10.005450.005400.00540
1.5010.5141.6103.2161.8164.1165.6166.30.005570.005510.00553
1.5010.9140.3103.2160.7165.1161.4165.30.005760.005820.00578
1.51.5−10.1139.2103.2159.9160.4151.8163.50.005910.006050.00597
1.51.5−10.5133.0103.0155.1155.7158.1159.20.006790.007490.00713
1.51.5−10.9137.3103.1158.4158.9159.0162.70.006190.006500.00634
1.51.510.1141.7103.2161.9162.4154.4165.50.005550.005510.00520
1.51.510.5142.3103.3162.3164.6166.4166.80.005480.005340.00540
1.51.510.9135.2103.1156.8161.0157.8161.20.006480.007010.00672

We investigate the percentage relative efficiency (PRE) of ratio estimator (say), the Searls estimator, the difference estimator,  estimator when with respect to conventional estimator (say). We also calculate PRE of the proposed class of estimators, say, when and when, say, , with respect to. The PRE of our proposed estimator and other existing estimators, , with respect to con- ventional estimator, is defined as

The PREs of our proposed estimator and other existing estimators with respect to conventional estimator are given in Table 1.

6. Conclusions

Since and are the fixed constants in  estimator and in the proposed class of estimators. There can be a large number of combinations for different values of these constants. Here, only limited number of results are reported in Table 1. Obviously, it can be observed through the simulation study in Table 1, that the proposed class of estimators is more efficient than all considered estimators. Its PRE increases from 164.5 to 171.8 when changes from 0.1 to 0.9 but decreases slightly when is close to 0.5. Generally, we can say PRE of proposed class increases as value of increases for fixed values of constants a, b and g  . Class of estimators has maximum PRE 167.5, but it is less efficient as compared to the proposed class of estimators for all the choices of constants reported in Table 1. Also from the Table 1, we can see that other competitor estimators are also less efficient than the proposed class of estimators. If we make comparison between the two proposed cases then the class of estimators in Case 2 is more precise than the Case 1. We can see from Table 1 that by fixing the values of a and b at, the proposed classes of estimators give more precise results when the value of is away form, either close to 0 or 1. While by fixing positive values of the constants a and b, we get more precise results for close to 0.5.

Therefore, the proposed class of estimators can be preferred over its competitive estimators in application under RSS.

Acknowledgements

The authors wish to thank the editor and the anonymous referees for their suggestions which led to improvement in the earlier version of the manuscript.

Cite this paper

Lakhkar Khan,Javid Shabbir, (2016) An Efficient Class of Estimators for the Finite Population Mean in Ranked Set Sampling. Open Journal of Statistics,06,426-435. doi: 10.4236/ojs.2016.63038

Appendix Some special cases of the proposed class of estimators
EstimatorRemarks