Open Journal of Statistics
Vol.05 No.02(2015), Article ID:55569,8 pages
10.4236/ojs.2015.52013
A Note on the Precision of Stratified Systematic Sampling
Akeem O. Kareem1, Isaac O. Oshungade2, Gafar M. Oyeyemi2
1Institute for Security Studies, Abuja, Nigeria
2Department of Statistics, University of Ilorin, Ilorin, Nigeria
Email: keemkareem@yahoo.com, layiosungade@yahoo.com, matanmi@unilorin.edu.ng
Copyright © 2015 by authors and Scientific Research Publishing Inc.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/



Received 14 February 2015; accepted 3 April 2015; published 13 April 2015
ABSTRACT
Conflicting views had greeted the use of systematic sampling for sample selection and estimation in stratified sampling in terms of the precision of the population mean base on the inherent characteristics of the population. These conflicting views were analyzed using Cochran data (1977, p. 211) [1] . When the population units are ordered, variance of systematic sampling for all possible systematic samples provides equal, non-negative and most precise estimates for all the variance functions considered i.e.
, unlike when a single systematic sample is used and when variance of simple random sampling is used to estimate selected systematic samples.
Keywords:
Precision, Systematic Sampling, Stratified Systematic Sampling, Systematic Random Estimator

1. Introduction
Cochran (1977) [1] describes systematic sampling thus: suppose that N units in the population are numbered 1 to N in some order. To select a sample of n units, we take a unit at random from the first k units and every kth unit thereafter. The selection of the first kth units determined the whole sample. This is called an every kth systematic sample.
Murthy (1967) [2] states that systematic sampling is operationally more convenient and at the same time saves time while ensuring equal probability of inclusion of each unit in the sample. He describes technique of systematic sampling as consisting of selecting every kth unit starting with the unit corresponding to a number r chosen at random from 1 to k, where k is taken as the integer nearest to
. The random number r chosen from 1 to k is known as random start and the constant k is termed the sampling interval.
A sample selected by this procedure is termed a systematic sample with a random start r. Therefore, the value of r determines the whole sample. In other words, this procedure amounts to selecting with equal probability one of the k possible groups of units (samples) into which the population can be divided in a systematic manner.
Same view was expressed by Raj and Chandhok (1998) [3] . They described systematic sampling as a more convenient method of sample selection when the units were serially numbered from 1 to N with the assumption that N = nk, where n is the sample size desired, and k is an integer. A number is taken at random from the numbers 1 to k (using a table of random number/random number generator). Suppose the random number is i, then the sample contains n units with serial numbers
. Thus, the sample consists of the first unit selected at random and every kth unit thereafter. It is therefore called a 1-in-ksystematic sample.
Early studies on the development of theory of systematic sampling was as reported by Murthy (1967, p.134) [2] while Cochran (1977) [1] reported that Madow (1953) [4] had carried systematic sampling to its logical conclusion with his recommendation that a systematic sample be chosen at or near the center of the interval, i.e. instead of starting the sequence by a random number chosen between 1 and k, we take the starting number as
if k is odd and either
or
if k is even.
Guatschi (1957) [5] investigated the efficiency of single and multiple random start systematic sampling in population exhibiting different characteristics and reported that when the population was in random order
and
for a population with linear trend, while in a periodic population
equality results when
. He, however, concluded that, with an exponential correlelo-
gram, single random start
was more precise than multiple random starts
.
Murthy (1967) [2] , Cochran (1977) [1] , Raj and Chandhok (1998) [3] and Okafor (2002) [6] have all mentioned that systematic sampling can be looked into in another way in relation to cluster sampling. They explained that in a population with N = nk, the population can be divided into k large systematic sampling units each containing n of the original n units. The operation of choosing a randomly located systematic sample is just the operation of choosing one of these large sampling units at random. Thus, systematic sampling amounts to selecting of a simple random sample of one cluster unit from a population of k cluster units with probability
.
Thus for a population of Y units
divided into k possible clusters, the k possible samples with their means are as shown in Table 1 below.
Considering all the k possible samples, the sample mean
is obtained thus:
(1)
Showing that when N = nk, 

Table 1. Compositions of systematic samples of k clusters (such that N = nk).
Above is the applicable systematic sampling in a situation in which N = nk. In practice, it is common to encounter situations in which
2. Approaches When
2.1. Circular Systematic Sampling (CSS)
Lahiri (1952) [7] suggests the Circular Systematic Sampling (CSS) which consists of taking a random number from 1 to N and selecting the unit corresponding to this random start and every 


for
It implies from CSS, therefore, that the usual procedure of selecting a random start r from 1 to k and including in the sample the units corresponding to 

2.2. Murthy’s Approach
Murthy (1967) [2] suggested that when






Then, the units’ 
This approach is suitable in situations in which the sample size n is not fixed or predetermined and the sampler is free to adjust the sample to suit the above application. Therefore, Murthy’s approach to handle 
2.3. Fractional Interval Approach
Another approach when 





2.4. New Partially Systematic Sampling (NPSS)
Leu and Tsui (1996) [8] developed the New Partially Systematic Sampling (NPSS) in order to derive an unbiased estimator of the variance of systematic sampling


2.5. Remainder Linear Systematic Sampling (RLSS)
Also reviewed in this section is Remainder Linear Systematic Sampling (RLSS) due to Chang and Huang (2000) [9] . This procedure is a modification of the LSS. It is developed for situation when




a) Divide the population units into two strata with the first stratum containing the front 






b) From stratum II, random start 




Sample of size n is the combination of 

Therefore, in stratified systematic sampling when

3. Estimation Procedures in Systematic Sampling
Estimation of the population mean of a systematic sample over all possible samples is as given by relation (1). For the variance of the population mean, Murthy (1967) [2] , while assuming 




where
Equivalent to
It is simplified as
where 




Note that 
and
Therefore 


where

Other expressions for the estimation of variance of the mean of systematic samples by various authors are reported by Murthy (1967, Section 5.8, pp. 153-155) [2] and Cochran (1977, pp. 213-226) [1] . Cochran, however, remarked “that no dearth of formulae for the estimated variance, but all appeared to have limited applicability”.
On the efficiency of systematic sampling in relation to other sampling scheme, literature agreed that efficiency of systematic sampling was strongly anchored on the arrangement of the population units. Cochran (1977) [1] stated that it greatly depended on the properties of the population. For some population, systematic sampling is extremely precise and for others, SRS is more precise than systematic sampling, not even with increase in sample size n. Thus, it is difficult to give general advice about the situation in which systematic sampling is to be recommended. However, the knowledge of the population structure is necessary for its most effective use.
Same view was expressed by Murthy (1967) [2] , that a good arrangement of the population units may yield a better estimate while a bad arrangement may lead to inefficient estimate and therefore, warned that one had to be careful with the use of systematic sampling and to ensure first, that the existing arrangement did not lead to inefficient estimates before using it. One way suggested is to ensure that the units are arranged either in increasing or decreasing order and this directly suits our investigation in this study, since application of methods of strata construction requires that the population units be arranged in order of magnitude to avoid overlapping of units.
Cochran (1977) [1] stated that several formulae had been developed for

1) The variances of the mean of systematic sample given by Cochran are:

where

This can further be expressed as
which is the weighted variance over all possible systematic samples generated by random start



2) The second one is given as

where 
where the numerator is averaged overall 



The two expressions of 

3) The third is expressed in terms of variance of stratified random sample in which the strata are composed of the first k units, the second k units and so on.
The subscript j in 


where
This is the variance among units that lie in the same stratum. The divisor 


This quantity is the correlation between the deviations from the stratum means of pairs of units that are in the same systematic sample.

It implies therefore from relation (9) above that a systematic sample has the same precision as that of a stratified random sampling sample with one unit per stratum if

Thus, we have examined systematic sampling in terms of procedure and estimation process. But our concern is taking a systematic sample of fixed sample size n from each stratum for estimation purpose.
3.1. Estimation in Stratified Systematic Sampling
Much have been said in Section 2 on the significance of the arrangement of the population units on the precision



Notations
Cochran (1977, p. 91) [1] has stated that expressions for the mean and variance of stratified sampling applied generally to all classes of stratified sampling and are not restricted to stratified random sampling. Therefore, all notations in Cochran (1977, p. 90) [1] are also valid for stratified systematic sampling.
The subscript h denotes the stratum and i the unit within the stratum.
The subscript “sy” in this section denotes systematic sample.

is the mean of systematic sample in stratum h, equivalent to relation (1).

is the population mean of the stratified systematic sample.

is the variance of stratified systematic samples in stratum h when
Therefore, 






is the variance of the population mean of stratified systematic samples.

is the MSE of the population mean of stratified systematic samples.
The mean and the variance of RLSS are given below (see relation 2.2 and 2.3, p. 251 of Chang and Huang (2000) [9] ).


To suite our applications, expression (17) and (18) are modified as follows:


It should be noted that that expressions 


3.2. Empirical Investigation
Systematic samples are easy to draw and to execute but may not be simple in term of estimation as there are competing estimators. This drew our attention for an empirical investigation to ensure the right choice of estimator in the face of conflicting reports. Murthy (1967, section 5.8, p. 153) [2] stated that “it is not possible to estimate unbiasedly the variance of the population mean and total on the basis of a single sample, but it is possible to build up some biased but useful variance estimators on the basis of systematic samples”. Same view was expressed by Mendenhall et al. (1971, p. 151) [11] that “an unbiased estimate of 






Raj and Chandhok (1998) [3] stated that “when units are deliberately ordered, the formula for estimating variance of SRS will not apply to systematic sampling”. However, Cochran (1977, p. 227) [1] stated that if there were many strata, one systematic sample can be used in most of them.
In view of the above, the question is: should a single systematic sample be used to estimate 



Empirical investigation reveals that when we select a single systematic sample, the result is as shown in Table 2 below.
Since the efficiency of systematic sampling depends on the arrangement of the population units, an attempt is also made to rearrange Cochran (1977) [1] data (Table 8.3, p. 211), in order of magnitude; same sample of size 


while 

Table 3 below gives the estimates for single systematic samples when the units are arranged in order of magnitude before sample selection.
4. Conclusion
This analysis brings to the lime light the caution by Murthy (1967, p. 145) [2] in the application of systematic sampling that “one has to be careful in using systematic sampling and should at least ensure that the existing arrangement do not lead to inefficient estimates”. From the empirical investigation, it could be observed that when population units are arranged in order of magnitude, a more precise estimate is obtained for 





Table 2. Variance of single systematic samples using Cochran’s data.
*In Table 2, g5 indicates the center for systematic sample estimates when Madow’s procedure is used while the subscript i = 1, ・・・, k = 10 is the random start in the interval 1 to 10.
Table 3. Variance of single systematic samples using Cochran’s data when sampling units are arranged in order of magnitude.
*In Table 3, g5 indicates the center for systematic sample estimates when Madow’s procedure is used while the subscript i = 1, ・・・, k = 10 is the random start in the interval 1 to 10.
shown in Table 2 and Table 3 above. Therefore, when systematic sampling is the choice design within strata, estimates for all possible systematic samples should be used and the sampling units arranged in order of magnitude within the stratum. Kareem et al. (2015) [10] used this procedure and reported higher efficiency of systematic sampling within stratum over the popularly used SRS. It is hereby recommended that 



References
- Cochran, W.G. (1977) Sampling Techniques. 3rd Edition, John Wiley and Sons, New York.
- Murthy, M.N. (1967) Sampling Theory and Methods. 2nd Edition, Statistical Publishing Society, Calcutta.
- Raj, D. and Chandhok, P. (1998) Sample Survey Theory. Narosa Publishing House, New Delhi.
- Madow, W.G. (1953) On the Theory of Systematic Sampling III. Annals of Mathematical Statistics, 24, 101-106. http://dx.doi.org/10.1214/aoms/1177729087
- Gautschi, W. (1957) Some Remarks on Systematic Sampling. Annals of Mathematical Statistics, 28, 385-394. http://dx.doi.org/10.1214/aoms/1177706966
- Okafor, F.C. (2002) Sample Survey Theory with Applications. Afro-Orbis Publications Ltd., Nsukka.
- Lahiri, D.B. (1952) NSE Instruction to Field Workers. See Murthy (1967, p. 140).
- Leu, C.-H. and Tsui, K.-W. (1996) New Partially Systematic Sampling. Statistica Sinica, 6, 617-630.
- Chang, H.J. and Huang, K.C. (2000) Reminder Linear Systematic Sampling Sankya. The Indian Journal of Statistics, 62 (Series B), 249-256.
- Kareem, A.O, Oyeyemi, G.M. and Adewara, A.A (2015) On the Choice of an Efficient Sampling Scheme within Strata ICASTOR. Indian Journal of Mathematical Science, 9. (In Press)
- Mendehall, W., Ott, L. and Scheafffer, R.L. (1971) Elementary Survey Sampling. Duxbary Press, Belmont.

















