Open Journal of Statistics
Vol.05 No.04(2015), Article ID:56539,8 pages
10.4236/ojs.2015.54026
On Discordance Tests for the Wrapped Cauchy Distribution
A. H. Abuzaid1, M. M. El-hanjouri2, M. M. Kulab2
1Department of Mathematics, Faculty of Science, Al-Azhar University-Gaza, Gaza, Palestine
2Department of Applied Statistics, Faculty of Economics and Administrative Sciences, Al-Azhar University-Gaza, Gaza, Palestine
Email: alizaid33@yahoo.com, moamin2000@hotmail.com, monebmostafa@gmail.com
Copyright © 2015 by authors and Scientific Research Publishing Inc.
This work is licensed under the Creative Commons Attribution International License (CC BY).
http://creativecommons.org/licenses/by/4.0/



Received 28 March 2015; accepted 17 May 2015; published 22 May 2015
ABSTRACT
Circular data as any other types of data are subjected to contamination with some unexpected observations which are known outliers. In this paper, four tests of discordancy for circular data based on M, C, D, and A statistics are extended to the wrapped Cauchy distribution to detect possible outliers. The cut-off points and the power of performances are investigated via extensive simulation study. Results show that tests perform better as the concentration of the samples is increased. Two real circular data sets are analysed for illustration.
Keywords:
Arc Length, Circular Distance, Outlier, Wrapped Normal Distribution

1. Introduction
Circular data refer to a set of observations measured by angles and distributed within
radians and it can be presented on the circumference of a unit circle. Circular data need special statistical methods to be described and modeled rather than the conventional linear techniques. Circular data can be found whenever periodic phenomena occur; it is the source of interest to scientists in many fields, including: biology, meteorology, physics, psychology, image analysis, medicine, astronomy, social sciences and earth sciences, see [1] . The existence of outliers is considered as one of the most common problems in statistical analysis. This can be extended to circular data due to the expected influence of outliers on the parameters estimates. Outliers in the context of circular data would be defined as a set of observations which is inconsistent with the rest of the sample. It is expected to lie far from the mean direction of the circular sample. Despite this, there are only a few numerical and graphical tests of discordancy in circular samples. The problem of outliers in different types of circular data including univariate samples, regression, functional relationship models and circular time series are addressed by several authors (see [2] -[7] ).
The rest of this paper is organized as follows: Section 2 describes the properties of the wrapped Cauchy distribution. Section 3 presents four discordance tests to detect possible outliers in circular univariate data. In Section 4, the cut-off points for tests are obtained based on samples generated from the wrapped Cauchy distribution. The power of performances is investigated via simulation studies in Section 5. Lastly, we apply the statistics on two real data sets for illustration in Section 6.
2. Wrapped Cauchy Distribution
A circular random variable
can be obtained from any random variable on the real line X with probability density function
, and distribution function
by defining
.
That’s mean wrapping the original distribution on the real line around the circle to get the wrapped distribution. The Cauchy distribution on the real line with the density
(1)
where
and
are the mean and standard deviation, respectively. Once we wrapped the
around the circle, then we get to the wrapped Cauchy distribution with probability density function denoted by
and given by:
(2)
where
is the mean direction and
is the concentration parameter that is called the mean resultant length. Then, the distribution function of the wrapped Cauchy is given by:

Reference [8] introduced the wrapped Cauchy distribution, and [9] illustrated that the wrapped Cauchy distribution can be obtained by mapping Cauchy distribution on to the circle by the transformation
. Reference [10] quantified the dispersion measure
for the wrapped Cauchy distribution by a concentration
parameter




and as 

The 





3. Discordance Tests for Circular Samples
Suppose that we are given angles 


3.1. M Statistic
The statistic was proposed by [12] and given in the following formulation, 






where
3.2. C Statistic
It was proposed by [2] , and given by

where 

3.3. D Statistic
It was derived based on the relative arc lengths between the ordered observations of a circular sample where








to the greatest arc containing a single observation




3.4. A Statistic
Reference [13] defined the circular distance between two angles 


proposed a new test based on the summation of all circular distances
from the point of interest 



Furthermore, the approximated distribution of the A statistic was discussed in [15] .
For the mentioned four tests of discordancy the cut-off points at three percentiles 10%, 5% and 1% are obtained based on simulation studies for samples generated from von Mises distribution with various sample sizes and concentration parameters, and also for the wrapped normal distribution (see [5] ). The values of statistics are then compared with the associated cut-off points, if the value of statistics is greater than the cut-off point, then the null hypothesis is rejected and the observation is labeled as an outlier.
4. Cut-Off Points of the Discordance Tests
In this section, we obtain the cut-off points for the four test statistics based on simulation studies. The percentage points of the null distribution of free outliers in the generated random circular samples from the wrapped Cauchy distribution, with mean direction zero and concentration parameter


For each combination of the sample size n and concentration parameter
Table 1. The 5th percentile cut-off points for the test based on the M statistic.
Table 2. The 5th percentile cut-off points for the test based on the C statistic.
Table 3. The 5th percentile cut-off points for the test based on the D statistic.
Table 4. The 5th percentile cut-off points for the test based on the A statistic.
present part of the cut-off points at 5% percentiles. The comprehensive cut-off points are available upon request from the authors. From the obtained cut-off points we notice that:
Firstly, as one would expect, there are an inverse relationship between the cut-off points and the level of percentiles. Secondly, for M statistic the increase of the concentration parameter 









5. Performance of the Discordance Tests
The power of performance of discordancy tests can be evaluated via several measures. References [16] [17] stated that a good test of discordancy should have: 1) a high power function; 



To study the performances of the four discordancy tests, we use 3000 samples based on different sizes n and concentration parameter





Figure 1 illustrates the behavior of power of performances of the tests for different cases. The main results can be summarized as follows:
Firstly, the performance for all statistics increases when we increase the contamination value 






6. Real Data Analysis
For illustration purposes, two real data sets following the wrapped Cauchy distribution are considered to be analyzed, and to apply the proposed tests of discordancy to illustrate their performance in real data as given in the following subsections.
6.1. The Ants’ Direction Data
Reference [10] randomly selected the directions chosen by 100 ants toward a black target when they are released in a round arena as a part from a study conducted by [18] . The wrapped Cauchy distribution has been shown to be the best distribution for the data [19] . The estimates of location parameters, namely circular mean and median are 183˚ and 180˚, respectively. Which are close to each other and reflects the symmetry of the data distribution. Two measures of dispersion inform that the data are moderately concentrated, where the estimates of mean resultant length and concentration parameter are 0.61 and 0.65 respectively.
Table 5 gives the actual values of each test statistics, the corresponding cut-off points for 

Figure 1. Relative performances of discordancy tests for wrapped Cauchy distribution.
Table 5. Results of discordancy tests on ants’ direction data.
6.2. Wind Data
It consists of the wind direction at 6 a.m. and 12 noon were measured each day at the weather station in Milwaukee for 21 consecutive days. Reference [20] proposed a circular-circular regression model with error follow the wrapped Cauchy distribution. The curve is expressed as a form of the Mȍbius circle transformation. As an example, [20] used their model for regressing this data at 12 noon on that at 6 a.m. The maximum likelihood estimates of the parameters are 

The circular mean and median of circular error is very close to zero (−0.04) and 0.031, respectively, and the estimate of the mean resultant length and concentration parameter are 0.552 and 0.773 respectively. Reference [20] considered observations number 5, 7, 12, 17 and 20 as outliers without using any discordance test, and they stated that “Apart from five outliers, the proposed model seems to provide a satisfactory fit to the data”. We have implemented four discordancy tests M, C, D, and A to test whether the suspected five observations are outliers or not.
Table 6 presents the actual values of the discordancy test statistics, their corresponding cut-off point and the decision, for 

In order to detect any other outliers, observation number 5 is excluded and the descriptive statistics are re-estimated, the mean of circular error is −0.015 which gets closer to zero and the estimates of the mean resultant length and concentration parameter are 0.62 and 0.8 respectively. Then, the four tests of discordancy are obtained as given in the second iteration in Table 6, for 
Figure 2. Circular plot of circular error of the wind data.
Table 6. Results of discordancy tests on wind data.
7. Conclusion
In this paper four tests of discordancy M, C, D and A were extended for the wrapped Cauchy distribution; the cut-off points and the power of performances were investigated via extensive simulation study. It was noticed that for any sample size, all considered discordancy tests at moderate or less concentration parameter (
References
- Jammalamadaka, S.R. and Sengupta, A. (2001) Topics in Circular Statistics. World Scientific Press, Singapore.
- Collett, D. (1980) Outliers in Circular Data. Applied Statistics, 29, 50-57. http://dx.doi.org/10.2307/2346410
- Hussin, A.G., Abuzaid, A., Zulkifili, F. and Mohamed, I. (2010) Asymptotic Covariance ad Detection of Influential Observations in a Linear Functional Relationship Model for Circular Data with Application to the Measurements of Wind Directions. Science Asia, 36, 249-253. http://dx.doi.org/10.2306/scienceasia1513-1874.2010.36.249
- Abuzaid, A.H., Mohamed, I.B. and Hussin, A.G. (2012) Boxplot for Circular Variables. Computational Statistics, 27, 381-392. http://dx.doi.org/10.1007/s00180-011-0261-5
- Rambli, A., Mohamed, I., Hussin, A.G. and Ibrahim, S. (2012) On Discordance Test for the Wrapped Normal Data, Sains Malaysiana, 41, 769-778.
- Ibrahim, S., Rambli, A., Hussin, A.G. and Mohamed, I. (2013) Outlier Detection in a Circular Regression Model Using COVRATIO Statistic. Communications in Statistics - Simulation and Computation, 42, 2272-2280. http://dx.doi.org/10.1080/03610918.2012.697239
- Abuzaid, A.H., Mohamed, I.B. and Hussin, A.G. (2014) Procedures for Outlier Detection in Circular Time Series Models. Environmental and Ecological Statistics, 21, 793-809. http://dx.doi.org/10.1007/s10651-014-0281-8
- Lévy, P. (1939) L’addition des variables aléatoires définies sur une circonférence. Bulletin de la Société Mathématique de France, 67, 1-41. http://www.numdam.org/item?id=BSMF_1939__67__1_0
- McCullagh, P. (1996) Mȍbius Transformation and Cauchy Parameter Estimation. Annals of Statistics, 24, 787-808. http://dx.doi.org/10.1214/aos/1032894465
- Fisher, N.I. (1993) Statistical Analysis of Circular Data. Cambridge University Press, London. http://dx.doi.org/10.1017/cbo9780511564345
- Mardia, K.V. and Jupp, P.E. (2000) Directional Statistics. John Wiley & Sons, London.
- Mardia, K.V. (1975) Statistics of Directional Data. Journal of the Royal Statistical Society, Series B, 37, 349-393.
- Rao, J.S. (1969) Some Contributions to the Analysis of Circular Data. Ph.D. Thesis, Indian Statistical Institute, Calcutta.
- Abuzaid, A.H., Mohamed, I.B. and Hussin, A.G. (2009) A New Test of Discordancy in Circular Data. Communications in Statistics―Simulation and Computation, 38, 682-691. http://dx.doi.org/10.1080/03610910802627048
- Abuzaid, A.H., Hussin, A.G., Rambli, A. and Mohamed, I.B (2012) Statistics for a New Test of Discordance in Circular Data. Communications in Statistics―Simulation and Computation, 41, 1882-1890. http://dx.doi.org/10.1080/03610918.2011.624239
- David, H.A. (1970) Order Statistics. Wiley, New York and London.
- Barnett, V. and Lewis, T. (1984) Outliers in Statistical Data. 2nd Edition, John Wiley & Sons, Chichester.
- Jander, R. (1957) Die optische Richtungsorientierung der Roten Waldameise (Formica rufa L.) Zeitschrift tiir vergMehende Physiologie Bd, 40, 162-238.
- Ravindran, P. and Ghosh, S.K. (2012) Bayesian Analysis of Circular Data Using Wrapped Distributions. Journal of Statistical Theory and Practice, 5, 547-561. http://dx.doi.org/10.1080/15598608.2011.10483731
- Kato, S., Shimizu, K. and Shieh, G.S. (2008) A Circular-Circular Regression Model. Statistica Sinica, 18, 633-643.





