﻿Handwritten Character Recognition Using Multiresolution Technique and Euclidean Distance Metric

Journal of Signal and Information Processing
Vol. 3  No. 2 (2012) , Article ID: 19570 , 7 pages DOI:10.4236/jsip.2012.32028

Handwritten Character Recognition Using Multiresolution Technique and Euclidean Distance Metric

Dileep Kumar Patel1, Tanmoy Som1, Sushil Kumar Yadav2, Manoj Kumar Singh3

1Department of Applied Mathematics, Institute of Technology, Banaras Hindu University, Varanasi, India; 2Department of Statistics, Faculty of Science, Banaras Hindu University, Varanasi, India; 3DST Centre for Interdisciplinary Mathematical Sciences, Faculty of Science, Banaras Hindu University, Varanasi, India.

Email: {dkpatel.bhu, sush.stats}@gmail.com, tsom.apm@itbhu.ac.in, mks_kjist@yahoo.co.in

Received July 28th, 2011; revised January 18th, 2012; accepted March 21st, 2012

Keywords: Discrete Wavelet Transform; Euclidean Distance Metric; Feature Extraction; Handwritten Character Recognition; Bounding Box; Mean Vector

ABSTRACT

In the present paper, the problem of handwritten character recognition has been tackled with multiresolution technique using Discrete wavelet transform (DWT) and Euclidean distance metric (EDM). The technique has been tested and found to be more accurate and faster. Characters is classified into 26 pattern classes based on appropriate properties. Features of the handwritten character images are extracted by DWT used with appropriate level of multiresolution technique, and then each pattern class is characterized by a mean vector. Distances from input pattern vector to all the mean vectors are computed by EDM. Minimum distance determines the class membership of input pattern vector. The proposed method provides good recognition accuracy of 90% for handwritten characters even with fewer samples.

1. Introduction

The shape variation of handwritten characters causes the misclassification, therefore multiresolution of handwritten characters is important for the correct recognition. Using multiresolution we can reduce the size of characters without losing the basic characteristics of characters, therefore more accuracy and better recognition rate can be achieved using multiresolution. The extensive applications of Handwritten Character Recognition (HCR) in recognizing the characters in bank checks and car plates etc. have caused the development of various new HCR systems such as optical character recognition (OCR) system. There are so many techniques of pattern recognition such as template matching, neural networks, syntactical analysis, wavelet theory, hidden Markov models, Bayesian theory and minimum distance classifiers etc. These techniques have been explored to develop robust HCR systems for different languages such as English (Numeral) [1-3], Farsi [4], Chinese (Kanji) [5,6], Hangul (Korean) scripts [7], Arabic script [8] and also for some Indian languages like Devnagari [9], Bengali [10], Telugu [11- 13] and Gujarati [14]. HCR is an area of pattern recognition process that has been the subject of considerable research during the last few decades. Machine simulation of human functions has been a very challenging research field since the advent of digital computers [15]. The ultimate objective of any HCR system is to simulate the human reading capabilities so that the computer can read, understand, edit and do similar activities as human do with the text. Mostly, English language is used all over the world for the communication purpose, also in many Indian offices such as railways, passport, income tax, sales tax, defense and public sector undertakings such as bank, insurance, court, economic centers, and educational institutions etc. A lot of works of handwritten English character recognition have been published but still minimum training time and high recognition accuracy of handwritten English character recognition is an open problem. Therefore, it is of great importance to develop automatic handwritten character recognition system for English language. In this paper, efforts have been made to develop automatic handwritten character recognition system for English language with high recognition accuracy and minimum classification time. Handwritten character recognition is a challenging problem in pattern recognition area. The difficulty is mainly caused by the large variations of individual writing style. To get high recognition accuracy and minimum classification time for HCR, we have applied multiresolution technique using DWT [16,17] and EDM [18,19]. Experimental results show that the proposed method used in this paper for handwritten English character recognition is giving high recognition accuracy and minimum classification time. In what follows we briefly describe the different techniques used in our paper.

1.1. Multiresolution

Representation of images in various degrees of resolution is known as multiresolution process. “Inmultiresolution process, images are subdivided successively intosmallerregions [16,17]. Wavelets are used as the foundation of multiresolution process. In 1987, wavelets are first shown to be the foundation of a powerful new approach to signal processing and analysis called multiresolution theory [17]. Multiresolution theory is concerned with the representation and analysis of images at more than one resolution”. We use this technique in HCR system.

1.2. Euclidean Distance Metric

The EDM is significantly simplified under the following assumptions [18,19]:

• The classes are equiprobable.

• The data in all classes follow Gaussian distributions.

• The covariance matrix is the same for all classes.

• The covariance matrix is diagonal and all elements across the diagonal are equal.

Given an unknown pattern vector x, assign it to class if

EDM is often used, even if we know that previously stated assumptions are not valid, because of its simplicity. It assigns a pattern to the class whose mean is closed to it with respect to the Euclidean norm [20].

2. Methodology of the Proposed HCR System

This paper is divided into four sections. In Section 2.1, samples of handwritten characters are collected. In Section 2.2, some preprocessing steps are performed on the collected samples of handwritten characters. In Section 2.3, pattern vectors are generated using multiresolution technique by applying DWT on preprocessed handwritten characters. In Section 2.4, mean vectors are computed of each class and then EDM is used to compute distances from input pattern vector to all the mean vectors and input handwritten character is recognized by the minimum distance obtained in Section 2.4. A complete flowchart of handwritten English character recognition system is given in Figure 1 [21-23].

2.1. Data Collection

First of all, the data of English characters is collected in the written form on blank papers by people of different age groups. These characters are written by different blue/black ball point pens. The collected samples of handwritten characters are scanned by scanner into JPEG format on 600 ppi [24]. Then all the characters are separated and resized into 100 by 100 pixel images. The example of the samples of handwritten characters is given below (Figures 2 and 3).

2.2. Preprocessing

The separated RGB character images of 100 by 100 pixel resolution obtained in the Section 2.1 are converted into grayscale images and then the grayscale images is again converted into binary images using appropriate grayscalethresholding. Now, binary image is thinned using skeletonization infinite times. Edges of these thinned images are detected using appropriate thresholding and then further dilatedusing appropriate structure element [16,20,25]. These steps are known as preprocessing and preprocessing steps are given in Figure 4.

2.3. Feature Extraction

In this section, multiresolution technique is applied on the dilated images obtained in Section 2.2 and this is achieved by applying DWT [26,27]. In this section, pattern vectors are generated [28-30]. DWT has given good results in different image processing applications. It has excellent spatial localization and good frequency localization properties that makes it an efficient tool for image analysis. There are different multiresolution techniques such as image pyramids, subband coding and DWT. We have used DWT in this paper. DWT maps a function of a continuous variable into a sequence of coefficients. If the function being expanded is a sequence of numbers, like samples of a continuous function f(x, y), the resulting coefficients are called the DWT of f(x, y) [16,17]. DWT of an image f(x, y) of size MN is defined as

(1)

(2)

for and

(3)

where, is an arbitrary starting scale and

(4)

Figure 1. A flowchart of HCR system.

Figure 2. Sample of characters written by 10 different persons.

100 x 100         100 x 100        100 x 100       100 x 100     100 x 100

Figure 3. Some separated characters from the sample in Figure 1 and resized into 100 by 100 pixel images.

Figure 4. Results of preprocessing steps applying on a handwritten character.

and

(5)

where index i identifies the directional wavelets that assumes the values H, V and D. Wavelets measure functional variations such as intensity or gray-level variations for images along different directions. Directional wavelets are and. measures variations along columns (like horizontal edges), measures variations along rows (like vertical edges) and measures variations along diagonals [16,17]. Equations (4) and (5) define the scaled and translated basis functions.

f(x, y), and are functions of the discrete variables x = 0, 1, 2, 3, , M – 1 and y = 0, 1, 2, 3, , N – 1. The coefficients defined in Equations (1) and (2) are usually called approximation and detail coefficients, respectively. coefficients define an approximation of f(x, y) at scale. coefficients add horizontal, vertical and diagonal details for scales. We normally let = 0 and select N

= M = so that j = 0, 1, 2, 3, , J – 1 and m, n = 0, 1, 2, 3, , [16,17]. Equation (3) shows that f(x, y) is obtained via the inverse discrete wavelet transform for given and of Equations (1) and (2). DWT can be implemented using digital-filters and down-samplers [16,17]. The block diagram of multiresolution using DWT is shown in the Figure 5.

We have used MATLAB toolbox for multiresolution by DWT [20]. DWT is applied three times successively on each dilated character image generated in the Section 2.2, and finally the reduced character image is captured into a bounding box using MATLAB toolbox and then, resized into 14 by 10 pixels image. The effect of multiresolution technique on a character obtained in Section 2 is given in the Figure 5.

2.4. Classification and Recognition

In this section, EDM is used to classify the input pattern vectors [31,32]. Pattern vectors generated in the Section 2.3 are grouped into 26 classes based on their similar

Figure 5. Character image after applying multiresolution and captured into bounding box.

properties. Each class contains pattern vectors, mean vector of each class is computed. We prepare a matrix whose rows are these mean vectors. Distance from input pattern vector to each mean vector is computed and minimum distance shows the class of input pattern vector to which it belongs [33-36]. Suppose, shows the pattern vector, shows the pattern class, shows the mean vector of jth class and shows the component of pattern vector. Mean vector is used as the representative of the class of vectors:

where, is the number of pattern vectors in the class and W is the number of pattern classes. Euclidean distance classifier is defined as:

where, x is the input pattern vector and is the distance from x to mean vector of the jth class. Input pattern vector x will belong to class wj if is the smallest distance [19,20].

3. Experimental Results

150 samples (3900 characters) were collected from 150 persons of different ages, 26 from each. First, 100 samples (2600 characters) were used to train the proposed HCR system and next 50 samples (1300 characters) were used to test the proposed HCR system. An analysis of experimental results has been performed and shown in Table 1 given below.

Here, we compare our results using the new approach with those of some other authors. The method used by Chen et al. [1] gets recognition rate of 92.20% for handwritten numerals. The method of Romero et al. [28] gets 90.20% recognition accuracy over the test data for handwritten numerals. The method of Mowlaei et al. [26] gets recognition rates of 92.33% and 91.81% for 8 classes of handwritten Farsi characters and numerals respectively.

4. Conclusion

From the above Table 1, we observe that the recognition

Table 1. The results showing the average recognition accuracy, here it is 90%.

accuracy is high for a particular level of multiresolution and appropriate resolution of input character images. When average recognition accuracy is optimal (maximum) for a particular level of multiresolution and appropriate resolution of character images, here it is 90%, then any further increment in the level of multiresolution results in the decrease of the average recognition accuracy. The resolution of input character image and the level of multiresolution are dependent upon each other so the challenge is to find appropriate relationship between them. Work is going on, we are trying to improve the recognition accuracy using some other methods and techniques such as fuzzy logic and hidden Markov models.

REFERENCES

1. G. Y. Chen, T. D. Bui and A. Krzyzak, “Contour-Based Handwritten Numeral Recognition Using Multiwavelets and Neural Networks,” Pattern Recognition, Vol. 36, No. 7, 2003, pp. 1597-1604. doi:10.1016/S0031-3203(02)00252-2
2. J. Sung, S.-Y. Bang and S. J. Choi, “A Bayesian Network Classifier and Hierarchical Gabor Features for Handwritten Numeral Recognition,” Pattern Recognition Letters, Vol. 27, No. 1, 2006, pp. 66-75. doi:10.1016/j.patrec.2005.07.003
3. C.-L. Liu and H. Sako, “Class-Specific Feature Polynomial Classifier for Pattern Classification and Its Application to Handwritten Numeral Recognition,” Pattern Recognition, Vol. 39, No. 4, 2006, pp. 669-681. doi:10.1016/j.patcog.2005.04.021
4. A. Broumandnia and J. Shanbehzadeh, “Fast Zernike Wavelet Moments for Farsi Character Recognition,” Image and Vision Computing, Vol. 25, No. 5, 2007, pp. 717- 726. doi:10.1016/j.imavis.2006.05.014
5. Y. Mizukami, “A Handwritten Chinese Character Recognition System Using Hierarchical Displacement Extraction Based on Directional Features,” Pattern Recognition Letters, Vol. 19, No. 7, 1998, pp. 595-604. doi:10.1016/S0167-8655(98)00034-8
6. A.-B. Wang and K.-C. Fan, “Optical Recognition of Handwritten Chineses Character by Hierarchical Radical Matching Method,” Pattern Recognition, Vol. 34, No. 1, 2001, pp. 15-35. doi:10.1016/S0031-3203(99)00207-1
7. H.-Y. Kim and J. H. Kim, “Hierarchical Random Graph Represetation of Handwritten and Its Application to Hangul Recognition,” Pattern Recognition, Vol. 34, No. 2, 2001, pp. 108-201. doi:10.1016/S0031-3203(99)00222-8
8. A. Cheung, M. Bennamoun and N. W. Bergmann, “An Arabic OCR System Using Recognition-Based Segmentation,” Pattern Recognition, Vol. 34, No. 2, 2001, pp. 215- 233. doi:10.1016/S0031-3203(99)00227-7
9. R. M. K. Sinha and H. N. Mahabala, “Machine Recognition of Devanagari Script,” IEEE Transactions on Systems, Man, and Cybernetics, Vol. 8, 1979, pp. 435-441.
10. B. B. Chaudhuri and U. Pal, “A Complete Printed Bangla OCR System,” Pattern Recognition, Vol. 31, No. 5, 1998, pp. 531-549. doi:10.1016/S0031-3203(97)00078-2
11. S. N. S. Rajasekharan and B. L. Deekshatulu, “Generation and Recognition of Printed Telugu Characters,” Computer Graphics and Image Processing, Vol. 6, No. 4, 1977, pp. 335-360. doi:10.1016/0146-664X(77)90028-4
12. B. B. Chaudhuri, O. A. Kumar and K. V. Ramana, “Automatic Generation and Recognition of Telugu Script Characters,” IETE Journal of Rearch, Vol. 37, 1991, pp. 499-511.
13. M. B. Sukhaswami, P. Seetharamulu and A. K. Pujari, “Recognition of Telugu Characters Using Neural Networks,” International Journal of Neural Systems, Vol. 6, No. 3, 1995, pp. 317-357. doi:10.1142/S0129065795000238
14. A. A. Desai, “Gujarati Handwritten Numeral Optical Character Reorganization through Neural Network,” Pattern Recognition, Vol. 43, No. 7, 2010, pp. 2582-2589. doi:10.1016/j.patcog.2010.01.008
15. N. Arica and F. T. Yarman-Vural, “An Overview of Character Recognition Focused on Off-Line Handwriting,” IEEE Transactions on Systems, Man, and Cybernetics— Part C: Applications and Reviews, Vol. 31, No. 2, 2001, pp. 216-233. doi:10.1109/5326.941845
16. R. C. Gonzalez and R. E. Woods, “Digital Image Processing,” Pearson Education, Upper Saddle River, 2002.
17. S. Mallat, “A Wavelet Tour of Signal Processing,” Academic Press, Burlington, 2008.
18. S. Theodoridis and K. Koutroumbas, “An Introduction to Pattern Recognition: A MATLAB Approach,” Elsevier, Burlington, 2010.
19. R. O. Duda, P. E. Hart and D. G. Stork, “Pattern Classification,” Wily-India, 1993.
20. R. C. Gonzalez and R. E. Woods, “Digital Image Processing Using MATLAB,” Pearson Education, Upper Saddle River, 2004.
21. R. Plamondon and S. N. Srihari, “On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 1, 2000, pp. 63-84. doi:10.1109/34.824821
22. U. Pal and B. B. Chaudhari, “Indian Script Character Recognition: A Survey,” Pattern Recognition, Vol. 37, No. 9, 2004, pp. 1887-1899. doi:10.1016/j.patcog.2004.02.003
23. J. Mantas, “An Overview of Character Recognition Methodologies,” Pattern Recognition, Vol. 19, No. 6, 1986, pp. 425-430. doi:10.1016/0031-3203(86)90040-3
24. E. A. Fadhel and P. Bhattacharyya, “Application of a Steerable Wavelet Transform Using Neural Network for Signature Verification,” Pattern Analysis & Applications, Vol. 2, 1999, pp. 184-195. doi:10.1007/s100440050027
25. R. E. Leary, “Unrestricted Off-Line Handwriting Recognition: A Preprocessing Approach,” Unpublished Paper Written at Rensselaer Polytechnic Institute, 2009, pp. 1- 11.
26. A. Mowlaei, K. Faez and A. T. Haghighat, “Feature Extraction with Wavelet Transform for Recognition of Isolated Handwritten Farsi or Arabic Characters and Numerals,” IEEE Digital Signal Processing, Vol. 2, No. 2, 2002, pp. 923-926.
27. P. Wunsch and A. F. Laine, “Wavelet Descriptors for Multiresolution Recognition of Handprinted Characters,” Pattern Recognition, Vol. 28, No. 8, 1995, pp. 1237-1249. doi:10.1016/0031-3203(95)00001-G
28. D. J. Romero, L. M. Seijas and A. M. Ruedin, “Directional Continuous Wavelet Transform Applied to Handwritten Numerals Recognition Using Neural Networks,” Java Caching System, Vol. 7, No. 1, 2007, pp. 66-71.
29. S.-W. Lee, C.-H. Kim, H. Ma and Y. Y. Tang, “Multiresolution Recognition of Unconstrained Handwritten Numerals with Wavelet Transform and Multilayer Cluster Neural Network,” Pattern Recognition, Vol. 29, No. 12, 1996, pp. 1953-1961. doi:10.1016/S0031-3203(96)00053-2
30. J. C. Lee, T. J. Fong and Y. F. Chang, “Feature Extraction for Handwritten Chinese Character Recognition Using X-Y Graphs Decomposition and Haar Wavelet,” IEEE International Conference on Signal and Image Processing Applications, 18-19 November 2009, pp. 10-14.
31. D. Sharma and D. Gupta, “Isolated Handwritten Digit Recognition Using Adaptive Unsupervised Incremental Learning Technique,” International Journal of Computer Applications, Vol. 7, No. 4, 2010, pp. 27-33.
32. T. Hastie and P. Y. Simard, “Metrics and Models for Handwritten Character Recognition,” Statistical Science, Vol. 13, No. 1, 1998, pp. 54-65. doi:10.1214/ss/1028905973
33. S. Basu, N. Das, R. Sarkar, M. Kundu, M. Nasipuri and D. K. Basu, “A Hierarchical Approach to Recognition of Handwritten Bangla Characters,” Pattern Recognition, Vol. 42, No. 7, 2009, pp. 1467-1484. doi:10.1016/j.patcog.2009.01.008
34. C.-L. Liu and C. Y. Suen, “A New Benchmark on the Recognition of Handwritten Bangla and Farsi Numeral Characters,” Pattern Recognition, Vol. 42, No. 12, 2009, pp. 3287-3295. doi:10.1016/j.patcog.2008.10.007
35. M. E. Stevens, “Introduction to the Special Issue on Optical Character Recognition (OCR),” Pattern Recognition, Vol. 2, No. 3, 1970, pp. 147-150. doi:10.1016/0031-3203(70)90026-9
36. J. C. Rabinow, “Whither OCR and Whence?” Datamation, 1969, pp. 38-42.