International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 4 - Issue 6 |
Published: July 2010 |
Authors: B.V.Dhandra, Mallikarjun Hangarge |
![]() |
B.V.Dhandra, Mallikarjun Hangarge . Offline Handwritten Script Identification in Document Images. International Journal of Computer Applications. 4, 6 (July 2010), 1-5. DOI=10.5120/834-1170
@article{ 10.5120/834-1170, author = { B.V.Dhandra,Mallikarjun Hangarge }, title = { Offline Handwritten Script Identification in Document Images }, journal = { International Journal of Computer Applications }, year = { 2010 }, volume = { 4 }, number = { 6 }, pages = { 1-5 }, doi = { 10.5120/834-1170 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2010 %A B.V.Dhandra %A Mallikarjun Hangarge %T Offline Handwritten Script Identification in Document Images%T %J International Journal of Computer Applications %V 4 %N 6 %P 1-5 %R 10.5120/834-1170 %I Foundation of Computer Science (FCS), NY, USA
Automatic handwritten script identification from document images facilitates many important applications such as sorting, transcription of multilingual documents and indexing of large collection of such images, or as a precursor to optical character recognition (OCR). In this paper, we investigate a texture as a tool for determining the script of handwritten document image, based on the observation that text has a distinct visual texture. Further, K nearest neighbour algorithm is used to classify 300 text blocks as well as 400 text lines into one of the three major Indian scripts: English, Devnagari and Urdu, based on 13 spatial spread features extracted using morphological filters. The proposed algorithm attains average classification accuracy as high as 99.2% for bi-script and 88.6% for tri-script separation at text line and text block level respectively with five fold cross validation test.