Research Article Open Access

Handwritten Characters Extraction from Form Based on Line Shape Characteristics

Ali Qusay Al-Faris, Dzulkifli Mohamad, Umi Kalthum Ngah and Nor Ashidi Mat Isa

Abstract

Problem statement: Data entry forms are a convenient and well-established tool for collecting information, filled in by hand with a pen. Among the most important fields in these forms are the data filled boxes. Extracting the handwriting from data entry forms matters for many purposes, such as documentation and archiving, and it is a necessary step before handwriting recognition.

Approach: A simple and effective approach is presented for extracting handwritten characters, including digits and letters of any language, from the data filled boxes of a data entry form, and for handling overlaps between the handwritten characters and the boxes’ lines. The approach is based on line shape characteristics: it detects and removes the straight vertical and horizontal lines of the boxes while preserving the curved lines that represent the handwritten characters. Where handwritten characters overlap the boxes’ lines, morphological dilation is used to reconstruct the characters broken by the removal of those lines.

Results: Experimental results demonstrate that the proposed approach extracts handwriting from data filled boxes with an overall rate of 94.052% on a collection of 150 forms.

Conclusion: The proposed algorithm has been implemented and tested successfully, meeting the objective of extracting handwriting of any language from data filled boxes. However, it cannot handle cases in which a character touches an adjacent character.
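The two steps named in the abstract, removing straight horizontal and vertical lines and then dilating to repair broken strokes, can be illustrated with a minimal sketch. This is not the authors' implementation: the run-length test for "straight line", the 3x3 structuring element, the `min_len` threshold, and all function names are assumptions chosen for illustration. The image is assumed to be already binarized, with 1 for ink and 0 for background.

```python
def long_run_mask(img, min_len):
    """Mark pixels lying on a horizontal or vertical run of >= min_len ink pixels.

    Assumption: a box line shows up as a long straight run, whereas curved
    handwriting produces only short runs in either direction.
    """
    h, w = len(img), len(img[0])
    mask = [[0] * w for _ in range(h)]
    # Horizontal runs, scanned row by row.
    for y in range(h):
        x = 0
        while x < w:
            if img[y][x]:
                start = x
                while x < w and img[y][x]:
                    x += 1
                if x - start >= min_len:
                    for i in range(start, x):
                        mask[y][i] = 1
            else:
                x += 1
    # Vertical runs, scanned column by column.
    for x in range(w):
        y = 0
        while y < h:
            if img[y][x]:
                start = y
                while y < h and img[y][x]:
                    y += 1
                if y - start >= min_len:
                    for i in range(start, y):
                        mask[i][x] = 1
            else:
                y += 1
    return mask


def remove_lines(img, min_len):
    """Erase every ink pixel that belongs to a long straight run."""
    mask = long_run_mask(img, min_len)
    return [[1 if p and not m else 0 for p, m in zip(row, mrow)]
            for row, mrow in zip(img, mask)]


def dilate(img):
    """Binary dilation with a 3x3 square structuring element."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if img[y][x]:
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if 0 <= ny < h and 0 <= nx < w:
                            out[ny][nx] = 1
    return out


def extract_handwriting(img, min_len):
    """Remove box lines, then dilate to bridge gaps left in crossing strokes."""
    return dilate(remove_lines(img, min_len))
```

In this sketch a stroke that crosses a box line loses the pixels it shares with the line, and the dilation step closes a gap of one pixel. It also thickens every remaining stroke, so a practical system would need to choose the structuring element and `min_len` to suit the scan resolution and line thickness.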

Journal of Computer Science
Volume 7 No. 12, 2011, 1778-1783

DOI: https://doi.org/10.3844/jcssp.2011.1778.1783

Submitted On: 28 March 2011
Published On: 15 October 2011

How to Cite: Al-Faris, A. Q., Mohamad, D., Ngah, U. K. & Isa, N. A. M. (2011). Handwritten Characters Extraction from Form Based on Line Shape Characteristics. Journal of Computer Science, 7(12), 1778-1783. https://doi.org/10.3844/jcssp.2011.1778.1783

Keywords

  • Document image processing
  • overlapping characters
  • proposed algorithm
  • successfully implemented
  • immediate characters
  • handwritten characters
  • boxes’ lines
  • shape characteristic
  • extract handwritten