Handwritten Word Recognition
        Using Continuous Hidden Markov Models and Structural Features

Computer Science and Engineering Depatment    

 M. Mehdi Haji    

  Thesis Report
  Presentation File
  Executable Files
  Defense Pictures

   

    Recognizing handwritten words has been and still is one of the most challenging problems in Artificial Intelligence (AI). Words are rather complex patterns, having many variations in handwriting style. Despite the considerable progress achieved in recent years, performance of handwriting recognition systems is still far from human's both in terms of accuracy and speed.
    I devoted my M.Sc. thesis to this subject, and providing some of the results here. Since there is a lack of research on the problem of Farsi (Persian) handwritten recognition, the focus of the proposed and surveyed methods was on the Farsi scrip; however, most of them can be applied to Roman scripts with no modifications. To the best of our knowledge, this work is the first to use continuous hidden Markov models with structural features to recognize Farsi handwritten words, and almost all parts of a complete recognition system are addressed. The other contributions of this work are:

  • Developing a new machine learning approach based on the naive Bayes classifier for text segmentation.

  • Comparing and contrasting four different algorithms for document image binarization.

  • Surveying different skew and slant correction algorithms for handwritten words and documents.

  • Comparing and contrasting five different skeletonization algorithms with the main focus on preserving text characteristics.

  • Extracting a set of structural features from Farsi handwritten words, independent of the baseline location.

    The proposed system achieves a maximum recognition rate of about 82% on our lexicon of size 100, that contains word images of 100 cities of Iran. The striking aspect of the recognition system is its excellent generalization performance, as in our experiments, the system trained with multi-font machine-printed word images could recognize handwriting.


Thesis Report

    With over 100 pages, containing about 50 figures and C++ pseudo-code of some of the implemented algorithms:
        handrec.pdf
   

Presentation File

    Looking at the problem and solution from a higher abstraction level, with over 70 slides including many illustrating figures:
        presentation.ppt

Executable Files

    With the executable version of training, recognition and evaluation modules, the proposed system can be trained and evaluated on different datasets, so giving the opportunity of comparing this system with others':
        - The Windows executable: handrec.exe.zip
        - Required DLLs (GDK, GTK, Boost, LTILib, OpenCV and MS VC7): dlls.zip
        - The trained models of 100 cities of Iran: models.zip

    The implementation was carried out in standrad C++. All source codes or parts of them are provided at a reasonable cost. Please contact me via mehdi.haji@gmail.com or m2haji@yahoo.com, and describe your specific needs.

Defense Pictures

    The oral defense session was held on Monday, January, 10, 2005 from 11:15 to 12:30. The supervisory committee was consisted of Dr. Katebi, Dr.Eghbali, Dr. Towhidi from the computer science and engineering department, and Dr. Emdad from the mechanical engineering department. All of the following pictures were taken by my dear friend Vahid Daneshmand.

       


For any further information, please don't hesitate to contact me at any time via mehdi.haji@gmail.com or m2haji@yahoo.com.