Document Type
Thesis
Date of Award
5-31-1991
Degree Name
Master of Science in Computer and Information Science - (M.S.)
Department
Computer and Information Science
First Advisor
Frank Y. Shih
Abstract
In this thesis, I present an optical character recognition technique based on a rule-based module and object-oriented design. A document image is first scanned and then segmented into isolated characters using projection profiles. Recognition proceeds in three steps: contour extraction, stroke detection, and rule-based classification. Contour extraction traces the outer boundary of each character and performs boundary linking and noise removal. Stroke detection identifies the features, or strokes, of that boundary, such as horizontal and vertical lines, left and right slash lines, and left-opened and right-opened curves. Rule-based classification uses two kinds of rules: character definition rules and control rules. The character definition rules define the elementary criteria for combinations of detected strokes. The control rules, represented by a hash table, embody inferences about the order in which the character recognizers are matched. Experimental results show that the system reaches a recognition rate of 95%.
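As a minimal illustration of segmentation by projection profiles, the Python sketch below counts foreground pixels in each column of a binary image and cuts wherever the count drops below a threshold. It is not taken from the thesis: the function names, the list-of-rows 0/1 image representation, and the min_ink parameter are assumptions made for this example only.

```python
from typing import List, Tuple


def column_profile(image: List[List[int]]) -> List[int]:
    """Count foreground (1) pixels in each column of a binary image."""
    if not image:
        return []
    return [sum(column) for column in zip(*image)]


def segment_columns(image: List[List[int]], min_ink: int = 1) -> List[Tuple[int, int]]:
    """Return half-open (start, end) column ranges that contain ink.

    A range opens at the first column whose profile value reaches
    min_ink (a hypothetical threshold) and closes at the next column
    that falls below it.
    """
    profile = column_profile(image)
    spans: List[Tuple[int, int]] = []
    start = None
    for x, ink in enumerate(profile):
        if ink >= min_ink and start is None:
            start = x  # entering a character
        elif ink < min_ink and start is not None:
            spans.append((start, x))  # leaving a character
            start = None
    if start is not None:
        # The last character touches the right edge of the image.
        spans.append((start, len(profile)))
    return spans


if __name__ == "__main__":
    # Two tiny "characters" separated by blank columns.
    page = [
        [0, 1, 1, 0, 0, 1, 0],
        [0, 1, 0, 0, 0, 1, 0],
    ]
    print(segment_columns(page))
```

The same idea applied to row sums would separate text lines before the columns of each line are split into characters; touching or overlapping characters would need additional handling that this sketch does not attempt.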
Recommended Citation
Gao, Min, "Rule-based module and object-oriented design for optical character recognition" (1991). Theses. 2483.
https://digitalcommons.njit.edu/theses/2483