:    Optical Character Recognition (OCR) refers to the process of converting printed Tamil text documents into softwaretranslated Unicode Tamil Text. The printed documents available in the form of books, papers, magazines, etc. are scanned usingstandard scanners which produce an image of the scanned document. As part of the preprocessing phase the image file is checkedfor skewing. If the image is skewed, it is corrected by a simple rotation technique in the appropriate direction. Then the image ispassed through a noise elimination phase and is binarized. The preprocessed image is segmented using an algorithm which decomposes the scanned text into paragraphs using special space detection technique and then the paragraphs into lines using verticalhistograms, and lines into words using horizontal histograms, and words into character image glyphs using horizontal histograms.Each image glyph is comprised of 32×32 pixels. Thus a database of character image glyphs is created out of the segmentationphase. Then all the image glyphs are considered for recognition using Unicode mapping. Each  image glyph is passed throughvarious routines which extract the features of the glyph. The various features that are considered for classification are the characterheight, character width, the number of horizontal lines (long and short), the number of vertical lines (long and short), the horizontally oriented curves, the vertically oriented curves, the number of circles, number of slope lines, image centroid and specialdots. The glyphs are now set ready for classification based on these features. The extracted features are passed to a Support VectorMachine (SVM) where the characters are classified by Supervised Learning Algorithm. These classes are mapped onto Unicodefor recognition. Then the text is reconstructed using Unicode fonts.


Did you like this research project?

To get this research project Guidelines, Training and Code... Click Here


PROJECT TITLE : Deep Neural Network Regression for Automated Retinal Layer Segmentation in Optical Coherence Tomography Images ABSTRACT: The quantification of layer information in early diagnosis of retinal disorders, the primary
PROJECT TITLE :Optimum Monthly Based Selection of Ground Stations for Optical Satellite Networks - 2018ABSTRACT:Satellite communication networks at optical frequencies are proposed to produce a terribly high throughput for backhauling
PROJECT TITLE :Energy-Efficient Transponder Configuration for FMF-Based Elastic Optical Networks - 2018ABSTRACT:We propose an energy-efficient procedure for transponder configuration in few-mode fiber-based elastic optical networks
PROJECT TITLE :Capacity Bounds and High-SNR Capacity of MIMO Intensity-Modulation Optical Channels - 2018ABSTRACT:The capability of the intensity modulation direct detection multiple-input-multiple-output channel is studied. Therein,
PROJECT TITLE : High Accuracy Retinal Layer Segmentation For Optical Coherence Tomography Using Tracking Kernels Based On Gaussian Mixture Model - 2014 ABSTRACT: Ophthalmology needs automated segmentation of retinal layers

Ready to Complete Your Academic MTech Project Work In Affordable Price ?

Project Enquiry