Date of Award
5-2013
Degree Type
Report
Degree Name
Master of Science (MS)
Department
Computer Science
Committee Chair(s)
Vladimir A. Kulyukin
Committee
Vladimir A. Kulyukin
Committee
Daniel W. Watson
Committee
Nicholas Flann
Abstract
Many visually impaired individuals manage their daily activities with the help of smartphones. While there are many vision-based mobile applications for identifying products, there is a relative dearth of applications for extracting useful nutrition information. In this report, we study the performance of existing OCR systems available for the Android platform and choose the best-performing one to extract nutrition facts from U.S. grocery store packages. We then present approaches to improving the text strings produced by the Tesseract OCR engine on image segments of nutrition tables, which are automatically extracted by an Android 2.3.6 smartphone application from real-time video streams of grocery products. We also present an algorithm, called Skip Trie Matching (STM), for real-time OCR output error correction on smartphones. The algorithm’s performance is compared with that of Apache Lucene’s spell checker. Our evaluation indicates that the average run time of the STM algorithm is lower than Lucene’s. (68 pages)
Recommended Citation
Vanka, Aditya, "Skip Trie Matching for Real-Time OCR Output Error Correction on Smartphones" (2013). All Graduate Plan B and other Reports, Spring 1920 to Spring 2023. 298.
https://digitalcommons.usu.edu/gradreports/298
Copyright for this work is retained by the student. If you have any questions regarding the inclusion of this work in the Digital Commons, please email us at .