Develop an online search and redaction system for correcting OCR-generated errors before or after importing text into a database. Implement text correction tools and create a user-friendly interface for automatic text adjustments. Use parsing programs and code optimization for efficient search functionality. Examples include Serbian bibliographies and national library documents.
ONLINE SEARCH AND REDACTION SYSTEM
Many digitization projects that aim to present data on the internet face the same main tasks and problems:
OCR of scanned papers
Redaction of the text created by the OCR program
Import of the text into a database
Correction of the text and fixing of errors made by the OCR program. This can be done before import into the database, or after import (online redaction)
Creating a search system that will be used by website redactors and end users
A program for automatic correction of text created by OCR
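As an illustration of the automatic correction step, here is a minimal Python sketch of rule-based cleanup of OCR output. The rule list and the function name `correct_ocr` are assumptions made for this example and are not taken from the presentation; a real rule set would be built from the errors actually observed in the scanned material.

```python
import re

# Hypothetical correction rules (pattern, replacement); not from the source deck.
RULES = [
    # Letter l or I between two digits is most likely the digit 1.
    (re.compile(r"(?<=\d)[lI](?=\d)"), "1"),
    # Capital O between two digits is most likely the digit 0.
    (re.compile(r"(?<=\d)O(?=\d)"), "0"),
    # Rejoin words split by a hyphen at a line break.
    # Caution: this also joins genuine hyphenated compounds broken at line end.
    (re.compile(r"(\w)-\n(\w)"), r"\1\2"),
    # Collapse runs of spaces or tabs into a single space.
    (re.compile(r"[ \t]{2,}"), " "),
]


def correct_ocr(text):
    """Apply each correction rule in order and return the cleaned text."""
    for pattern, replacement in RULES:
        text = pattern.sub(replacement, text)
    return text
```

The same function could run before import (on the raw OCR file) or after import (on a database field during online redaction), since it only takes and returns a string.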
1. Example - first page of retrospective Serbian bibliography
2. Example – text of registers: a) nbs and b) msu
a) nbs register text
3. Parsing programs – regular expressions, Perl and PHP
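A parsing step of this kind might look like the following sketch, written in Python rather than the Perl or PHP named on the slide. It assumes a hypothetical register line format, a heading followed by a comma-separated list of entry numbers; the actual layout of the nbs and msu registers is not shown in the text, so the pattern `ENTRY` and the function `parse_register_line` are illustrative only.

```python
import re

# Assumed line format (hypothetical): "<heading> <number>, <number>, ..."
ENTRY = re.compile(r"^(?P<heading>.+?)\s+(?P<refs>\d+(?:\s*,\s*\d+)*)\s*$")


def parse_register_line(line):
    """Split one register line into a heading and a list of entry numbers.

    Returns None when the line does not match the assumed format, so that
    unparsed lines can be passed to manual redaction instead of being lost.
    """
    match = ENTRY.match(line.strip())
    if match is None:
        return None
    refs = [int(n) for n in re.split(r"\s*,\s*", match.group("refs"))]
    return {"heading": match.group("heading"), "refs": refs}
```

Lines that return None are good candidates for the online redaction queue, because they usually contain an OCR error that broke the expected pattern.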
4. Examples: a) national library redaction by multiple users, b) msu redaction
4.a) nbs multiredaction
6. OCR correction program, history of redaction for msu and Montenegro bibliography