410 likes | 616 Vues
A mathematical formula recognition method and its performance evaluation. Masayuki Okamoto Shinshu University JAPAN. Goal of our study Character and symbol recognition Structure analysis and recognition Performance evaluation method Experimental results Future works.
E N D
A mathematical formula recognition method and its performance evaluation Masayuki Okamoto Shinshu University JAPAN
Goal of our study Character and symbol recognition Structure analysis and recognition Performance evaluation method Experimental results Future works Overview of presentation
High performance formula recognition system for “Archiv der Mathematik” Goal of our study
Overview of Recognition System Labeling Character or symbol recognition Touching character separation Structure recognition
Font type (1/2) • Alphabet • Roman • Italic • Bold • Calligraphy • German • Greek
Font type (2/2) • Digits • Mathematical symbols • Characters Normal size Small size Number of characters or symbols: 650
Dictionary data • Following three features are calculated from each sample image features feature calculation dictionary • Mesh features • Peripheral features • PDC features
features result comparison Dictionary data Character recognition process Given image Feature calculation
Majority vote Character recognition process • We classify the given image with each feature and we use the majority vote Result from mesh features Result from peripheral features Result from PDC features
Touching characters • We assume a character which has a low score of similarity as a touching character ‘(’ /0.980 ‘y’ /0.990 ‘O’/0.847 Result/Score
Blurring the image Calculate minimal points Estimate cutting lines Classification Comparison Touching character segmentation(1)
Make projection profile Projection profile |hi– hi+1 | > θ Recognize Touching character segmentation (2) Image
Segmentation experiment • 47 touching characters found in our experimental data
Touch with fraction bar Correct result • Correct examples
Three touching characters Other types Errors • Errors
Recognition experiment • Number of symbols : 12659 • We excluded touching characters • We distinguished following similar shape characters
Recognition rate • Similar shaped characters
Examples of recognition errors • Most errors occurred at small characters such as scripts
Our previous methods(1) • Projection profile cutting
Core symbol in subexpression Our previous methods(2) • Specific structure processing(Bottom-up) • Script • Root • Matrix • Fundamental structure processing(Top-down) • Vertical division by symbols • Horizontal division by symbols • Horizontal division by blank space
Outline of structure recognition Image Target symbol Top to bottom Character recognition [symbol = fraction,root,matrix] [symbol = script,limit] Structure recognition* Group A processing Group B processing Output Recursion Horizontal connection Output in LaTeX/mathML
Matrix Recognition Target symbol Target symbol Structure Recognition (1/2) • Fractions • Roots • Matrices
Target symbol Target symbol Adjacent symbol Adjacent symbol Structure Recognition (2/2) Scripts Limits
Horizontal Overlap Vertical Overlap Matrix Recognition
Left Parenthesis Right Edge Horizontal Overlap Vertical Overlap Case-distinction
<msubsup rect="1,1,209,210"> Positional Information Answer Database Format <mrow> <msubsup rect="1,1,209,210"> <mrow> <mo>(</mo> <mfrac rect="43,11,87,187"> <mrow> <mi>β</mi> </mrow> <mrow> <mi>α</mi> </mrow> </mfrac rect="43,11,87,187"> <mo>)</mo> </mrow> <mrow> </mrow> <mrow> <mo>(</mo> <msubsup rect="152,24,189,56"> <mi>e</mi> <mrow> <mi>i</mi> </mrow> <mrow> </mrow> </msubsup rect="152,24,189,56"> <mo>)</mo> </mrow> </msubsup rect="1,1,209,210"> <mo>=</mo> . . . Original expression
Correct Recognition Count found Number in original expression (N) Number correctly recognized (C) Scripts 2 1 Fractions 1 1 found Recognition rate = C / N not found Comparison between Results and Answers (a) Original expression (b) Recognition result <mrow> <msubsup rect="1,1,209,210"> <mrow> <mo>(</mo> <mfrac rect="43,11,87,187"> <mrow> <mi>β</mi> </mrow> <mrow> <mi>α</mi> </mrow> </mfrac rect="43,11,87,187"> <mo>)</mo> </mrow> <mrow> </mrow> <mrow> <mo>(</mo> <msubsup rect="152,24,189,56"> <mi>e</mi> <mrow> <mi>i</mi> </mrow> <mrow> </mrow> </msubsup rect="152,24,189,56"> <mo>)</mo> </mrow> </msubsup rect="1,1,209,210"> <mo>=</mo> . . . <mrow> <mrow> <mo>(</mo> <mfrac rect="43,11,87,187"> <mrow> <mi>β</mi> </mrow> <mrow> <mi>α</mi> </mrow> </mfrac rect="43,11,87,187"> <mo>)</mo> </mrow> <mrow> <mo>(</mo> <msubsup rect="152,24,189,56"> <mi>e</mi> <mrow> <mi>i</mi> </mrow> <mrow> </mrow> </msubsup rect="152,24,189,56"> <mo>)</mo> </mrow> <mo>=</mo> . . .
Correct Results (1/4) • limit Arch.Math., Page 44, Vol. 64
Correct Results (2/4) • Multi-fraction Arch.Math., Page 272, Vol. 65
Correct Results (3/4) • Sparse Matrix Original expression Arch.Math., Page 277, Vol. 64 Recognition result
Correct Results (4/4) • Nested case-distinction Original expression Arch.Math., Page 108, Vol. 64 Recognition result
Errors (1/2) • Matrix Original expression Arch.Math., Page 65, Vol. 24 Recognition result
Errors (2/2) • Case-distinction Original expression Arch.Math., Page 104, Vol. 64 Recognition result
Summary of structure recognition • Extension of recognition method • Matrix and case-distinction • Performance evaluation • Quantitative evaluation for a large number of expressions • Automatic calculation of recognition rate for each typical structure