Formula identification method and device

A formula recognition and formula technology, which is applied in the Internet field, can solve the problems of low robustness of short text images, and the statistical properties of long texts are not suitable for short texts, so as to reduce the appearance of garbled characters, improve the accuracy rate, and improve the effect of effective information.

Active Publication Date: 2015-05-20
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF5 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But for short texts, the proportions of formulas, texts, and charts in the layout are not much different, so the statistical properties based on long texts are not suitable for short texts
In addition, the existing formula recognition methods are mostly used on some long text images with little change in illumination, relatively clear, and small deformation, but are less robust to short text images randomly input by users.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Formula identification method and device
  • Formula identification method and device
  • Formula identification method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention. On the contrary, the embodiments of the present invention include all changes, modifications and equivalents coming within the spirit and scope of the appended claims.

[0017] figure 1 It is a flowchart of an embodiment of the formula identification method of the present invention, such as figure 1 As shown, the formula identification method may include:

[0018] Step 101, performing distortion correction on the image layout.

[0019] In the short text image input by the user, the rotation and distortion of the graphics often occur, wh...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a formula identification method and device. The formula identification method comprises the following steps: performing distortion correction on image layout; segmenting basic elements in the corrected image layout, and determining a region where the basic elements are positioned as a formula region according to the features of the basic elements in the image layout; performing formula identification on the formula region according to a formula symbol. By adopting the formula identification method and device, a plurality of local features in a short text image can be realized, a formula in the short text image is detected and identified, valid information in the short text image can be effectively enhanced, random codes are reduced, and the accuracy of a whole answering system can be further increased.

Description

technical field [0001] The invention relates to the technical field of the Internet, in particular to a formula recognition method and device. Background technique [0002] With the rapid development of Internet technology and the popularization of smart phones, images have become the main way for people to record and share information, which has spawned a large number of applications that use photos as retrieval input. As a new form of question answering, automatic question answering systems using images as input have attracted more and more attention. [0003] In the automatic answering system, the detection, recognition and retrieval of mathematical formulas are three key issues. At present, common formula detection and recognition methods are mainly used in long text images. Since long text images have rich global information, various elements in the layout have a large degree of discrimination, and the difference of some simple statistical attributes can be used. It i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/20G06K9/54
Inventor 吴仑王岩梁爽陈恭明邹静
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products