Formula identification method and device

A formula recognition and formula technology, which is applied in the Internet field, can solve the problems of low robustness of short text images, and the statistical properties of long texts are not suitable for short texts, so as to reduce the appearance of garbled characters, improve the accuracy rate, and improve the effect of effective information.
CN104636741AActive Publication Date: 2015-05-20BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Patent Information

Authority / Receiving Office
CN Β· China
Patent Type
Applications(China)
Current Assignee / Owner
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Publication Date
2015-05-20

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention provides a formula identification method and device. The formula identification method comprises the following steps: performing distortion correction on image layout; segmenting basic elements in the corrected image layout, and determining a region where the basic elements are positioned as a formula region according to the features of the basic elements in the image layout; performing formula identification on the formula region according to a formula symbol. By adopting the formula identification method and device, a plurality of local features in a short text image can be realized, a formula in the short text image is detected and identified, valid information in the short text image can be effectively enhanced, random codes are reduced, and the accuracy of a whole answering system can be further increased.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the technical field of the Internet, in particular to a formula recognition method and device. Background technique

[0002] With the rapid development of Internet technology and the popularization of smart phones, images have become the main way for people to record and share information, which has spawned a large number of applications that use photos as retrieval input. As a new form of question answering, automatic question answering systems using images as input have attracted more and more attention.

[0003] In the automatic answering system, the detection, recognition and retrieval of mathematical formulas are three key issues. At present, common formula detection and recognition methods are mainly used in long text images. Since long text images have rich global information, various elements in the layout have a large degree of discrimination, and the difference of some simple statistical attributes can be used. It i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More