Network protected index data obtaining method based on OCR technology

An acquisition method and protected technology, applied in the field of network communication, can solve problems such as fixed content, low accuracy of results, and reduced efficiency, and achieve the effect of batch acquisition of acquired data, accurate acquired data, and wide application value
CN106095918AActive Publication Date: 2016-11-09SHANDONG UNIV OF SCI & TECH

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
SHANDONG UNIV OF SCI & TECH
Publication Date
2016-11-09

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention relates to a network protected index data obtaining method based on the OCR technology. The method includes the steps that an automatic testing tool is used for simulating a series of operation on a data platform by a user before index data display, and the operation includes login, search keyword input and search time setting; simulation mouse motion is used for dynamically displaying and collecting values on a curve, and finally the improved OCR technology is used for obtaining numerical values of target data. The protected data obtained through the method has the advantages of being high in obtaining efficiency, accurate and capable of being obtained in batches, effective data support is provided for public opinion analysis and data mining, a new thought is provided for the network big data obtaining method, and valuable information is provided for commercial popularization, precise marketing and market analysis. The network protected index data obtaining method has important theoretical significance and wide application value.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to a method for acquiring network protected index data based on OCR technology, and belongs to the technical field of network communication. Background technique

[0002] OCR technology is the abbreviation of Optical Character Recognition (Optical Character Recognition). It converts the text of various bills, newspapers, books, manuscripts and other printed materials into image information through optical input methods such as scanning, and then uses text recognition technology to convert the image information. Enter technology for computers that can be used.

[0003] The process of OCR technology to recognize characters in images can be summarized as image preprocessing, character feature extraction, and font dictionary comparison, which are the three core processes of OCR. Among them, character feature extraction is the most important. This process first performs line or word segmentation on the character sequence to be recogni...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More