Network protected index data obtaining method based on OCR technology
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- SHANDONG UNIV OF SCI & TECH
- Publication Date
- 2016-11-09
Smart Images
Figure 1 Figure 2 Figure 3
Abstract
Description
technical field
[0001] The invention relates to a method for acquiring network protected index data based on OCR technology, and belongs to the technical field of network communication. Background technique
[0002] OCR technology is the abbreviation of Optical Character Recognition (Optical Character Recognition). It converts the text of various bills, newspapers, books, manuscripts and other printed materials into image information through optical input methods such as scanning, and then uses text recognition technology to convert the image information. Enter technology for computers that can be used.
[0003] The process of OCR technology to recognize characters in images can be summarized as image preprocessing, character feature extraction, and font dictionary comparison, which are the three core processes of OCR. Among them, character feature extraction is the most important. This process first performs line or word segmentation on the character sequence to be recogni...