Unlock instant, AI-driven research and patent intelligence for your innovation.

Rare word processing method, computing device and computer storage medium

A processing method and technology of rare characters, applied in the field of text recognition, can solve problems such as missing characters, limited character encoding methods, missing document content, etc.

Active Publication Date: 2019-02-12
ZHANGYUE TECH CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, due to the limited character encoding methods of PDF and other format documents, a large number of rare characters can only be represented in the form of path lines. For these rare characters, in the process of converting them into ePUB, the characters at the corresponding positions cannot be extracted. , so that there is a lack of content in the document presented to the user; and, due to the lack of characters corresponding to the position of the uncommon word, when typesetting the streaming document, the text before and after the uncommon word will be recognized as two lines, resulting in typesetting confusion

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Rare word processing method, computing device and computer storage medium
  • Rare word processing method, computing device and computer storage medium
  • Rare word processing method, computing device and computer storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0028] figure 1 A flow chart of a method for processing rare words according to an embodiment of the present invention is shown. Such as figure 1 As shown, the method includes the following steps:

[0029] Step S101: Recognize each line of text objects in the document to be recognized.

[0030] In the process of converting the layout document into a flow document, text content needs to be extracted from the layout document so as...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an uncommon word processing method, a computing device and a computer storage medium. The method comprises the steps that identification is conducted on each row of text objecTS of a to-be-identified text; an uncommon character area is determined by using a preset rule according to identification resulTS of each row of the text objecTS; screenshot processing is conducted onthe uncommon character area to obtain an uncommon character image; an uncommon character filling object is obtained according to the uncommon character image, and the uncommon character filling object fills the uncommon character area. According to the method, the uncommon character filling object can be obtained according to the uncommon character image, and missing of the text object of the corresponding uncommon character area in a file which is presented to a user is avoided, so that reading of the user is smoother; meanwhile, the problem of disordered typesetting caused by missing of thetext object in the uncommon character area is avoided.

Description

technical field [0001] The invention relates to the technical field of text recognition, in particular to a rare word processing method, computing equipment and computer storage medium. Background technique [0002] At present, with the popularity of mobile terminals such as mobile phones and the development of e-book readers, e-books are becoming more and more popular among readers. At the same time, in the e-reader, in order to make the content of the document be displayed in the most suitable way for reading according to the characteristics of the reading device, it is necessary to convert the format document into a streaming document, for example, convert a PDF document into an electronic publishing document (Electronic Publication, referred to as ePUB). [0003] However, due to the limited character encoding methods of PDF and other format documents, a large number of rare characters can only be represented in the form of path lines. For these rare characters, in the p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/27
CPCG06F40/279
Inventor 张恒
Owner ZHANGYUE TECH CO LTD