Paragraph recognition method, device and terminal equipment

A paragraph recognition method and technology, applied in special data processing applications, instruments, electronic digital data processing, etc., that solves the problem of paragraphs not being recognized and improves efficiency and accuracy.

Active Publication Date: 2018-06-22
ZHANGYUE TECH CO LTD
4 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0004] Embodiments of the present invention provide a paragraph recognition method, device, and terminal equipment to solve the problem that paragraphs in fixed-layout text pages cannot be accurately identified when converting fixed-layout typesetting into flow typesetting.

Examples

Embodiment 1

[0021] Referring to Figure 1, a flow chart of the steps of a paragraph recognition method according to Embodiment 1 of the present invention is shown.

[0022] The paragraph recognition method of this embodiment comprises the following steps:

[0023] Step S102: Perform paragraph recognition on the same document content using multiple paragraph recognition rules.

[0024] The document content includes multiple paragraphs. In the embodiments of the present invention, unless otherwise specified, document content refers to content in a text page that carries no paragraph information, such as a fixed-layout page. A fixed layout always displays the original editing layout during reading and is not automatically rearranged to fit the page width after zooming. Examples include PDF files made from scanned picture manuscripts, and PDF graphics and plain text files produced in a fixed layout.

[0025] In the em...

Embodiment 2

[0035] Referring to Figure 2, a flow chart of the steps of a paragraph recognition method according to Embodiment 2 of the present invention is shown.

[0036] The paragraph recognition method of this embodiment comprises the following steps:

[0037] Step S202: Obtain multiple paragraph recognition rules.

[0038] The multiple paragraph recognition rules may include one or more of ordinary paragraph recognition rules, hanging paragraph recognition rules and poetry paragraph recognition rules. In this embodiment, all three types are set and used.

[0039] The ordinary paragraph recognition rule identifies paragraphs according to the settings of ordinary paragraphs. The settings include but are not limited to: the first character of the first line of the paragraph is indented, for example by two characters; the last character of the last line of the paragraph and the document boundary have...
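
The first-line indent setting described for ordinary paragraphs can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and does not come from the patent text: the `Line` record, the `ordinary_paragraph_starts` and `group_paragraphs` helpers, the use of the smallest left coordinate as the text boundary, and the tolerance value. Only the indent condition is covered; the last-line condition is truncated in the source and is not modeled.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Line:
    """One line of fixed-layout text (hypothetical representation)."""
    text: str
    left: float   # x-coordinate where the first character begins
    right: float  # x-coordinate where the last character ends


def ordinary_paragraph_starts(lines: List[Line], char_width: float,
                              indent_chars: int = 2,
                              tolerance: float = 0.5) -> List[bool]:
    """Flag lines whose left edge is indented by about `indent_chars` characters.

    Assumption: the smallest left coordinate on the page is the left text
    boundary, so at least one line must start flush with that boundary.
    """
    if not lines:
        return []
    margin = min(line.left for line in lines)
    expected = indent_chars * char_width
    slack = tolerance * char_width
    return [abs((line.left - margin) - expected) <= slack for line in lines]


def group_paragraphs(lines: List[Line], starts: List[bool]) -> List[List[str]]:
    """Group line texts into paragraphs, opening a new one at each flagged line."""
    paragraphs: List[List[str]] = []
    for line, is_start in zip(lines, starts):
        if is_start or not paragraphs:
            paragraphs.append([line.text])
        else:
            paragraphs[-1].append(line.text)
    return paragraphs
```

A page on which every line is indented would defeat the margin estimate used here, so a fuller version would probably take the text boundary from page metadata instead.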

Embodiment 3

[0073] Referring to Figure 6, a structural block diagram of a paragraph recognition device according to Embodiment 3 of the present invention is shown.

[0074] The paragraph recognition device in this embodiment includes: a recognition module 302, configured to perform paragraph recognition on the same document content using multiple paragraph recognition rules, wherein the document content includes multiple paragraphs; an acquisition module 304, configured to acquire the recognition result corresponding to each paragraph recognition rule; and a determination module 306, configured to determine the paragraph information of the document content according to the recognition results.

[0075] The paragraph recognition device of this embodiment implements the corresponding paragraph recognition methods of the foregoing method embodiments and has the beneficial effects of those embodiments, which are not repeated here.
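
The three-stage flow (recognize with several rules, acquire one result per rule, determine the paragraph information) can be sketched in Python as follows. This is a hedged illustration only. The source excerpt does not say how the results are combined, so the per-line majority vote in `determine` is an assumption. The boolean "line starts a paragraph" encoding and the three toy rules are hypothetical stand-ins too, not the ordinary, hanging and poetry rules of the patent.

```python
from typing import Callable, Dict, List, Sequence

# Assumed encoding: a rule returns one flag per line,
# True meaning "this line starts a new paragraph".
Rule = Callable[[Sequence[str]], List[bool]]


def recognize(lines: Sequence[str], rules: Dict[str, Rule]) -> Dict[str, List[bool]]:
    """Run every rule on the same content and keep one result per rule."""
    return {name: rule(lines) for name, rule in rules.items()}


def determine(results: Dict[str, List[bool]]) -> List[bool]:
    """Combine the per-rule results by strict majority vote (an assumption)."""
    needed = len(results) / 2
    return [sum(votes) > needed for votes in zip(*results.values())]


# Toy stand-in rules, used only to exercise the pipeline.
def indent_rule(lines: Sequence[str]) -> List[bool]:
    return [line.startswith("  ") for line in lines]


def short_previous_line_rule(lines: Sequence[str], width: int = 20) -> List[bool]:
    # The first line always starts a paragraph; later lines start one
    # when the previous line stops well short of the assumed width.
    return [i == 0 or len(lines[i - 1].rstrip()) < width
            for i in range(len(lines))]


def first_line_only_rule(lines: Sequence[str]) -> List[bool]:
    return [i == 0 for i in range(len(lines))]


page = [
    "  First paragraph starts",
    "and continues to the end",
    "of it.",
    "  Second paragraph here",
    "ends.",
]
rules = {
    "indent": indent_rule,
    "short_previous": short_previous_line_rule,
    "first_only": first_line_only_rule,
}
results = recognize(page, rules)
paragraph_starts = determine(results)
```

Keeping each rule's result separate until the final step mirrors the module split in the text: the determination logic can change without touching any rule.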

Abstract

Embodiments of the invention provide a paragraph recognition method, device and terminal equipment. The paragraph recognition method comprises the following steps: performing paragraph recognition on the same document content using a plurality of paragraph recognition rules, wherein the document content comprises a plurality of paragraphs; obtaining the recognition results corresponding to the paragraph recognition rules; and determining paragraph information of the document content according to the recognition results. With the paragraph recognition method, device and terminal equipment, the paragraph information of the document content can be correctly determined, and the efficiency and correctness of subsequent flow typesetting can be improved.

Description

Technical field

[0001] The embodiments of the present invention relate to the technical field of text typesetting, and in particular to a paragraph recognition method, device and terminal equipment.

Background

[0002] E-books are publications that use computer technology to digitize information such as text, pictures, audio, and video. As Internet technology is applied more and more widely, traditional paper reading has gradually been replaced by e-books, and people increasingly use the Internet and computer technology to download e-books and read them in e-book reading applications.

[0003] Most current e-books use flow typesetting, which requires converting the relevant fixed-layout text pages into flow-typeset pages. How to accurately identify the paragraphs in a fixed-layout text page during this process has become an urgent problem to be solved by those skilled in the art. Con...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/25, G06F40/189
CPC: G06F40/189
Inventor: 孙上斌, 成湘均, 刘伟平, 于刚
Owner: ZHANGYUE TECH CO LTD