Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and device for processing scanned book data

A technology of data and image data, applied in the field of digital typesetting, which can solve the problem of layout rearrangement of scanned books

Active Publication Date: 2013-07-03
NEW FOUNDER HLDG DEV LLC +1
View PDF5 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Aiming at the problem that the layout of scanned books cannot be rearranged in the prior art, the embodiment of the present invention provides a method and device for processing scanned book data, which provides conditions for realizing the layout rearrangement of scanned books, so as to realize the layout of scanned books. Relayout of

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for processing scanned book data
  • Method and device for processing scanned book data
  • Method and device for processing scanned book data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] Aiming at the problem in the prior art that rearrangement of scanned books cannot be realized, the embodiment of the present invention provides a method and device for processing scanned book data, which provides necessary information for rearranging scanned books, thereby realizing the Relayout of scanned books. The method for processing scanned book data may include: reading the page image data of the page document; segmenting and identifying the page image data to obtain the rectangular frame of each character in the page document on the corresponding page document position and character encoding; carry out text line aggregation processing on each line of text in the page document to obtain the text line information of each line of text, and perform each text in each line of text according to the text line information The corresponding rectangular frame is corrected to obtain the exact image rectangular frame position information and text line aggregation information...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and a device for processing scanned book data, which provide necessary conditions for re-typesetting of a page document of a scanned book and therefore can realize the re-typesetting of the scanned book. The method comprises the steps of reading page image data of the page document, dividing and identifying the page image data to obtain rectangular box positions and character codes of letters in the page document on the corresponding page document, conducing letter line aggregation processing on each line of letters in the page document to obtain letter line information of each line of letters, correcting rectangular boxes corresponding to the letters in each line according to the letter line information to obtain exact image rectangular box position information and letter line aggregation information of each letter, and storing the exact image rectangular box position information, the letter line aggregation information and the character code corresponding to each letter in the page document.

Description

technical field [0001] The invention relates to the field of digital typesetting, in particular to a method and device for processing scanned book data. Background technique [0002] The so-called "scanned book" refers to an electronic book obtained by scanning a paper book with a scanner or other equipment. Each page in the scanned book corresponds to a scanned image with a high DPI (Dot Per Inch, resolution). Due to the large amount of data in the scanned image, it is not conducive to data storage and transmission; and, the data on each page It is difficult to be effectively used, such as text copy, layout rearrangement and other applications. [0003] In order to realize text copy, a double-layer page technology is currently proposed, that is, add a transparent layer on the scanned image, and use OCR (Optical Character Recognition, optical character recognition) to add transparent text on the corresponding position of the transparent layer. So that the user can copy the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06T11/60G06K9/20G06V30/224G06V30/40
CPCH04N1/4115G06V30/40
Inventor 仇睿恒李赟
Owner NEW FOUNDER HLDG DEV LLC
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More