A method and system for document speech processing based on intelligent indexing

A speech-processing method, applied in the field of information processing, that addresses problems such as loss of information, poor flexibility of use, and incoherent reading content, making speech output more flexible and varied and reading more engaging.

Status: Inactive
Publication Date: 2011-12-14
Owner: PEKING UNIV FOUNDER GRP CO LTD +1
Cites: 5; Cited by: 14

AI Technical Summary

Problems solved by technology

The existing method has the following problems:

1. It distinguishes the attributes of different text content by semantics alone, so it cannot reliably identify the attributes of text content in digital files with complex layouts.
2. It does not recognize pictures, so it cannot be applied to voice reading of digital files that contain pictures. For digital files in which pictures carry much of the information, omitting the pictures inevitably loses a large amount of information and makes the reading incoherent or wrong.
3. It lets users subscribe to different text content in a text file, but it cannot jump between paragraphs during reading, and users cannot set different reading methods and reading orders for different content according to their own needs, so flexibility of use is poor.



Examples


Embodiment 1

[0022] In this embodiment, the object to be converted to speech is the text content contained in a digital file that meets the above conditions. Figure 1 is a flowchart of the method for converting documents to speech based on intelligent indexing according to this embodiment. Referring to Figure 1, the method includes the following steps:

[0023] Step S11: file parsing

[0024] In this step, the file to be converted to speech is parsed to extract original text block information. This information includes the text content, position information, and style information of each original text block; the style information includes the font, font size, sequence number, and similar attributes.
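The parsed output of step S11 can be pictured as a small record type. The following is a minimal sketch, not the patent's implementation: the class names, the bounding-box convention, the input record keys, and the reading of "sequence number" as the block's order in the file are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# Position as (x0, y0, x1, y1); this coordinate convention is an assumption.
BBox = Tuple[float, float, float, float]


@dataclass(frozen=True)
class StyleInfo:
    font: str
    font_size: float
    sequence_number: int  # assumed here to be the block's order in the file


@dataclass(frozen=True)
class OriginalTextBlock:
    text: str          # text content
    position: BBox     # position information
    style: StyleInfo   # style information


def parse_records(records: List[Dict]) -> List[OriginalTextBlock]:
    """Convert already-extracted raw records into OriginalTextBlock objects.

    Each record is a hypothetical dict with keys "text", "font", "size", "bbox".
    """
    blocks = []
    for seq, rec in enumerate(records):
        style = StyleInfo(rec["font"], float(rec["size"]), seq)
        blocks.append(OriginalTextBlock(rec["text"], tuple(rec["bbox"]), style))
    return blocks


records = [
    {"text": "Headline", "font": "SimHei", "size": 18, "bbox": [50, 40, 400, 70]},
    {"text": "First paragraph.", "font": "SimSun", "size": 10, "bbox": [50, 80, 400, 200]},
]
blocks = parse_records(records)
```

Keeping position and style next to the text is what lets a later indexing step decide which blocks belong together without re-reading the file.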

[0025] Step S12: text block indexing

[0026] In this step, the original text blocks are indexed: text blocks are merged, the content attribute of each merged text block is marked, and articles are constructed.
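The three operations in step S12 (merge, mark attributes, construct articles) can be illustrated with a small sketch. The source does not state the rules used, so everything below is an assumption: consecutive blocks are merged when they share font and font size, a block is marked "title" when its font is larger than the size that carries the most text, and each title starts a new article. The names `Block`, `Article`, `merge_blocks`, `label_blocks`, and `build_articles` are hypothetical.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import List


@dataclass
class Block:
    text: str
    font: str
    font_size: float
    seq: int
    attribute: str = ""  # filled in by label_blocks


@dataclass
class Article:
    title: str
    body: List[str] = field(default_factory=list)


def merge_blocks(blocks: List[Block]) -> List[Block]:
    """Merge consecutive blocks that share font and size (assumed rule)."""
    merged: List[Block] = []
    for b in sorted(blocks, key=lambda blk: blk.seq):
        last = merged[-1] if merged else None
        if last and (last.font, last.font_size) == (b.font, b.font_size):
            last.text += " " + b.text
        else:
            merged.append(Block(b.text, b.font, b.font_size, b.seq))
    return merged


def label_blocks(blocks: List[Block]) -> None:
    """Mark blocks 'title' or 'body' by comparing with the dominant font size."""
    if not blocks:
        return
    chars = Counter()
    for b in blocks:
        chars[b.font_size] += len(b.text)
    body_size = chars.most_common(1)[0][0]  # size that carries the most text
    for b in blocks:
        b.attribute = "title" if b.font_size > body_size else "body"


def build_articles(blocks: List[Block]) -> List[Article]:
    """Start a new article at each title; attach following body blocks to it."""
    articles: List[Article] = []
    for b in blocks:
        if b.attribute == "title":
            articles.append(Article(title=b.text))
        elif articles:
            articles[-1].body.append(b.text)
        else:
            articles.append(Article(title="", body=[b.text]))  # untitled lead-in text
    return articles
```

A real layout would need position information as well (columns, indentation, spacing), which is why the parsing step extracts it; this sketch uses style alone to stay short.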

[0027] In the present invention, in order to realize the correct identif...

Embodiment 2

[0046] In this embodiment, the object to be converted to speech includes the text content and picture information contained in a digital file that meets the above conditions. Figure 3 is a flowchart of a method for converting documents to speech based on intelligent indexing according to the second embodiment of the present invention. Referring to Figure 3, the method includes the following steps:

[0047] Step S31: file parsing

[0048] In this step, the file to be converted to speech is parsed, and original text block information and picture block information are extracted. The original text block information includes at least one of the text content, position information, and style information of the original text block, and the picture block information includes the position information of the picture block.

[0049] Step S32, text block and picture block indexing step

[0050] In this step, the original text block and the picture block are indexed to merge the text block, calib...
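The abstract describes this indexing step as also associating picture blocks with their illustrated text blocks. One way to do that from position information alone is sketched below. This is an assumption for illustration, not the rule the patent uses: a picture is paired with the nearest text block that overlaps it horizontally and lies within a hypothetical `max_gap` vertically. All names are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

# (x0, y0, x1, y1) with y growing downward; the convention is an assumption.
BBox = Tuple[float, float, float, float]


@dataclass
class TextBlock:
    text: str
    bbox: BBox


@dataclass
class PictureBlock:
    name: str
    bbox: BBox


def horizontal_overlap(a: BBox, b: BBox) -> float:
    """Width of the shared x-range of two boxes; 0 if they do not overlap."""
    return max(0.0, min(a[2], b[2]) - max(a[0], b[0]))


def vertical_gap(a: BBox, b: BBox) -> float:
    """Distance between two boxes along y; 0 if they overlap vertically."""
    return max(0.0, max(a[1], b[1]) - min(a[3], b[3]))


def associate(pictures: List[PictureBlock], texts: List[TextBlock],
              max_gap: float = 20.0) -> Dict[str, Optional[str]]:
    """Map each picture name to the text of its nearest qualifying block, or None."""
    result: Dict[str, Optional[str]] = {}
    for pic in pictures:
        candidates = [
            t for t in texts
            if horizontal_overlap(pic.bbox, t.bbox) > 0
            and vertical_gap(pic.bbox, t.bbox) <= max_gap
        ]
        best = min(candidates, key=lambda t: vertical_gap(pic.bbox, t.bbox), default=None)
        result[pic.name] = best.text if best else None
    return result
```

With an association like this in place, a reader could speak the illustrating text at the point where the picture appears instead of skipping the picture silently.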


Abstract

The invention provides a method for converting files to speech based on intelligent indexing. The method comprises the following steps: indexing the original text blocks and picture blocks extracted from a digital file so as to merge the text blocks, mark the content attributes of the text blocks, associate the picture blocks with their illustrated text blocks, and construct articles; establishing a text information list that describes the data relationships among different articles and/or among text content within the same article, the association between picture blocks and illustrated text blocks and/or picture information, and the reading sequence; and passing the information in the text information list to a voice library to generate a voice recording file or product, or to read the content aloud. Correspondingly, the invention also provides a system for converting files to speech. The method and system allow different text blocks to be converted to speech in different ways and allow switching between paragraphs during reading, which makes the speech output more flexible and varied and the reading more engaging. In addition, the user can preset the reading sequence and the voice library parameters, so flexibility of use is high.
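The text information list and the user-preset reading sequence described in the abstract can be sketched as follows. This is an illustration under stated assumptions: the `Entry` fields, the attribute names, and the shape of the voice parameters are hypothetical, and no actual speech engine is called, because the source does not name the voice library or its interface.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Entry:
    article: str    # which article the text belongs to
    attribute: str  # content attribute, e.g. "title" or "body" (assumed names)
    text: str


def reading_queue(entries: List[Entry], order: List[str],
                  voices: Dict[str, dict]) -> List[Tuple[str, dict]]:
    """Arrange entries by the user's preset attribute order and attach voice settings.

    Attributes missing from `order` are skipped; entries that share an
    attribute keep their original relative order because the sort is stable.
    """
    rank = {attr: i for i, attr in enumerate(order)}
    selected = [e for e in entries if e.attribute in rank]
    selected.sort(key=lambda e: rank[e.attribute])
    return [(e.text, voices.get(e.attribute, {})) for e in selected]


entries = [
    Entry("A", "title", "News A"),
    Entry("A", "body", "Text of article A."),
    Entry("B", "title", "News B"),
    Entry("B", "body", "Text of article B."),
]
# Hypothetical voice library parameters, one set per content attribute.
voices = {"title": {"voice": "female", "rate": 0.9}, "body": {"voice": "male", "rate": 1.0}}

# Read every title first, then every body.
queue = reading_queue(entries, ["title", "body"], voices)
```

Each `(text, settings)` pair in `queue` is what would be handed to the voice library in turn; changing `order` to `["body"]` would skip the titles entirely, which is the kind of user control the abstract describes.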

Description

Technical field

[0001] The invention belongs to the technical field of information processing, and in particular relates to a method and system for converting documents to speech based on intelligent indexing.

Background

[0002] With the development of voice technology, voice reading has become an important function on various terminal devices, giving users a new, auditory reading experience. However, currently available voice reading software and published methods for reading documents aloud basically read page by page without identifying or distinguishing the content, so they offer only a single reading mode. Even the voice software that supports drag-and-drop reading can do so only with manual intervention. For example, Free Read software requires the user to manually select text in order to achieve drag-and-drop reading. For the user, this is inflexible and offers only a single mode.

[0003] In the Chinese patent application "A System and...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/08
Inventor: 邓姿王长桥张军李松峰
Owner: PEKING UNIV FOUNDER GRP CO LTD