Alignment system of on-line speech text and method thereof

A voice-to-text and text-to-text technology, which is applied to TV system components, TV, and color TV components, etc., can solve the problem that there is no corresponding text for news interviews, and it is impossible to take into account the text processing with errors and the real-time acquisition of voice input alignment results. not quite right

Active Publication Date: 2010-02-17
INST OF ACOUSTICS CHINESE ACAD OF SCI +1
View PDF0 Cites 30 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The traditional alignment method is to obtain all the voices at the same time, because there may be inaccuracies in the aligned text, in the news subtitles, it mainly shows that there is no corresponding text for news interviews (simultaneous sound) of some on-site news
Traditional alignment methods cannot handle these errors online
Traditional voice-to-text alignment methods, in order to deal with erroneous segments in the text stream, are usually completed offline after all the voices are acquired, so it is impossible to take into account the processing of erroneous text and the real-time acquisition of real-time voice input alignment results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Alignment system of on-line speech text and method thereof
  • Alignment system of on-line speech text and method thereof
  • Alignment system of on-line speech text and method thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] The present invention will be described in detail below through specific embodiments and in conjunction with the accompanying drawings.

[0059] The online voice-text alignment system adopted in this embodiment, such as figure 1 As shown, it includes: text processing module, error detection module, error recovery module and mandatory alignment module.

[0060] Among them, the mandatory alignment module, such as figure 2 As shown, it includes: feature extraction module, search space building module and alignment decoding module.

[0061] Among them, the error recovery module, such as image 3 As shown, it includes: a language model estimation module, a language model interpolation module, a speech recognition module, and a text alignment and similarity calculation module.

[0062] Using the online voice-text alignment method of the above-mentioned system, the steps include (such as Figure 4 shown):

[0063] (1) According to the actual application requirements, the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an alignment system of an on-line speech text and a method thereof. The system comprises a text processing module, an error detection module, an error recovery module and a compulsory alignment module. The compulsory alignment module comprises a characteristics extraction module, a search space construction module and an alignment decoding module. The error recovery modulecomprises a language model estimation module, a language model interpolation module, a speech recognition module and a text alignment and similarity calculation module. Method for the system and themethod to detect end of a sentence is an improvement of a conventional method based on Viterbi alignment, information of a search space by beam search is used for estimating activity degree A (t, somegae) of the search space at the end of the sentence, and estimating sentence end time in partial meaning *<*>. The system and the method have the function for automatically detecting and jumping overunmatched segment errors in a text and a speech; can generate online input speech current and corresponding text alignment result at real time and can process a long text with errors.

Description

technical field [0001] The invention relates to the field of television subtitle display, in particular to an online voice-text alignment system and method. Background technique [0002] The proportion of subtitled programs in a country reflects the level of humanity of a country and the degree of social concern for the disabled. At present, TV programs in many countries such as Japan, the United States, and the United Kingdom have been added with subtitles. However, there are very few domestic programs with subtitles added. Even if there is a small amount of subtitle addition, it is limited to recorded programs, and the addition of subtitles is done manually by professionals, which takes a lot of time and energy. The output speed is added one by one. [0003] The core module of the system of the online speech-to-text method in the prior art is an alignment module based on a hidden Markov model. Its main function is to generate the corresponding real-time time correspond...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04N5/278
Inventor 颜永红高杰赵庆卫潘接林
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products