Method and system for realizing automatic addition of punctuation marks in speech recognition

A technology of speech recognition and punctuation marks, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of lack of accuracy and flexibility of punctuation marks, and achieve the effect of simple, efficient and automatic addition, ensuring accuracy and flexibility

Active Publication Date: 2011-11-02
IFLYTEK CO LTD
View PDF4 Cites 92 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are still some problems in the practical application of this scheme. Due to the differences of users and the diversity of punctuation marks, not all users will generate enough noise in speech, so the addition of punctuation marks in this scheme lacks accuracy. and flexibility

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for realizing automatic addition of punctuation marks in speech recognition
  • Method and system for realizing automatic addition of punctuation marks in speech recognition
  • Method and system for realizing automatic addition of punctuation marks in speech recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.

[0029] The embodiment of the present invention realizes the method and system for automatically adding punctuation marks in speech recognition, by performing speech recognition on the collected user voice signal, generating a text sequence containing a plurality of sentences; and sequentially calculating the duration of pause positions between sentences in the text sequence ; If the duration is less than the preset threshold value, then add a comma at the pause position; if the duration is greater than or equal to the threshold value, then determine the tone type of the sentence before the pause position, and according to the determined sentence type Add punctuation at that pause. Therefore, the automatic additi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of speech recognition and discloses a method and system for realizing automatic addition of punctuation marks in the speech recognition. The method comprises the steps of: collecting user speech signals; carrying out the speech recognition on the user speech signals so as to generate a character sequence containing a plurality of sentences; sequentially calculating duration of pause positions between the sentences in the character sequence; if the duration is less than a preset threshold value, adding commas at the pause positions; and if the duration is greater than or equal to the preset threshold value, confirming the mood types of the sentences in front of the pause positions by utilizing a pre-generated classifier and adding punctuation marks at the pause positions according to the types. By utilizing the method and system which are provided by the invention, the automatic addition of the punctuation marks can be simply and conveniently realized and the accuracy and the flexibility of adding the punctuation marks are increased.

Description

technical field [0001] The invention relates to the technical field of speech recognition, in particular to a method and system for automatically adding punctuation marks in speech recognition. Background technique [0002] At present, most speech recognition systems use methods based on statistical pattern recognition. First, the time-domain sound waves of speech input are converted into a digital vector feature to describe and distinguish different pronunciations, and an acoustic model is established for all pronunciations based on the sound features; At the same time, for the continuous speech recognition system with a large vocabulary, a language model is needed, which includes the usage methods of commonly used words in the recognized language. The working process of a general continuous speech recognition system can be described as, in a huge space of words, words, phrases or sentences, find the word, word, phrase or sentence that matches the given input sound feature ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/26G10L15/02G10L15/14G10L15/18
Inventor 陈志刚蒋成林俞健魏思胡郁胡国平王智国刘庆峰
Owner IFLYTEK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products