Speech-recognition text classification method and device

A text classification technology for speech recognition, applied in the speech recognition and communication fields. It addresses problems such as classification errors and improves classification accuracy.

Active Publication Date: 2014-01-15
CHINA MOBILE GROUP ANHUI

AI Technical Summary

Problems solved by technology

Because of a wrong recognition result, the contribution score of "casualty" to the "GPRS" class is 0, so the final scores of "telephone charge query" and "GPRS" are both 0.3, which results in a classification error.



Examples


Embodiment Construction

[0030] The specific embodiments of the present invention will be described in detail below in conjunction with the accompanying drawings, but it should be understood that the protection scope of the present invention is not limited by the specific embodiments.

[0031] In the present invention, the word confusion network (Word Confusion Network, WCN) produced by the speech recognition system is used as the input of the text classifier. The word confusion network includes not only the first preferred result of the speech recognition system but also several other most likely recognition results, i.e. confusion words, as shown in Figure 4. The text classifier is one kind of Support Vector Machine (SVM) classifier, where "SVM classifier" is a general term for a family of classifiers.
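The structure described in [0031] can be illustrated with a minimal Python sketch. It is not taken from the patent: the slot-list representation, the function names `one_best` and `wcn_features`, the posterior values, and the choice to weight each confusion word by its posterior are all assumptions made for illustration.

```python
# Assumed representation: a word confusion network (WCN) is a list of slots,
# and each slot is a list of (word, posterior) hypotheses competing for the
# same position in the utterance. All posterior values are hypothetical.

def one_best(wcn):
    """Return the highest-posterior word of every slot (the first preferred result)."""
    return [max(slot, key=lambda hyp: hyp[1])[0] for slot in wcn]

def wcn_features(wcn, vocabulary):
    """Bag-of-words vector in which each confusion word contributes its posterior.

    Posterior weighting is an assumption; the source only states that
    confusion words are converted into text features.
    """
    weights = {}
    for slot in wcn:
        for word, posterior in slot:
            weights[word] = weights.get(word, 0.0) + posterior
    return [weights.get(word, 0.0) for word in vocabulary]

# Hypothetical utterance in which "GPRS" was misrecognized as "casualty".
example_wcn = [
    [("query", 0.9), ("quarry", 0.1)],
    [("casualty", 0.6), ("GPRS", 0.4)],
]
vocabulary = ["query", "GPRS", "telephone"]

best_path = one_best(example_wcn)                 # first preferred result only
features = wcn_features(example_wcn, vocabulary)  # keeps the "GPRS" evidence
```

In this sketch the first preferred result loses the word "GPRS" entirely, while the WCN-based feature vector still carries a nonzero weight for it, which is the kind of information the classifier is meant to use.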

[0032] The text classifier uses the word confusion network as input. When a word is recognized as the first preferred result by the speech recognition system...


Abstract

The invention discloses a speech-recognition text classification method and device. The method comprises the steps of: collecting, for each service class, training texts and training speech whose content is identical to that of the training texts; decoding the training speech to obtain a word confusion network of the training speech; extracting text features according to the training texts and the word confusion network; training a support vector machine classifier in a classifier set according to the text features; and using the trained support vector machine classifier to classify texts. The method and device convert a word graph network into a word confusion network suitable for text classification. After the confusion words contained in the word confusion network are converted into text features, a support vector machine algorithm carries out text classification based on the confusion words. More accurate classification results can thus be obtained, and the accuracy of speech-recognition text classification is improved.
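The steps listed in the abstract can be sketched end to end in Python. This is only an illustration under stated assumptions, not the patented implementation: the training data and posteriors are invented, the vocabulary is assumed to come from the training texts while feature values come from WCN posteriors, the "classifier set" is assumed to be one one-vs-rest classifier per service class, and a small linear SVM trained by subgradient descent on the hinge loss stands in for whatever SVM implementation the inventors used.

```python
def build_vocabulary(training_texts):
    """Collect the distinct words of all training texts in first-seen order."""
    vocabulary = []
    for words in training_texts:
        for word in words:
            if word not in vocabulary:
                vocabulary.append(word)
    return vocabulary

def wcn_features(wcn, vocabulary):
    """Sum the posterior of every confusion word found in the vocabulary (assumed weighting)."""
    weights = {}
    for slot in wcn:
        for word, posterior in slot:
            weights[word] = weights.get(word, 0.0) + posterior
    return [weights.get(word, 0.0) for word in vocabulary]

def train_linear_svm(samples, labels, epochs=200, lr=0.1, lam=0.01):
    """Subgradient descent on the L2-regularized hinge loss; labels are +1 or -1."""
    w, b = [0.0] * len(samples[0]), 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            margin = y * (sum(wj * xj for wj, xj in zip(w, x)) + b)
            w = [(1.0 - lr * lam) * wj for wj in w]  # regularization shrink
            if margin < 1.0:  # margin violated: step toward this sample
                w = [wj + lr * y * xj for wj, xj in zip(w, x)]
                b += lr * y
    return w, b

def train_classifier_set(samples, class_labels):
    """Train one one-vs-rest linear SVM per service class (assumed meaning of 'set')."""
    return {
        cls: train_linear_svm(samples, [1 if c == cls else -1 for c in class_labels])
        for cls in sorted(set(class_labels))
    }

def classify(classifier_set, x):
    """Return the service class whose SVM gives the highest decision value."""
    def score(cls):
        w, b = classifier_set[cls]
        return sum(wj * xj for wj, xj in zip(w, x)) + b
    return max(classifier_set, key=score)

# Hypothetical training data: one text and one decoded WCN per service class.
training_texts = [["telephone", "charge", "query"], ["GPRS", "query"]]
training_wcns = [
    [[("telephone", 0.8), ("television", 0.2)],
     [("charge", 0.9), ("change", 0.1)],
     [("query", 1.0)]],
    [[("casualty", 0.6), ("GPRS", 0.4)], [("query", 1.0)]],
]
class_labels = ["telephone charge query", "GPRS"]

vocabulary = build_vocabulary(training_texts)
samples = [wcn_features(wcn, vocabulary) for wcn in training_wcns]
classifier_set = train_classifier_set(samples, class_labels)

# A new utterance whose first preferred result is the wrong word "casualty".
test_wcn = [[("casualty", 0.6), ("GPRS", 0.4)], [("query", 1.0)]]
predicted = classify(classifier_set, wcn_features(test_wcn, vocabulary))
```

Because the feature vector keeps the lower-ranked hypothesis "GPRS", the classifier can still separate the two service classes even though the top recognition result is wrong. In practice a library SVM would replace `train_linear_svm`.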

Description

Technical field

[0001] The present invention relates to the technical field of speech recognition in the field of communication, in particular to a text classification method and device for speech recognition.

Background technique

[0002] Text classification refers to the process of automatically determining the text category according to the text content under the given classification target. With the help of text classification technology, classifying text can allow machines to understand human language, thereby realizing intelligent voice interaction. Text classification technology has been widely used in the fields of human-computer interaction such as Internet search and speech recognition.

[0003] In the self-service speech recognition service system, the text classification technology is used to classify the text results of speech recognition, and according to the different final categories, the self-service speech service system provides different self-service spe...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/353, G06F18/2411
Inventor: 万鹏, 梁政, 刘江, 鹿晓亮, 李钊辉, 刘庆峰
Owner: CHINA MOBILE GROUP ANHUI