Scene-based real-time voice recognition system and method

A real-time speech recognition system technology, applied in speech recognition, speech analysis, instruments, etc., that addresses the complex corpus, slow update speed, and lack of targeting of general language models, so as to improve the accuracy and efficiency of speech recognition.

Active Publication Date: 2016-03-30
MOBVOI INC
Cites: 4 | Cited by: 81

AI Technical Summary

Problems solved by technology

The corpus behind a general language model is typically complex, slow to update, and not targeted, which leads to low accuracy of speech recognition results.
Existing speech recognition technology is especially inaccurate for homophones and similar-sounding speech. For example, if the collected speech of the user is "xinxinjie", the existing technology cannot determine whether the corresponding text is "Xinxing Street", "Xinxin Street", or some other similar text.
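
The homophone problem can be illustrated with a minimal sketch. It assumes a decoder that adds an acoustic log-score to a language-model log-probability, which is a common formulation and not something the patent text spells out. All scores, probabilities, and function names below are hypothetical toy values chosen for illustration.

```python
import math

# Hypothetical acoustic log-likelihoods: the two candidates are homophones,
# so the acoustic model scores them identically.
ACOUSTIC_LOGP = {"Xinxing Street": -12.0, "Xinxin Street": -12.0}

# Hypothetical general language model: a broad corpus with no preference
# between the two street names.
GENERAL_LM = {"Xinxing Street": 1e-6, "Xinxin Street": 1e-6}


def score(text, lm):
    """Combined log-score: acoustic evidence plus language-model prior."""
    return ACOUSTIC_LOGP[text] + math.log(lm[text])


def rank(lm):
    """Return the candidates sorted from best to worst combined score."""
    return sorted(ACOUSTIC_LOGP, key=lambda t: score(t, lm), reverse=True)


def margin(lm):
    """Score gap between the top two candidates; 0.0 means an unresolved tie."""
    best, second = rank(lm)[:2]
    return score(best, lm) - score(second, lm)


print(margin(GENERAL_LM))  # 0.0
```

With a zero margin, whichever candidate the decoder returns is arbitrary. A more targeted language model is what would break the tie.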


Examples

Detailed Description of the Embodiments

[0054] The present invention will be described in further detail below in conjunction with the accompanying drawings. Those of ordinary skill in the art will appreciate that although the following detailed description refers to the illustrated embodiments and accompanying drawings, the present invention is not limited to these embodiments. Rather, the scope of the invention is broad and it is intended that the scope of the invention be limited only by the appended claims.

[0055] Figure 1 shows a schematic block diagram of speech recognition in the prior art. The speech recognition technology shown in Figure 1 is briefly explained below.

[0056] According to Figure 1, in the prior art, a voice database and a text database are usually established separately from large amounts of voice data and text data. The acoustic model is trained on voice features extracted from the voice data, and the language model is trained on the text data. When the input spe...

Abstract

The invention provides a real-time voice recognition system and method. The method includes the steps of: collecting the current voice and current scene information of a current user; constructing a current scene language model corresponding to the current user; determining the type of the current scene from the current scene information, and looking up the static language model for that scene type among static language models that were constructed for different scene types from historical scene information; and calling a universal language model, and recognizing the voice of the current user based on an acoustic model together with a hybrid of the universal language model, the retrieved static language model, and the scene language model corresponding to the current user. Because the method constructs language models in a combined offline and online manner from various scene information, it can effectively improve the accuracy of voice recognition.
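
The steps in the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the abstract only says the three language models are combined as a "hybrid", so linear interpolation, the weights, the phrase-level frequency models, the `speed_kmh` scene rule, and every name and data value here are hypothetical. The acoustic model is left out by assuming the candidates are acoustically tied homophones.

```python
from collections import Counter


def unigram_lm(corpus):
    """Build a phrase-level relative-frequency model from a list of phrases."""
    counts = Counter(corpus)
    total = sum(counts.values())
    return {phrase: n / total for phrase, n in counts.items()}


def interpolate(models, weights, phrase, floor=1e-9):
    """Linearly mix several models; unseen phrases get a small floor value."""
    return sum(w * m.get(phrase, floor) for m, w in zip(models, weights))


# Offline: one static model per scene type, built from historical scene
# information (toy data, hypothetical).
STATIC_LMS = {
    "navigation": unigram_lm(["Xinxing Street", "Xinxing Street", "Xinxin Street"]),
    "dining": unigram_lm(["hot pot", "noodles"]),
}
GENERAL_LM = unigram_lm(["Xinxing Street", "Xinxin Street", "hot pot", "noodles"])


def classify_scene(scene_info):
    """Hypothetical rule: a user who is moving is treated as navigating."""
    return "navigation" if scene_info.get("speed_kmh", 0) > 5 else "dining"


def recognize(candidates, scene_info, user_phrases, weights=(0.2, 0.3, 0.5)):
    """Pick the candidate with the highest mixed language-model probability."""
    user_lm = unigram_lm(user_phrases)  # online, per-user scene model
    static_lm = STATIC_LMS[classify_scene(scene_info)]
    models = (GENERAL_LM, static_lm, user_lm)
    return max(candidates, key=lambda c: interpolate(models, weights, c))


result = recognize(
    ["Xinxing Street", "Xinxin Street"],
    {"speed_kmh": 30},
    ["Xinxin Street", "Xinxin Street", "Xinxin Street"],  # e.g. nearby place names
)
print(result)  # Xinxin Street
```

In this toy run the general model is indifferent and the static navigation model leans toward "Xinxing Street", but the per-user scene model carries the most weight and settles on "Xinxin Street". Passing an empty `user_phrases` list lets the static model decide instead.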

Description

Technical field

[0001] The invention relates to speech recognition technology, in particular to a scene-based real-time speech recognition system and method.

Background

[0002] In existing speech recognition, a general language model built from corpora in various fields is usually combined with a corresponding acoustic model to recognize the text corresponding to the speech. The corpus behind the general language model is typically complex, slow to update, and not targeted, which leads to low accuracy of speech recognition results. Existing speech recognition technology is especially inaccurate for homophones and similar-sounding speech. For example, if the collected speech of the user is "xinxinjie", the existing technology cannot determine whether the corresponding text is "Xinxing Street", "Xinxin Street", or some other similar text.

Contents of the invention

[0003] One of the te...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/06, G10L15/28
Inventors: 雷欣, 沈李斌
Owner: MOBVOI INC