Speech recognition method and device based on Chinese and English mixed dictionary

A speech recognition, Chinese and English technology, applied in speech recognition, speech analysis, natural language data processing, etc., can solve the problem of low accuracy of speech recognition

Active Publication Date: 2017-09-22
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF6 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] For this reason, the first object of the present invention is to propose a kind of speech recognition method based on Chinese-English mixed dictionary, for solving the low problem of speech recognition accuracy in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition method and device based on Chinese and English mixed dictionary
  • Speech recognition method and device based on Chinese and English mixed dictionary
  • Speech recognition method and device based on Chinese and English mixed dictionary

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0042] The speech recognition method and device based on the Chinese-English mixed dictionary according to the embodiment of the present invention will be described below with reference to the accompanying drawings.

[0043] figure 1 It is a schematic flowchart of a speech recognition method based on a Chinese-English mixed dictionary provided by an embodiment of the present invention. Such as figure 1 Shown, this speech recognition method based on Chinese-English mixed dictionary comprises the following steps:

[0044] S101. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a speech recognition method and a speech recognition device based on a Chinese and English mixed dictionary. The speech recognition method comprises the steps of: acquiring the Chinese and English mixed dictionary marked by the International Phonetic Alphabet IPA, wherein the Chinese and English mixed dictionary comprises a Chinese dictionary and an English dictionary corrected by means of the Chinese dictionary; training the model by regarding the Chinese and English mixed dictionary as a training dictionary, regarding a layer of convolutional neural network CNN plus five layers of Long Short-Term Memory (LSTM) network as a model, regarding status of the IPA as a target and regarding a connectionist time classifier CTC as a training criterion, so as to obtain a trained CTC acoustic model; and combining with the trained CTC acoustic model for performing speech recognition on a Chinese and English mixed language. According to the speech recognition method and the speech recognition device, the Chinese and English mixed dictionary comprising the Chinese dictionary and the English dictionary corrected by means of the Chinese dictionary is adopted for training, the English word coverage is comprehensive and Chinglish can be recognized, and the accuracy degree of Chinese and English mixed language recognition is further improved by combining the application of the CTC acoustic model.

Description

technical field [0001] The invention relates to the technical field of voice recognition, in particular to a voice recognition method and device based on a Chinese-English mixed dictionary. Background technique [0002] Currently, with the globalization of life, the phenomenon of using mixed languages ​​to communicate has become a common phenomenon. Statistically, there are more people who speak multiple languages ​​than monolingual speakers. Acoustics between mixed languages ​​and complexities between languages ​​pose challenges for speech recognition. Therefore, the study of mixed language acoustic models is an important research direction. [0003] Hybrid speech recognition technology refers to the use of Chinese and English mixed dictionaries to train the mixed language acoustic model to obtain a speech recognition model. At present, the method of obtaining a Chinese-English mixed dictionary is to obtain a Chinese dictionary including a phoneme set marked with consona...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/00G10L15/06G10L15/16
CPCG10L15/005G10L15/063G10L15/16G10L15/193G10L15/187G10L2015/0635G10L2015/025G06F40/242G10L15/22G10L15/02
Inventor 李先刚张雪薇
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products