Modeling method, device and equipment for voice recognition

A technology of speech recognition and modeling method, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of high workload and maintenance cost, cumbersome operation, etc., to ensure recognition accuracy, simplify user operations, and reduce maintenance costs. Effect

Active Publication Date: 2019-07-19
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF15 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In related technologies, the Mandarin recognition model is usually used for speech recognition of Mandarin, and the corresponding dialect recognition model is used for speech recognition of dialects. However, when users switch languages, they need to select the corresponding speech recognition model back and forth, which is cumbersome to operate.
Moreover, with more and more dialects to be supported, the workload and maintenance costs are higher

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Modeling method, device and equipment for voice recognition
  • Modeling method, device and equipment for voice recognition
  • Modeling method, device and equipment for voice recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0033] The speech recognition modeling method, device and equipment of the embodiments of the present invention will be described below with reference to the accompanying drawings.

[0034] figure 1 A schematic flow chart of a speech recognition modeling method provided by an embodiment of the present invention, such as figure 1 As shown, the method includes:

[0035] Step 101, process the first speech data of Mandarin and the first speech data of P dialects respectively according to the pre-trained alignment model, obtain the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a modeling method, device and equipment for voice recognition, wherein the method includes determining N-type labels; training a neural network according to second voice data ofMandarin to generate a recognition model with N-type label output; inputting the second voice data of P-type dialects into the recognition model respectively for processing to acquire an output labelof the second voice data of each frame of the dialects; according to the output label and a real labeled label, determining the error rate of the N-type labels for each dialect in the P-type dialects, and newly generating M-type target labels according to labels with the error rate larger than a preset threshold value; and training an acoustic model according to third voice data of Mandarin and the third voice data of P-type dialects, wherein the output of the acoustic model is the N-type labels and the M-type target labels corresponding to each dialect in the P-type dialects. Therefore, themixed modeling of Mandarin and dialects is realized, while the accuracy of recognition is ensured, the same model can support both Mandarin and multiple dialects.

Description

technical field [0001] The invention relates to the technical field of speech recognition, in particular to a speech recognition modeling method, device and equipment. Background technique [0002] With the development of speech recognition technology, the performance of speech recognition has been practical. For example, various input methods on mobile phones have voice interaction functions. In practical applications, in addition to the speech recognition of the Mandarin scene, there is also the speech recognition of the dialect scene. At present, there are many voice interaction products that support dialect voice recognition, such as voice recognition options on mobile phone input methods, users can choose the corresponding dialect according to their needs, and some smart TVs and smart refrigerators customized for specific dialects. [0003] In related technologies, a Mandarin recognition model is usually used for speech recognition of Mandarin, and a corresponding dial...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/14G10L15/18G10L25/18G10L25/24G10L25/30
CPCG10L15/063G10L15/144G10L15/18G10L25/18G10L25/24G10L25/30G10L15/16G10L15/142G10L15/02
Inventor 袁胜龙
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products