Speech recognition processing method and device

A speech recognition processing method and device, applied in speech recognition, speech analysis, instruments, etc. It addresses problems such as "h" and "f" being indistinguishable and acoustic models not taking into account differences in users' Mandarin accents, which helps ensure practicality.

Active Publication Date: 2016-12-21
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD
5 Cites, 52 Cited by

AI Technical Summary

Problems solved by technology

[0003] Under normal circumstances, a user's Mandarin pronunciation may carry some degree of dialect accent. For example, in the Mandarin pronunciation of users with a Hunan accent, "h" and "f" are often indistinguishable. The Mandarin acoustic model in Mandarin speech recognition products, however, is built for users across the whole country and does not take into account differences in users' Mandarin accents.



Examples


Embodiment Construction

[0055] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0056] The voice recognition processing method and device according to the embodiments of the present invention will be described below with reference to the accompanying drawings.

[0057] Figure 1 is a flowchart of a speech recognition processing method according to an embodiment of the present invention. As shown in Figure 1, the method includes:

[0058] S110: training a preset processing model on speech sample data from all regions of the country to generate a common Mandarin acoustic model.

...


Abstract

The invention discloses a speech recognition processing method and device. The method comprises: training a preset processing model on speech sample data from all regions of the country to generate a universal Mandarin acoustic model; and performing adaptive training on the universal Mandarin acoustic model with the speech sample data of each province, to generate a Mandarin acoustic model with a dialect accent corresponding to each province. Mandarin acoustic models with dialect accents are thus built on the basis of the accent differences of users in different regions, which improves speech recognition performance.
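The two-stage scheme in the abstract can be illustrated with a small sketch. Everything here is an assumption made for illustration, not the patent's method: the "acoustic model" is reduced to one mean vector per phoneme, the features are made-up one-dimensional numbers, and the adaptive training step is stood in for by a MAP-style mean update with a hypothetical prior weight `tau`. The source does not say which model type or adaptation technique is used.

```python
from collections import defaultdict

# Toy data, invented for illustration: (province, phoneme label, feature vector).
# The Hunan "h" samples are placed close to "f" to mimic the confusion described.
SAMPLES = [
    ("beijing", "h", [0.0]), ("beijing", "h", [0.4]),
    ("beijing", "f", [4.0]), ("beijing", "f", [3.6]),
    ("guangdong", "h", [0.2]), ("guangdong", "h", [-0.2]),
    ("guangdong", "f", [4.2]), ("guangdong", "f", [3.8]),
    ("hunan", "h", [2.4]), ("hunan", "h", [2.8]),
    ("hunan", "f", [4.4]), ("hunan", "f", [4.0]),
]


def train_general(samples):
    """Stage 1: pool data from all provinces and compute one mean per label."""
    sums, counts = {}, defaultdict(int)
    for _province, label, x in samples:
        acc = sums.setdefault(label, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[label] += 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def adapt(general, province_samples, tau=2.0):
    """Stage 2 (assumed technique): MAP-style update of each mean,
    (tau * general_mean + sum of province features) / (tau + n)."""
    adapted = {}
    for label, mean in general.items():
        xs = [x for lab, x in province_samples if lab == label]
        n = len(xs)
        adapted[label] = [
            (tau * m + sum(x[i] for x in xs)) / (tau + n)
            for i, m in enumerate(mean)
        ]
    return adapted


def classify(model, x):
    """Return the label whose mean is nearest to x (squared Euclidean distance)."""
    return min(model, key=lambda lab: sum((a - b) ** 2 for a, b in zip(model[lab], x)))


general = train_general(SAMPLES)
provinces = sorted({p for p, _, _ in SAMPLES})
models = {
    p: adapt(general, [(lab, x) for prov, lab, x in SAMPLES if prov == p])
    for p in provinces
}

# A held-out Hunan-accented "h" (hypothetical feature value).
utterance = [2.7]
print("general model:", classify(general, utterance))
print("hunan model:  ", classify(models["hunan"], utterance))
```

With this toy data the pooled model labels the Hunan-accented utterance as "f", while the Hunan-adapted model labels it as "h". Starting each province model from the general one, rather than training it from scratch, lets a province with little data still benefit from the nationwide samples.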

Description

Technical field

[0001] The invention relates to the technical field of voice recognition, in particular to a voice recognition processing method and device.

Background technique

[0002] The performance of speech recognition is one of the key factors affecting the practicality of speech recognition products. As the main component of speech recognition, the acoustic model plays a key role in that performance. In training the acoustic model, how to make comprehensive use of various kinds of information to improve the model's performance and generalization ability has important theoretical research and practical application value for the speech recognition industry.

[0003] Under normal circumstances, a user's Mandarin pronunciation may carry some degree of dialect accent. For example, in the Mandarin pronunciation of users with a Hunan accent, "h" and "f" are often indistinguishable, while Mandarin speech recognition products The Mandarin a...


Application Information

IPC(8): G10L15/06
CPC: G10L15/063, G10L15/16, G10L15/065, G10L15/01, G10L15/07, G10L2015/0631
Inventors: 李先刚, 蒋兵
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD