Speech recognition processing method and device

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of speech recognition and processing methods, applied in speech recognition, speech analysis, instruments, etc., can solve the problem of "h" and "f" not distinguishing, not taking into account the difference of user's Mandarin accent, etc., to ensure the effect of practicability

Active Publication Date: 2016-12-21

BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

View PDF5 Cites 52 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] Under normal circumstances, the user's Mandarin pronunciation may have a certain degree of dialect accent. For example, in the Mandarin pronunciation of users with a Hunan accent, "h" and "f" often appear indistinguishable, while Mandarin speech recognition products The Mandarin acoustic model is for users all over the country, and does not take into account the differences in the accents of users in Mandarin

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0055] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0056] The voice recognition processing method and device according to the embodiments of the present invention will be described below with reference to the accompanying drawings.

[0057] figure 1 is a flowchart of a speech recognition processing method according to an embodiment of the present invention, such as figure 1 As shown, the method includes:

[0058]S110, performing training on a preset processing model according to speech sample data from all regions of the country to generate a common Mandarin acoustic model.

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech recognition processing method and device. The method comprises: according to speed sample data of all areas of the country, training is carried out on a preset processing model to generate a universal mandarin acoustic model; and on the basis of the speech sample data of all provinces, adaptive training is carried out on the universal mandarin acoustic model respectively to generate mandarin acoustic models with dialectal accents, wherein the mandarin acoustic models corresponding to all provinces. Therefore, on the basis of the accent difference of users at different areas, mandarin acoustic models with dialectal accents are established, so that the speech recognition performance is improved.

Description

technical field [0001] The invention relates to the technical field of voice recognition, in particular to a voice recognition processing method and device. Background technique [0002] The performance of speech recognition is one of the key factors affecting the practicality of speech recognition products. As the main component of speech recognition, the acoustic model plays a key role in the performance of speech recognition. In the training of the acoustic model, how to comprehensively utilize various information to improve the performance and promotion ability of the acoustic model has important theoretical research and practical application value for the speech recognition industry. [0003] Under normal circumstances, the user's Mandarin pronunciation may have a certain degree of dialect accent. For example, in the Mandarin pronunciation of users with a Hunan accent, "h" and "f" often appear indistinguishable, while Mandarin speech recognition products The Mandarin a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/06

CPCG10L15/063G10L15/16G10L15/065G10L15/01G10L15/07G10L2015/0631

Inventor李先刚蒋兵

OwnerBAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Speech recognition processing method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology