Voice recognition model acquisition method and device, electronic equipment and storage medium

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A speech recognition model and acquisition method technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of poor speech recognition effect, no solution proposed, unsatisfactory speech separation effect and recognition accuracy, etc.

Pending Publication Date: 2022-01-04

BEIJING BAIDU NETCOM SCI & TECH CO LTD

View PDF0 Cites 2 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] However, the existing speaker recognition system and speech transcription system are not ideal for the situation where the speakers overlap, the speech separation effect and the recognition accuracy are not ideal, and the number of speakers needs to be set in advance to determine the number of branches of the network. Speech recognition doesn't work well with volume changes

[0004] For the above problems, no effective solution has been proposed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0031] According to an embodiment of the present disclosure, an embodiment of a method for acquiring a speech recognition model is provided. It should be noted that the steps shown in the flow chart of the accompanying drawings can be executed in a computer system such as a set of computer-executable instructions, Also, although a logical order is shown in the flowcharts, in some cases the steps shown or described may be performed in an order different from that shown or described herein.

[0032] figure 1 is a schematic flowchart of the steps of the method for acquiring a speech recognition model according to the first embodiment of the present disclosure, as shown in figure 1 As shown, the method includes the following steps:

[0033]Step S102, obtaining multiple sets of label data, wherein each set of data in the above multiple sets of label data includes: audio sample data of sample objects, and a set of sample objects obtained by performing feature vector extraction proc...

Embodiment 2

[0094] According to an embodiment of the present disclosure, an embodiment of an apparatus for implementing the above speech recognition method is also provided, Figure 4 is a schematic structural diagram of an acquisition device for a speech recognition model according to a second embodiment of the present disclosure, as shown in Figure 4 As shown, the acquisition device of the above-mentioned speech recognition model includes: an acquisition unit 40 and a training unit 42, wherein:

[0095] The acquisition unit 40 is configured to acquire multiple sets of label data, wherein each set of data in the above multiple sets of label data includes: audio sample data of sample objects, and a set of sample objects obtained by performing feature vector extraction processing on the above audio sample data, The above-mentioned audio sample data includes dialogue content of multiple above-mentioned sample objects;

[0096] The training unit 42 is configured to use multiple sets of lab...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice recognition model acquisition method and device, electronic equipment and a storage medium, and relates to the fields of natural voice understanding, voice technologies, intelligent customer service and voice transcription. According to the specific implementation scheme, obtaining multiple sets of label data, wherein each set of data in the multiple sets of label data comprises audio sample data of sample objects and a sample object set obtained by conducting feature vector extraction processing on the audio sample data, and the audio sample data comprise dialogue content of the multiple sample objects; and training a neural network model through machine learning by using the multiple groups of label data to obtain a speech recognition model. According to the invention, the technical problem of poor speech recognition effect of a speech recognition model in the prior art is solved.

Description

technical field [0001] The present disclosure relates to the field of artificial intelligence technology, in particular to the fields of natural speech understanding, speech technology, intelligent customer service, and speech transcription, and in particular to a method, device, electronic device, and storage medium for acquiring a speech recognition model. Background technique [0002] Most common speech recognition methods in the related art firstly separate the audio speakers, and then perform voice transcription on the separated audio to obtain the distinguished corresponding speaker's text. [0003] However, the existing speaker recognition system and speech transcription system are not ideal for the situation where the speakers overlap, the speech separation effect and the recognition accuracy are not ideal, and the number of speakers needs to be set in advance to determine the number of branches of the network. Speech recognition does not work well with varying numbe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/06G10L15/16G10L15/26G10L15/18G10L17/00G10L21/0272G10L21/0308

CPCG10L15/063G10L15/16G10L15/26G10L15/1822G10L17/00G10L21/0272G10L21/0308G10L2015/0631G10L2015/0635

Inventor赵情恩

OwnerBEIJING BAIDU NETCOM SCI & TECH CO LTD

Voice recognition model acquisition method and device, electronic equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements:Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology