An end-to-end speaker confirmation method, device and storage medium

A speaker confirmation technology, applied in speech analysis, speech recognition, instruments, etc., that can solve problems such as inaccurate processing results and poor recognition results.

Active Publication Date: 2021-05-18

GUILIN UNIV OF ELECTRONIC TECH

5 Cites, 0 Cited by

Problems solved by technology

Speaker recognition comprises speaker confirmation and speaker identification. "Speaker confirmation" means judging whether a passage was spoken by a particular person, which is a "one-to-one" problem. "Speaker identification" means selecting, from the known samples, the audio most similar to an unknown audio sample, which is a "multiple choice" problem. In current speaker confirmation technology, however, the extracted speech frame-level features are usually simply averaged, so non-important frames are processed together with the important ones, resulting in inaccurate processing results and poor recognition results.
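
The difference between the "one-to-one" and "multiple choice" problems can be illustrated with a minimal sketch. It is not taken from the patent: the cosine-similarity scoring, the `0.7` decision threshold, and the names `cosine`, `confirm` and `identify` are all assumptions made for illustration, and the vectors stand in for sentence-level speaker features.

```python
import math


def cosine(a, b):
    """Cosine similarity of two non-zero vectors (assumed scoring function)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def confirm(test_vec, enrolled_vec, threshold=0.7):
    """Speaker confirmation: one-to-one decision against a claimed speaker.

    The threshold value is hypothetical and would be tuned on held-out data.
    """
    return cosine(test_vec, enrolled_vec) >= threshold


def identify(test_vec, enrolled):
    """Speaker identification: pick the most similar of the known speakers."""
    return max(enrolled, key=lambda name: cosine(test_vec, enrolled[name]))


# Toy enrolment data (hypothetical two-dimensional features).
enrolled = {"alice": [1.0, 0.0], "bob": [0.0, 1.0]}
test_vec = [0.9, 0.1]

is_alice = confirm(test_vec, enrolled["alice"])
is_bob = confirm(test_vec, enrolled["bob"])
best_match = identify(test_vec, enrolled)
```

Confirmation returns a yes/no answer for one claimed identity, while identification always returns the closest known speaker, even when none of them is a good match.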

Embodiment Construction

[0027] The principles and features of the present invention are described below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0028] Figure 1 is a flowchart of an end-to-end speaker confirmation method provided by an embodiment of the present invention;

[0029] Figure 2 is a flowchart of an end-to-end speaker confirmation method provided by an embodiment of the present invention;

[0030] As shown in Figures 1-2, an end-to-end speaker confirmation method includes the following steps:

[0031] Construct a speaker confirmation end-to-end network, which includes a front-end ResCNN residual convolutional neural network model and a back-end threshold reweighted attention model;

[0032] Train the speaker confirmation end-to-end network, including:

[0033] A...

Abstract

The present invention provides an end-to-end speaker confirmation method, device and storage medium. The method includes constructing a speaker confirmation end-to-end network comprising a front-end ResCNN residual convolutional neural network model and a back-end threshold reweighted attention model. The ResCNN residual convolutional neural network model extracts speech frame-level features, and the threshold reweighted attention model converts the speech frame-level features into sentence-level features, thereby completing the training of the speaker confirmation end-to-end network. The trained speaker confirmation end-to-end network is then used to determine the registrant of a test voice. The present invention realizes end-to-end processing: the threshold reweighted attention model assigns weights to extract key speech frame-level features and screen out non-key ones, and then performs weighted averaging to amplify the key speech frame-level features, transforming frame-level features into sentence-level features, which greatly improves the recognition effect.
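
The pooling step described in the abstract (assign weights, screen out non-key frames, then take a weighted average) might be sketched as follows. This is only an illustration under stated assumptions, not the patented model: the softmax normalisation, the hard cut-off with renormalisation, the fallback when every frame is screened out, and the names `threshold_reweighted_pooling`, `scores` and `threshold` are all hypothetical. In the real network the scores would come from a learned attention layer and the frames from the ResCNN front end.

```python
import math


def threshold_reweighted_pooling(frames, scores, threshold):
    """Pool frame-level features into one sentence-level vector.

    frames:    list of equal-length feature vectors, one per frame
    scores:    one raw attention score per frame (hypothetical input)
    threshold: weights below this value are zeroed (hypothetical parameter)
    """
    # Softmax turns the raw scores into weights that sum to 1.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Screen out non-key frames, then renormalise the surviving weights.
    kept = [w if w >= threshold else 0.0 for w in weights]
    kept_total = sum(kept)
    if kept_total == 0.0:
        # Assumed fallback: if every frame is screened out, keep all weights.
        kept, kept_total = weights, 1.0
    reweighted = [w / kept_total for w in kept]

    # Weighted average over frames, computed per feature dimension.
    dim = len(frames[0])
    return [sum(w * f[d] for w, f in zip(reweighted, frames)) for d in range(dim)]


# Two key frames and one low-scoring outlier frame (toy values).
frames = [[1.0, 0.0], [0.0, 1.0], [10.0, 10.0]]
scores = [2.0, 2.0, -5.0]
sentence_vec = threshold_reweighted_pooling(frames, scores, threshold=0.1)

# Plain averaging, for comparison, lets the outlier frame dominate.
mean_vec = [sum(f[d] for f in frames) / len(frames) for d in range(2)]
```

In this toy input the third frame receives a weight far below the threshold, so it contributes nothing to `sentence_vec`, whereas the plain average in `mean_vec` is pulled strongly toward it.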

Description

Technical field

[0001] The present invention mainly relates to the technical field of voiceprint recognition processing, in particular to an end-to-end speaker confirmation method, device and storage medium.

Background technique

[0002] Voiceprint recognition, also known as speaker recognition, is a biometric technology: the process of extracting and analyzing the speaker's individual characteristics from a piece of speech and automatically determining the speaker. Speaker recognition comprises speaker confirmation and speaker identification. "Speaker confirmation" means judging whether a passage was spoken by a particular person, which is a "one-to-one" problem. "Speaker identification" means selecting, from the known samples, the audio most similar to an unknown audio sample, which is a "multiple choice" problem. In current speaker confirmation technology, however, the extracted speech frame-level features are usually averaged...

Application Information

Patent Type & Authority: Patents (China)

IPC(8): G10L17/00, G10L17/04, G10L17/18, G10L15/22

CPC: G10L15/22, G10L17/00, G10L17/04, G10L17/18

Inventor: 蔡晓东, 李波

Owner: GUILIN UNIV OF ELECTRONIC TECH
