Unlock instant, AI-driven research and patent intelligence for your innovation.

An end-to-end speaker confirmation method, device and storage medium

A speaker confirmation and level technology, applied in speech analysis, speech recognition, instruments, etc., can solve problems such as inaccurate processing results and poor recognition results

Active Publication Date: 2021-05-18
GUILIN UNIV OF ELECTRONIC TECH
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Speaker identification includes speaker confirmation technology and speaker identification technology. "Speaker confirmation" refers to judging whether a passage is spoken by someone, which is a "one-to-one" problem. "Speaker identification" refers to Selecting an audio that is most similar to an unknown audio sample from the known samples is a "multiple choice" problem; however, in the current "speaker confirmation" technology, the extracted speech frame-level features are usually averaged for processing, and the Some non-important frames in speech features are processed together, resulting in inaccurate processing results and poor recognition results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An end-to-end speaker confirmation method, device and storage medium
  • An end-to-end speaker confirmation method, device and storage medium
  • An end-to-end speaker confirmation method, device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The principles and features of the present invention are described below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0028] figure 1 A method flowchart of an end-to-end speaker confirmation method provided by an embodiment of the present invention;

[0029] figure 2 A method flowchart of an end-to-end speaker confirmation method provided by an embodiment of the present invention;

[0030] Such as Figure 1-2 As shown, an end-to-end speaker confirmation method includes the following steps:

[0031] Build a speaker to confirm the end-to-end network, and the speaker confirms that the end-to-end network includes the ResCNN residual convolutional neural network model of the front end and the threshold reweighted attention model of the back end;

[0032] The speaker confirmation end-to-end network is trained, including:

[0033] A...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides an end-to-end speaker confirmation method, device and storage medium. The method includes constructing a speaker confirmation end-to-end network, and the speaker confirmation end-to-end network includes a front-end ResCNN residual convolutional neural network model And at the back end, the ResCNN residual convolutional neural network model is used to extract speech frame-level features, and the threshold reweighted attention model converts speech frame-level features into sentence-level features, thereby completing the end-to-end network for speaker confirmation Training; the speakers obtained through training confirm the end-to-end network to determine the registrant of the test voice; the present invention realizes end-to-end processing, and the threshold reweighted attention model extracts key speech frame level features by assigning weights, Screen out non-key speech frame-level features, and then perform weighted average processing to amplify key speech frame-level features, transforming frame-level features into sentence-level features, which greatly improves speech recognition.

Description

technical field [0001] The present invention mainly relates to the technical processing field of voiceprint recognition, in particular to an end-to-end speaker confirmation method, device and storage medium. Background technique [0002] Voiceprint recognition, also known as speaker recognition, is a biometric technology, which is the process of extracting, analyzing and extracting the speaker's personality characteristics from a piece of speech, and automatically determining the speaker. Speaker identification includes speaker confirmation technology and speaker identification technology. "Speaker confirmation" refers to judging whether a passage is spoken by someone, which is a "one-to-one" problem. "Speaker identification" refers to Selecting an audio that is most similar to an unknown audio sample from the known samples is a "multiple choice" problem; however, in the current "speaker confirmation" technology, the extracted speech frame-level features are usually averaged...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L17/00G10L17/04G10L17/18G10L15/22
CPCG10L15/22G10L17/00G10L17/04G10L17/18
Inventor 蔡晓东李波
Owner GUILIN UNIV OF ELECTRONIC TECH