A fusion method and device for voiceprint features

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A voiceprint feature and fusion method technology, which is applied in speech analysis, machine learning, instruments, etc., can solve the problem of not fully considering the complementarity and differentiation between features and fusion features, so as to improve user experience, improve pass rate, and improve extraction. Effect

Active Publication Date: 2021-05-18

SOUNDAI TECH CO LTD

View PDF4 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the above-mentioned simple method of calculating the mean value of features or similarity scores to realize the fusion of voiceprint features does not fully consider the complementarity between features and the discrimination of fusion features.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0049] The present disclosure provides a voiceprint feature fusion method, which solves the problem that the existing voiceprint feature fusion method by calculating the average value of voiceprint features or similarity scores is too simple, and the obtained new features are not sufficiently distinguishable for speakers. The problem.

[0050] In order to make the objectives, technical solutions and advantages of the present disclosure clearer, the present disclosure will be further described in detail below with reference to the specific embodiments and the accompanying drawings.

[0051] Certain embodiments of the present disclosure will be described more fully hereinafter with reference to the accompanying drawings, some but not all embodiments of which are shown. Indeed, various embodiments of the present disclosure may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The disclosure provides a voiceprint feature fusion method, including: extracting voice spectrum features, using the voice spectrum features as input, using a general background model and a global difference space matrix to extract i-vector voiceprint features; using deep neural network Network, extracting x-vector voiceprint features and d-vector voiceprint features; using the i-vector voiceprint features, the x-vector voiceprint features and the d-vector voiceprint features as samples, based on linear discrimination The fusion of the voiceprint features is completed through analysis. By introducing a method based on linear discriminant analysis to fuse multiple voiceprint features, the complementarity of multiple voiceprint features and the discrimination of fusion features are improved, which can ensure that the target speaker can pass through voiceprint authentication. rate, reduce the misrecognition rate of non-target speakers, and improve personalized user experience.

Description

technical field [0001] The present disclosure relates to the field of speech recognition, and in particular, to a method and device for fusing voiceprint features. Background technique [0002] At present, with the popularization of information technology, automatic speech recognition technology is playing an increasingly important role, and its application prospects are also broader. The speech signal mainly contains three aspects of information: who is speaking, what language is spoken, and what is said content. The automatic speech recognition technologies involved are: speaker recognition, language recognition and semantic recognition. Speaker recognition technology, also known as voiceprint recognition, mainly studies the technology of authenticating the identity of the speaker according to the input voice signal. Like other recognition technologies, speaker recognition recognizes the input speaker audio through certain features, so as to confirm the identity of the in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L17/02G10L17/04G10L17/14G10L17/18G10L17/22

CPCG10L17/18G10L17/02G10L17/04G10L25/78G06N3/04G06N20/00G06F17/16G10L17/06G10L25/18G10L25/24

Inventor冯大航陈孝良苏少炜常乐

OwnerSOUNDAI TECH CO LTD

A fusion method and device for voiceprint features

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology