Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A fusion method and device for voiceprint features

A voiceprint feature and fusion method technology, which is applied in speech analysis, machine learning, instruments, etc., can solve the problem of not fully considering the complementarity and differentiation between features and fusion features, so as to improve user experience, improve pass rate, and improve extraction. Effect

Active Publication Date: 2021-05-18
SOUNDAI TECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the above-mentioned simple method of calculating the mean value of features or similarity scores to realize the fusion of voiceprint features does not fully consider the complementarity between features and the discrimination of fusion features.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A fusion method and device for voiceprint features
  • A fusion method and device for voiceprint features
  • A fusion method and device for voiceprint features

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] The present disclosure provides a voiceprint feature fusion method, which solves the problem that the existing voiceprint feature fusion method by calculating the average value of voiceprint features or similarity scores is too simple, and the obtained new features are not sufficiently distinguishable for speakers. The problem.

[0050] In order to make the objectives, technical solutions and advantages of the present disclosure clearer, the present disclosure will be further described in detail below with reference to the specific embodiments and the accompanying drawings.

[0051] Certain embodiments of the present disclosure will be described more fully hereinafter with reference to the accompanying drawings, some but not all embodiments of which are shown. Indeed, various embodiments of the present disclosure may be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The disclosure provides a voiceprint feature fusion method, including: extracting voice spectrum features, using the voice spectrum features as input, using a general background model and a global difference space matrix to extract i-vector voiceprint features; using deep neural network Network, extracting x-vector voiceprint features and d-vector voiceprint features; using the i-vector voiceprint features, the x-vector voiceprint features and the d-vector voiceprint features as samples, based on linear discrimination The fusion of the voiceprint features is completed through analysis. By introducing a method based on linear discriminant analysis to fuse multiple voiceprint features, the complementarity of multiple voiceprint features and the discrimination of fusion features are improved, which can ensure that the target speaker can pass through voiceprint authentication. rate, reduce the misrecognition rate of non-target speakers, and improve personalized user experience.

Description

technical field [0001] The present disclosure relates to the field of speech recognition, and in particular, to a method and device for fusing voiceprint features. Background technique [0002] At present, with the popularization of information technology, automatic speech recognition technology is playing an increasingly important role, and its application prospects are also broader. The speech signal mainly contains three aspects of information: who is speaking, what language is spoken, and what is said content. The automatic speech recognition technologies involved are: speaker recognition, language recognition and semantic recognition. Speaker recognition technology, also known as voiceprint recognition, mainly studies the technology of authenticating the identity of the speaker according to the input voice signal. Like other recognition technologies, speaker recognition recognizes the input speaker audio through certain features, so as to confirm the identity of the in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L17/02G10L17/04G10L17/14G10L17/18G10L17/22
CPCG10L17/18G10L17/02G10L17/04G10L25/78G06N3/04G06N20/00G06F17/16G10L17/06G10L25/18G10L25/24
Inventor 冯大航陈孝良苏少炜常乐
Owner SOUNDAI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products