Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method, device, computer equipment and storage medium for establishing voiceprint model

A model and voiceprint technology, applied in the computer field, can solve the problem of high error rate of recognition and achieve the effect of reducing the error rate of voice recognition

Active Publication Date: 2020-06-05
PING AN TECH (SHENZHEN) CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The current voiceprint recognition method works well for long voice texts (the length of the speaker's voice is more than 1 minute), but for short voice texts (the length of the speaker's voice is less than 1 minute, such as about 20s). The error rate is still relatively high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device, computer equipment and storage medium for establishing voiceprint model
  • Method, device, computer equipment and storage medium for establishing voiceprint model
  • Method, device, computer equipment and storage medium for establishing voiceprint model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0059] refer to figure 1 , the embodiment of the present application provides a method for establishing a voiceprint model, including steps:

[0060] S1. Framing the voice signal of the input target user, and respectively extracting the voice acoustic features of the voice signal after the frame division;

[0061] S2. Input a plurality of said speech acoustic features into a deep learning model based on neural network training, and assemble into at least one cluster structure;

[0062] S3. Calculate the mean value and standard deviation of at least one cluster structure;

[0063] S4. Perform coordinate transformation and activation function calculation on the mean value and standard deviation to obtain eigenvector parameters;

[0064]S5. Input the feature vector parameters and the identity veri...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application discloses a method, device, computer equipment and storage medium for establishing a voiceprint model, wherein the method includes: dividing the input target user's voice signal into frames, and extracting the voice acoustic features of the framed voice signals respectively; A plurality of said speech acoustic features are input into a deep learning model based on neural network training, and are assembled into at least one cluster structure; calculating the mean value and standard deviation of at least one said cluster structure; performing coordinate transformation on said mean value and standard deviation and calculating the activation function to obtain feature vector parameters; inputting the feature vector parameters and the identity verification result of the target user into a preset basic model to obtain a voiceprint model corresponding to the target user. The speech acoustic features extracted in this application are based on the cluster structure obtained in the deep neural network training, and then the cluster structure is subjected to coordinate mapping and activation function calculation to obtain a voiceprint model, which can reduce the voice recognition error rate of the voiceprint model.

Description

technical field [0001] The present application relates to the field of computer technology, in particular to a method, device, computer equipment and storage medium for establishing a voiceprint model. Background technique [0002] A voiceprint is a sound wave spectrum that carries speech information displayed by an electroacoustic instrument. Modern scientific research shows that voiceprint is not only specific, but also relatively stable. After adulthood, the human voice can remain relatively stable for a long time. The voiceprint recognition algorithm extracts various speech features from the sound spectrum and establishes a recognition model to confirm the speaker. The current voiceprint recognition method works well for long voice texts (the length of the speaker's voice is more than 1 minute), but for short voice texts (the length of the speaker's voice is less than 1 minute, such as about 20s). The error rate is still relatively high. [0003] Therefore, how to es...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L17/04G10L17/02G10L17/18
CPCG10L17/02G10L17/04G10L17/18G10L17/20G06N3/08G06F17/18G10L15/02G10L15/16G06F18/23G06N3/048
Inventor 蔡元哲王健宗程宁肖京
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products