Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voiceprint recognition system and method based on multi-scale multi-level model

A voiceprint recognition, multi-level technology, applied in speech analysis, instruments, etc., can solve the problems of not being able to consider the distinction of speaker identity in combination, and it is difficult to solve the problem of system performance degradation, so as to achieve enhanced model effect and good robustness. Effect

Pending Publication Date: 2022-03-22
四川启睿克科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 1. Only the weight of each frame can be considered fixedly, but the discrimination of a continuous speech signal sub-interval as a whole for the speaker's identity cannot be considered in combination;
[0006] 2. It is difficult to solve the system performance degradation caused by the large difference between the length of the speech signal of the training corpus and the length of the test speech signal;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voiceprint recognition system and method based on multi-scale multi-level model
  • Voiceprint recognition system and method based on multi-scale multi-level model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0039] Such as figure 1 As shown, a voiceprint recognition method based on a multi-scale multi-level model, including:

[0040] Step 1. Obtain voice data with speaker annotation; the data includes audio and annotation, and the specific annotation content is the identity of the speaker.

[0041] After acquiring voice data, the original data can be directly used for subsequent operations, or the original data can be augmented before use. Augmentation methods include adding noise and reverberation to the original data, or splicing, truncating, and inverting the data, etc. operate. The robustness of the system can be improve...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of voiceprint recognition, and provides a voiceprint recognition method based on a multi-scale and multi-level model in order to improve the accuracy of voiceprint recognition, comprising the following steps: step 1, acquiring voice data with speaker labels; step 2, dividing the voice data into feature segments according to different scales, wherein each division scale corresponds to a hierarchy; step 3, constructing and training a multi-scale multi-level model corresponding to the data division mode; and 4, inputting to-be-recognized voice data into the multi-scale multi-level model obtained by training in the step 3 for voiceprint recognition. The invention discloses a voiceprint recognition system based on a multi-scale and multi-level model. The voiceprint recognition system comprises a data acquisition unit, a data division unit, a model construction unit, a model training unit and a voiceprint recognition unit. By adopting the mode, the accuracy of the voiceprint recognition model is improved.

Description

technical field [0001] The invention relates to the technical field of voiceprint recognition, in particular to a voiceprint recognition system and method based on a multi-scale and multi-level model. Background technique [0002] With the rapid development of artificial intelligence technology, more and more products incorporating artificial intelligence technology appear in people's daily life. Among them, voiceprint information, as an important biometric feature, is one of the effective ways for user identity verification. The mining and recognition of voiceprint information has also achieved good development and wide application in recent years, especially in the field of security and smart device products. [0003] However, the identity information contained in a piece of voice data is not evenly distributed on the voice signal, that is, different positions of the same voice signal show different distinctions for speakers. Therefore, the method of giving equal importa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L17/00G10L17/04G10L25/24
CPCG10L17/00G10L17/04G10L25/24
Inventor 汪欣谢川展华益
Owner 四川启睿克科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products