Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Audio feature extraction method and device based on re-parameterized decoupling mode

An audio feature and extraction method technology, applied in the field of voiceprint feature extraction, can solve the problems of impracticality, unsimple model, and poor performance of multi-branch structure, etc., and achieve the effect of fast speed, good convergence effect, and low memory consumption

Pending Publication Date: 2021-07-23
SPEAKIN TECH CO LTD
View PDF7 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The core of these attempts is to train a deeper network, but there are no good results. The performance is generally not as good as the multi-branch structure, and the obtained models are often neither simple nor practical.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio feature extraction method and device based on re-parameterized decoupling mode
  • Audio feature extraction method and device based on re-parameterized decoupling mode
  • Audio feature extraction method and device based on re-parameterized decoupling mode

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] In order to enable those skilled in the art to better understand the solution of the application, the technical solution in the embodiment of the application will be clearly and completely described below in conjunction with the drawings in the embodiment of the application. Obviously, the described embodiment is only It is a part of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0053] figure 1 It is a method flowchart in an embodiment of an audio feature extraction method based on a decoupling method based on reparameterization in the present application, such as figure 1 as shown, figure 1 Including:

[0054] 101. Obtain the speech sample to be tested of the target speaker;

[0055] It should be noted that the present application can acquire the target s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an audio feature extraction method and device based on a re-parameterized decoupling mode. The method comprises the following steps: acquiring a to-be-detected voice sample of a target speaker; preprocessing the to-be-detected voice sample; extracting acoustic features of the preprocessed to-be-detected voice sample; and inputting the acoustic features into a network reasoning module to obtain voiceprint feature vectors, wherein the network reasoning module is a network model of a single-path structure converted by a trained multi-layer network training module through re-parameterization. According to the method, the multi-branch structure is used in the training stage to achieve a better convergence effect, the multi-branch structure is re-parameterized into the single-path structure in the reasoning stage to obtain a better effect than the multi-branch structure with equivalent parameter quantity, the speed is higher, and the memory consumption is lower.

Description

technical field [0001] The present application relates to the technical field of voiceprint feature extraction, in particular to an audio feature extraction method and device based on reparameterized decoupling. Background technique [0002] Existing high-performance network structures include multi-branch structures and network components with excellent performance. Among them, the performance of the multi-branch structure can be greatly improved compared with the previous single-channel structure. Like GoogleNet, Inception, etc., all belong to the multi-channel structure. And network components with excellent performance, including depthwise separable convolution, group convolution, etc., can significantly increase network performance. However, although the multi-branch structure and high-performance components can significantly improve the performance of the model, it will eventually cause the model to slow down and consume memory during inference, which is very unfavor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/30G10L17/18G10L17/02G06N3/04G06N3/08
CPCG10L25/30G10L17/18G10L17/02G06N3/08G06N3/045
Inventor 许敏强马雨枫赵淼刘敏
Owner SPEAKIN TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products