Audio processing method, vocoder, device, equipment and storage medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
An audio processing and audio technology, applied in the field of audio and video processing, can solve the problems of slow audio synthesis processing speed and lower audio processing efficiency, and achieve the effects of improving efficiency, increasing processing speed, and reducing the number of cycles

Pending Publication Date: 2021-10-22

TENCENT TECH (SHENZHEN) CO LTD

View PDF0 Cites 4 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the current vocoder usually needs to perform multiple cycles based on multiple sampling time points in the audio feature signal to complete speech prediction, and then complete speech synthesis, which leads to slower processing speed of audio synthesis and reduces the efficiency of audio processing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0056] In order to make the present application, the present application will be further described in detail below with reference to the accompanying drawings, and the embodiments described will not be considered in the limitation of the present invention. All other embodiments obtained under the premise of creative labor belong to the scope of this application.

[0057] In the following description, "some embodiments" describe the subset of all possible embodiments, but it can be understood that "some embodiments" may be the same subset or different subset of all possible embodiments, and It can be combined with each other without conflict.

[0058] In the following description, the term "first \ second \ third" involved is only a different object, does not represent specific sorting for the object, can understand, "first \ second \ third" The specific order or prior order can be interchanged in the case of allowing, so that the present application embodiments described herein ca...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides an audio processing method, a vocoder, a device, equipment and a storage medium. The method comprises the following steps: performing speech feature conversion on a to-be-processed text to obtain at least one acoustic feature frame; extracting condition features from each acoustic feature frame through a frame rate network; performing frequency band division and time domain downsampling on the current frame to obtain n subframes containing a preset number of sampling points; synchronously performing sampling value prediction on the current m adjacent sampling points corresponding to the n sub-frames in the ith round of prediction process through the sampling prediction network to obtain m * n sub-prediction values, and further obtaining n sub-prediction values corresponding to each sampling point in the preset number of sampling points; obtaining an audio prediction signal corresponding to the current frame according to the n sub-prediction values corresponding to each sampling point; and performing audio synthesis on each acoustic feature frame of the at least one acoustic feature frame to obtain a target audio. According to the invention, the audio processing speed and efficiency can be improved.

Description

Technical field [0001] The present application relates to audio and video processing techniques, and more particularly to an audio processing method, a vocoder, a device, a device, and a storage medium. Background technique [0002] With the rapid development of smart equipment (such as smartphone, smart speakers, etc.), speech interaction technology is a growing application as a natural interaction. As an important part of speech interaction technology, speech synthesis technology has also achieved great progress. Voice synthesis technology converts text into a corresponding audio content through a certain rule or model algorithm. Traditional speech synthesis is based primarily based on splicing methods or statistical parameters. With the continuous breakthrough in the field of speech recognition, deep learning is gradually introduced into the field of speech synthesis. Thanks to this, the neural network-based vocoder (Neural Vocoder) has made great progress. However, the curren...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L13/04G10L13/02G10L13/08

CPCG10L13/02G10L13/08G10L13/047

Inventor林诗伦李新辉卢鲤

OwnerTENCENT TECH (SHENZHEN) CO LTD

Audio processing method, vocoder, device, equipment and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology