Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Audio synthesis method and device, electronic equipment and computer readable storage medium

A synthesis method and audio technology, applied in speech synthesis, neural learning methods, speech analysis, etc., can solve the problems of poor sound quality and poor sound quality of synthesized audio.

Pending Publication Date: 2021-01-01
TENCENT MUSIC ENTERTAINMENT TECH SHENZHEN CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the related art, a method based on parameter synthesis is used to synthesize dry sound, which can achieve the effect of accurate pronunciation time and controllable rhythm under the condition of accurate parameters, but the sound quality is generally poor, which leads to poor sound quality of synthesized audio
It can be seen that in the process of realizing the present invention, the inventors have found that there are at least the following problems in the related art: the sound quality of the synthesized audio is relatively poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio synthesis method and device, electronic equipment and computer readable storage medium
  • Audio synthesis method and device, electronic equipment and computer readable storage medium
  • Audio synthesis method and device, electronic equipment and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

preparation example Construction

[0035] see figure 2 , a flowchart of an audio synthesis method provided in an embodiment of the present application, such as figure 2 shown, including:

[0036] S101: Acquire target dry sound audio, and generate phoneme information corresponding to the target dry sound audio;

[0037] The executor of this embodiment is the server in the audio synthesis system provided by the above embodiments, for the purpose of synthesizing dry audio. In this step, the audio collection device collects the target dry sound audio and sends it to the server, and the server generates phoneme information corresponding to the target dry sound audio. The target dry sound audio is the dry sound wave file recorded by the user, and the audio format is WAV (Waveform Audio File Format). It should be noted that due to lossy encoding methods such as MP3, the actual read audio will have a time offset at the beginning or end of the audio due to differences in different decoders. Therefore, in order to e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an audio synthesis method and device, electronic equipment and a computer readable storage medium, and the method comprises the steps: obtaining a target dry sound audio, and generating phoneme information corresponding to the target dry sound audio; extracting audio features from the target dry sound audio; wherein the audio features comprise any one or a combination of several of fundamental frequency features, energy features and perception linear prediction features; and inputting the target dry sound audio, the phoneme information and the audio features into the trained neural network model to obtain a synthesized dry sound audio. Therefore, according to the audio synthesis method provided by the invention, the audio features are embedded into the synthesis process of the neural network model, so that the efficiency and accuracy of synthesizing the dry sound audio by the neural network model are improved, and the depicting ability of the neural network model on the target dry sound audio can be enhanced; therefore, the trained neural network model can generate the synthetic dry sound audio with better tone quality and richer sound details, and the tonequality of the final synthetic song is improved.

Description

technical field [0001] The present application relates to the technical field of sound synthesis, and more specifically, to an audio synthesis method, device, electronic equipment, and computer-readable storage medium. Background technique [0002] With the development of deep learning technology and audio signal processing technology, artificially synthesized singing voices have gradually become possible. People can use technology to generate dry voices, that is, pure human voices without music. These synthesized dry sounds are accompanied by an accompaniment to obtain a song. [0003] In the related art, a method based on parameter synthesis is used to synthesize dry sound, which can achieve accurate pronunciation time and controllable rhythm under the condition of accurate parameters, but the sound quality is generally poor, which in turn leads to poor sound quality of the synthesized audio. It can be seen that during the process of implementing the present invention, th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L13/04G10L13/047G10L13/08G10L25/30G06N3/08
CPCG10L13/047G10L13/08G10L25/30G06N3/08
Inventor 徐东
Owner TENCENT MUSIC ENTERTAINMENT TECH SHENZHEN CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products