Unlock instant, AI-driven research and patent intelligence for your innovation.

Voice conversion method and device, electronic equipment and storage medium

A voice conversion and voice technology, applied in voice analysis, voice recognition, instruments, etc., can solve problems such as the gap between the effect tone and the target tone, and achieve the effect of improving the effect.

Pending Publication Date: 2022-04-12
成都爱奇艺智能创新科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The present application provides a voice conversion method, device, electronic equipment, and storage medium to solve the problem in the related art that when performing voice conversion on the voice to be converted, there is a gap between the effect timbre of the voice to be converted and the target timbre

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice conversion method and device, electronic equipment and storage medium
  • Voice conversion method and device, electronic equipment and storage medium
  • Voice conversion method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of this application, but not all of them. All other embodiments obtained by persons of ordinary skill in the art based on the embodiments in the present application without creative efforts shall fall within the protection scope of the present application.

[0026] figure 1 A schematic flow chart of a speech conversion method provided by the embodiment of the present application, such as figure 1 As shown, the voice conversion method includes:

[0027] S101. Perform content extraction on the speech to be converted to obtain target features;

[0028] It should be understood that the voice...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a voice conversion method and device, electronic equipment and a storage medium, and the method comprises the steps: carrying out the content extraction of a to-be-converted voice, and obtaining a target feature; encoding the target feature to convert the target feature into a target vector; target rhythm parameters corresponding to the to-be-converted voice are obtained, the target vector is decoded on a target tone according to the target rhythm parameters, the Mel-Per features of the target tone are obtained, and the target rhythm parameters do not contain the tone of the to-be-converted voice; by removing the timbre of the to-be-converted voice contained in the target rhythm parameter, the problem that the timbre of the original speaker is leaked into the converted voice due to the fact that the feature of the to-be-converted voice is directly used for generating the Mel-Per feature, and thus the effect timbre after voice conversion is different from the timbre of the target speaker is solved; therefore, the to-be-converted voice is closer to the target tone after being converted, and the voice conversion effect is improved.

Description

technical field [0001] The present application relates to the technical field of voice processing, and in particular to a voice conversion method, device, electronic equipment and storage medium. Background technique [0002] With the continuous development of deep learning technology, neural network-based voice conversion (Voice Conversion, VC) technology is also becoming more and more mature. Speech conversion refers to changing the acoustic feature parameters related to the personality characteristics of the source speaker to make it sound like the voice of the target speaker without changing the semantics, that is, to remove the timbre of the original speaker. . In the related technology, in the voice conversion technology, the mel spectrum feature is directly generated according to the prosody parameters of the voice to be converted, so as to improve the expressiveness of the model. Therefore, there is a big defect in the related technology. , the obtained speech cont...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/24G10L19/16G10L15/26G10L15/06G10L25/87
Inventor 闫影甘文东文博龙李海陈海涛
Owner 成都爱奇艺智能创新科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More