Unlock instant, AI-driven research and patent intelligence for your innovation.

Voice conversion method and device, electronic equipment and storage medium

A speech conversion and speech technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of poor speech conversion effect, easy loss of target speech details and content information, etc., and achieve the effect of improving the effect.

Pending Publication Date: 2022-04-12
成都爱奇艺智能创新科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The present application provides a voice conversion method, device, electronic equipment and storage medium to solve the problem in the related art that the target voice details and content information are easily lost during voice conversion, resulting in poor voice conversion effect

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice conversion method and device, electronic equipment and storage medium
  • Voice conversion method and device, electronic equipment and storage medium
  • Voice conversion method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of this application, but not all of them. Based on the embodiments in the present application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present application.

[0025] figure 1 It is a schematic flow chart of a speech conversion method provided by the embodiment of the present application. Such as figure 1 As shown, the voice conversion method includes:

[0026] S101. Extracting the prosody of the target speech to obtain target prosody parameters;

[0027] It should be understood that the t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a voice conversion method and device, electronic equipment and a storage medium, and the method comprises the steps: extracting the rhythm of a target voice, and obtaining a target rhythm parameter, the target voice being a voice needing voice conversion; inputting the target voice into an end-to-end voice recognition encoder to obtain a content vector output after the end-to-end voice recognition encoder processes the target voice; and performing voice conversion on the target voice according to the target rhythm parameter and the content vector, and because the time resolution of the content vector is consistent with the time resolution of the target voice, making the phoneme feature represented by the content vector consistent with the phoneme feature contained in the target voice, so that the content vector can completely reflect the details of the target voice, and the accuracy of voice conversion is improved. The voice conversion effect is improved, the phoneme features included in the content vector are not classified, the problem that phoneme classification is inaccurate and voice conversion is affected is avoided, and the voice conversion effect is further improved.

Description

technical field [0001] The present application relates to the field of voice conversion, in particular to a voice conversion method, device, electronic equipment and storage medium. Background technique [0002] With the continuous development of deep learning technology, neural network-based voice conversion (Voice Conversion, VC) technology is also becoming more and more mature. Speech conversion refers to changing the acoustic feature parameters related to the speaker's personality characteristics, so that the timbre of the target voice sounds like the target speaker's timbre, while the semantics do not change; at present, voice conversion technology is easy to lose the target Voice detail content information, resulting in poor voice conversion effect. Contents of the invention [0003] The present application provides a voice conversion method, device, electronic equipment and storage medium to solve the problem in the related art that the detailed content information...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/24G10L25/63G10L19/16G10L25/30G10L15/16
Inventor 文博龙甘文东陈海涛闫影李海
Owner 成都爱奇艺智能创新科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More