Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Singing voice synthesis method and device

A synthesis method and technology of singing voices, applied in speech synthesis, voice analysis, instruments, etc., can solve problems such as differences in singing voices

Active Publication Date: 2021-07-02
GUANGZHOU KUGOU COMP TECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present invention provides a singing voice synthesis method and device, which can solve the problem that the user's singing voice synthesized by the related technology is quite different from the user's voice

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the object, technical solution and advantages of the present invention clearer, the implementation manner of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0042] figure 1 It is a flow chart of a singing voice synthesis method provided by an embodiment of the present invention. see figure 1 , the method includes:

[0043] 101. When the user's voice is acquired, extract the fundamental frequency, envelope and consonant information of each word in the user's voice.

[0044] 102. Adjust the fundamental frequency of each word in the user's voice according to the pitch frequency of each word in the song, and the pitch frequency of each word in the song is the frequency corresponding to the pitch of each word in the song .

[0045] 103. Synthesize the adjusted fundamental frequency, the envelope of each word in the user's voice, and the consonant information to obtain synthesized audio.

...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a singing voice synthesis method and device, belonging to the technical field of voice synthesis. The method includes: when the user's voice is obtained, extracting the fundamental frequency, envelope and consonant information of each word in the user's voice; adjust the pitch frequency of each word in the song, the pitch frequency of each word in the song is the frequency corresponding to the pitch of each word in the song; the adjusted base frequency, the pitch frequency of each word in the user voice The envelope and consonant information are synthesized to obtain synthesized audio; according to the duration of each word in the song, the duration of each word in the synthesized audio is adjusted to obtain a synthesized user singing voice. The present invention uses the user's original envelope and auxiliary information to synthesize the user's singing voice, which can retain the user's original timbre, and the synthesized user's singing voice is closer to the user's voice.

Description

technical field [0001] The invention relates to the technical field of speech synthesis, in particular to a singing voice synthesis method and device. Background technique [0002] With the development of speech synthesis technology, speech synthesis technology is gradually applied in people's daily life. For example, some users who sing with pentaphonic insufficiency wish to read out the lyrics and then generate their own singing voice, which can be realized by using speech synthesis technology. [0003] At present, the related technology generally firstly recognizes the voice spoken by the user, correspondingly finds the inherent singing voice in the speech synthesis database, then extracts the timbre of the singing voice, and then uses a pre-established conversion model to convert the timbre of the singing voice into the user's timbre , to obtain the synthesized user singing voice. Wherein, the filter model is used to convert the timbre of the inherent singing voice in t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L13/02G10L13/033
CPCG10L13/02G10L13/033
Inventor 劳振锋
Owner GUANGZHOU KUGOU COMP TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products