Singing speech synthesis method and synthesis device, and computer storage medium

The technology relates to speech synthesis and speech data processing, with applications in speech synthesis, speech analysis, and instruments. It addresses degraded singing quality, model over-smoothing, and the limited modeling capacity of statistical parametric methods, with the aim of ensuring the accuracy and naturalness of the synthesized singing.

Pending Publication Date: 2021-05-07
IFLYTEK CO LTD
Cites: 0; cited by: 2

AI Technical Summary

Problems solved by technology

At present, some widely used singing speech synthesis methods still rely on context-dependent hidden Markov models. These models suffer from severe over-smoothing and have limited capacity for modeling statistical parameters, so the generated singing falls well short of real singing in timbre and naturalness.

Embodiment Construction

[0017] The technical solutions in the embodiments of the present application are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present application, not all of them. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the scope of protection of this application.

[0018] It should first be noted that the singing speech synthesis method of the present application is executed by a singing speech synthesis device, which can be any device with information processing capabilities, such as a mobile phone, a computer, or a smart watch. After the user inputs the score information into the singing speech synthesis device, the device outputs the corresponding singing voice data, that is, for...

Abstract

The invention discloses a singing voice synthesis method, a synthesis device, and a computer storage medium. The synthesis method comprises the following steps: acquiring music score information; performing feature extraction on the music score information to obtain music score features; performing fundamental frequency feature prediction on the music score features to obtain fundamental frequency features; performing acoustic feature prediction on the music score features in combination with the fundamental frequency features to obtain acoustic features; and obtaining synthesized singing voice data according to the acoustic features. The synthesis method provided by the invention can improve the accuracy and naturalness of singing speech synthesis.
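
The order of the steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: every name here (Note, FRAME_RATE, extract_score_features, predict_f0, predict_acoustic, vocode, synthesize) is hypothetical, the score encoding as MIDI note numbers is an assumption, and the learned prediction models are replaced by simple closed-form stand-ins so that only the data flow is shown.

```python
import math
from dataclasses import dataclass

FRAME_RATE = 100  # frames per second (assumed value, not from the patent)


@dataclass
class Note:
    pitch: int       # MIDI note number (hypothetical score encoding)
    duration: float  # note length in seconds
    lyric: str       # syllable sung on this note


def extract_score_features(notes):
    """Step 2: expand the score into per-frame score features."""
    frames = []
    for note in notes:
        n = max(1, round(note.duration * FRAME_RATE))
        for i in range(n):
            frames.append({"pitch": note.pitch, "lyric": note.lyric, "pos": i / n})
    return frames


def predict_f0(frames):
    """Step 3 stand-in: a real system would use a trained F0 model.
    Here the MIDI pitch is simply converted to Hz."""
    return [440.0 * 2 ** ((f["pitch"] - 69) / 12) for f in frames]


def predict_acoustic(frames, f0):
    """Step 4 stand-in: acoustic features depend on score features AND F0.
    Here the 'acoustic feature' is F0 plus a per-note amplitude envelope."""
    return [{"f0": hz, "amp": math.sin(math.pi * f["pos"])}
            for f, hz in zip(frames, f0)]


def vocode(acoustic, sample_rate=16000):
    """Step 5 stand-in: a sine oscillator in place of a real vocoder."""
    samples, phase = [], 0.0
    hop = sample_rate // FRAME_RATE  # samples per frame
    for frame in acoustic:
        for _ in range(hop):
            phase += 2 * math.pi * frame["f0"] / sample_rate
            samples.append(frame["amp"] * math.sin(phase))
    return samples


def synthesize(notes):
    """Run the steps in the order given in the abstract."""
    frames = extract_score_features(notes)
    f0 = predict_f0(frames)
    acoustic = predict_acoustic(frames, f0)
    return vocode(acoustic)
```

Calling synthesize([Note(69, 0.5, "la"), Note(72, 0.25, "la")]) returns a list of float samples. The point of the sketch is the dependency structure: the fundamental frequency is predicted first and then passed, together with the score features, to the acoustic stage.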

Description

Technical field

[0001] The present application relates to the technical field of speech synthesis, and in particular to a singing speech synthesis method, a synthesis device, and a computer storage medium.

Background

[0002] Text To Speech (TTS) is a technology that converts text into speech. In recent years, driven by advances in tools such as deep learning, TTS has made great progress and been widely adopted. As a result, Singing Voice Synthesis (SVS) has received more attention and has gradually become an important function of virtual idols, voice assistants, and many smart devices. Singing speech synthesis can also be readily combined with other artificial intelligence technologies, such as machine composition and automatic lyric generation, which opens up a broad application space. With the advancement of multimodal technology, artificial intelligence singers are becoming increasingly popular with the public.

[0003] Research on computer-...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/08, G10H1/00
CPC: G10L13/08, G10H1/0033, G10H2240/121, G10H2250/471
Inventor: 殷锋, 胡亚军
Owner: IFLYTEK CO LTD