Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Song training data processing method and device and computer readable storage medium

A technology for training data and processing methods, applied in computer components, computing, speech analysis, etc., can solve problems such as multi-manpower, uneven pitch distribution, etc.

Pending Publication Date: 2019-05-31
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In addition, among all musical factors, pitch is one of the main reasons affecting singing quality, so making the corpus evenly and comprehensively summarizing each pitch is a major point when training data. In the prior art, many training samples need to be recorded. In order to get a comprehensive summary of the corpus of each pitch, it not only requires a lot of manpower, material resources and time, but also the pitch distribution is uneven

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Song training data processing method and device and computer readable storage medium
  • Song training data processing method and device and computer readable storage medium
  • Song training data processing method and device and computer readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0043] The invention provides a method for processing song training data. refer to figure 1 As shown, it is a schematic flowchart of a method for processing song training data provided by an embodiment of the present invention. The method may be performed by a device, and the device may be implemented by software and / or hardware.

[0044] In this embodiment, the song training data processing method includes:

[0045] S10. Acquire initial sample data, where the initial sample data includes a musical score of each song and a cappella recording corresponding to the musical score of each song.

[0046] Preferably, the initial sample data includes songs in various vocal ranges. The pitch range refers to the range from the lowest to the highest pitch that a human voice or musical instrument can achieve. The characteris...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of voice semantics, and discloses a song training data processing method. The song training data processing method comprises the steps that initial sampledata are acquired, the initial sample data comprise the music score of each song and the singing record corresponding to the music score of each song, the initial sample data are marked, training dataare obtained, and an acoustic feature output model is obtained through training based on the training data. Afterwards, the method processes the target training data, takes the linguistic feature andthe music feature of each song in the target training data as input data of the trained acoustic feature output model, outputs the acoustic feature of each song, and performs pitch transfer on each song according to the acoustic feature of each song and the music feature of each song. The invention further provides a song training data processing device and a computer readable storage medium. According to the method and the device, the number of training samples is increased under the condition that extra corpora is not recorded.

Description

technical field [0001] The invention relates to the technical field of speech semantics, in particular to a song training data processing method, device and computer-readable storage medium. Background technique [0002] The concept of singing synthesis has attracted people's attention since it was introduced. Its ultimate goal is to allow machines to sing songs of various melodies with a natural degree comparable to that of a real singer. Parametric synthesis is one of the mainstream technologies of singing synthesis. Its technical core is to let the model learn how to convert the language features of lyrics and music features of music scores into the acoustic features of singing through training models. Therefore, model training is a crucial step in parameter synthesis technology, and the performance of the trained model depends on the quality of the training corpus. If some contextual factors rarely or never appear in the training corpus, the model will not be able to le...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06N3/04G10L15/06G10L25/30
Inventor 朱清影程宁王健宗
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products