Training method and system of singing sound synthesis model and singing sound synthesis method
A training method and singing voice synthesis technology, applied in speech synthesis, speech analysis, musical instruments, etc., that can solve problems such as low training and synthesis efficiency and a lack of synthesis.
Examples
Embodiment 1
[0056] Referring to Figure 1, a flow chart of the steps of the training method for the singing voice synthesis model in Embodiment 1 of the present invention is shown. It should be understood that the flowchart in this method embodiment does not limit the order in which the steps are executed. The details are as follows.
[0057] Step S100: acquiring multiple pieces of singing voice data for multiple songs, and building a training database based on the multiple pieces of singing voice data and the multiple music scores corresponding to the multiple songs.
[0058] The singing voice data is recorded audio data. Generally, the singing voice data includes the singing voice of a designated person (a professional singer) and the sound of accompanying instruments. When there is no accompanying instrument, however, the singing voice data consists only of the singing voice produced by the designated person.
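As a rough illustration of Step S100, the following minimal Python sketch pairs each recording with a music score to form training database entries. It is not the patent's implementation: the names `TrainingEntry` and `build_training_database`, the file extensions, and the rule of matching a recording to its score by shared file name are all assumptions made for this example.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class TrainingEntry:
    """One record in the training database (hypothetical structure)."""
    song_id: str
    audio_path: Path   # recorded singing voice data
    score_path: Path   # music score corresponding to the song


def build_training_database(audio_paths, score_paths):
    """Pair each recording with the score that shares its file stem.

    Assumption: a recording and its score have the same base file name,
    e.g. song1.wav and song1.musicxml. Recordings with no matching
    score are skipped.
    """
    scores = {Path(p).stem: Path(p) for p in score_paths}
    entries = []
    for audio in map(Path, audio_paths):
        score = scores.get(audio.stem)
        if score is None:
            continue  # no score available for this recording
        entries.append(TrainingEntry(audio.stem, audio, score))
    return entries


if __name__ == "__main__":
    db = build_training_database(
        ["audio/song1.wav", "audio/song2.wav"],
        ["scores/song1.musicxml"],
    )
    for entry in db:
        print(entry.song_id, entry.audio_path, entry.score_path)
```

Matching by file name keeps the sketch short; a real database might instead use an explicit manifest that links several recordings to one score.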
[0059] Exemplarily, the singing voice of a designated person (professional singer) can be recorded through a recording...
Embodiment 2
[0092] Referring to Figure 2, a schematic diagram of the program modules of Embodiment 2, the training system for the singing voice synthesis model of the present invention, is shown. In this embodiment, the training system 20 of the singing voice synthesis model may include or be divided into one or more program modules. The one or more program modules are stored in a storage medium and executed by one or more processors to carry out the present invention and implement the above-mentioned training method of the singing voice synthesis model. A program module, as referred to in the embodiments of the present invention, is a series of computer program instruction segments capable of completing a specific function, and is better suited than the program itself to describing the execution process of the training system 20 of the singing voice synthesis model in the storage medium. The following description will specifically introduce the functions of each program module of the present embo...
Embodiment 3
[0102] Referring to Figure 3, a schematic diagram of the hardware architecture of the computer device according to Embodiment 3 of the present invention is shown. In this embodiment, the computer device 2 is a device capable of automatically performing numerical calculation and/or information processing according to preset or stored instructions. The computer device 2 may be a rack server, a blade server, a tower server, or a cabinet server (including an independent server or a server cluster composed of multiple servers), and the like. As shown in the figure, the computer device 2 includes at least, but is not limited to, a memory 21, a processor 22, a network interface 23, and a training system 20 for the singing voice synthesis model, which are communicatively connected to one another through a system bus. Specifically:
[0103] In this embodiment, the memory 21 includes at least one type of computer-readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, ca...


