Speech synthesis method and device based on low-resource language, equipment and storage medium

A speech synthesis and phoneme technology, applied in the field of artificial intelligence. It addresses the high cost in money and time of building data sets, the poor synthesis quality for low-resource languages, and the lack of phonetic symbols, with the effect of improving stability and accuracy, improving the speech synthesis effect, and shortening training time.

Status: Pending
Publication Date: 2021-07-23
PING AN TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0003] There are more than 6,000 languages in the world, yet traditional speech synthesis methods cover only dozens of them. A main reason is the difficulty of data set development. Developing a data set for a new language usually requires hiring professional voice actors to record a large amount of high-quality voice data in that language, which is then used as the training data set for the model. However, many low-resource languages, such as dialects and other minority languages, usually have no International Phonetic Alphabet transcription or even a phonetic alphabet of their own. At the same time, hiring linguistic experts to define and analyze the pronunciation, phonemes, and tones of the language costs a lot of money and time. Therefore, the training data set is very small, and directly using a traditional speech synthesis model for low-resource languages gives poor results. Currently, there is no speech synthesis method specifically for low-resource languages.

Examples

Embodiment Construction

[0047] It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0048] An embodiment of the present application provides a speech synthesis method based on a low-resource language. The method may be executed by, but is not limited to, at least one electronic device, such as a server or a terminal, that can be configured to execute the method provided by the embodiment of the present application. In other words, the method can be executed by software or hardware installed on a terminal device or a server device, and the software can be a blockchain platform. The server includes, but is not limited to, a single server, a server cluster, a cloud server, or a cloud server cluster.

[0049] Refer to Figure 1, which is a schematic flow chart of a speech synthesis ...

Abstract

The invention relates to artificial intelligence technology and discloses a speech synthesis method based on a low-resource language, comprising the following steps: determining a low-resource language and a high-resource language, and obtaining text in the low-resource language as the low-resource language text; converting the low-resource language text into a low-resource language phoneme text; translating the low-resource language phoneme text into a high-resource language phoneme text using a translation model trained with dual learning; and performing speech synthesis on the high-resource language phoneme text using a pre-trained speech synthesis model to obtain the synthesized speech. The invention also relates to blockchain technology: the low-resource language text can be stored in a node of a blockchain. The invention further provides a speech synthesis device based on a low-resource language, electronic equipment, and a computer-readable storage medium. The invention provides a speech synthesis method that targets low-resource languages and improves the speech synthesis effect.
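
The steps listed in the abstract can be read as a three-stage pipeline: text to phonemes, phoneme translation, then synthesis. The Python sketch below only illustrates that data flow. It is not the patent's implementation: the function names, the toy phoneme mapping, and the byte-string "audio" are all hypothetical stand-ins for the trained grapheme-to-phoneme step, the dual-learning translation model, and the pre-trained speech synthesis model.

```python
from typing import Callable, Dict, List

Phonemes = List[str]


def synthesize_low_resource(
    text: str,
    to_phonemes: Callable[[str], Phonemes],
    translate: Callable[[Phonemes], Phonemes],
    synthesize: Callable[[Phonemes], bytes],
) -> bytes:
    """Chain the stages named in the abstract (all callables are assumed)."""
    low_phonemes = to_phonemes(text)          # low-resource text -> phoneme text
    high_phonemes = translate(low_phonemes)   # low-resource -> high-resource phonemes
    return synthesize(high_phonemes)          # high-resource phonemes -> speech


# --- Toy stand-ins, for illustration only (not real models) ---

def toy_to_phonemes(text: str) -> Phonemes:
    # Pretend every letter is one phoneme; a real system needs a proper
    # phoneme conversion for the low-resource language.
    return [ch for ch in text.lower() if ch.isalpha()]


# Hypothetical mapping from low-resource phonemes to the closest
# high-resource phonemes; in the patent this role is played by a
# translation model trained with dual learning.
TOY_PHONEME_MAP: Dict[str, str] = {"q": "k", "x": "s"}


def toy_translate(phonemes: Phonemes) -> Phonemes:
    return [TOY_PHONEME_MAP.get(p, p) for p in phonemes]


def toy_synthesize(phonemes: Phonemes) -> bytes:
    # Placeholder "audio": the phoneme string encoded as bytes.
    return " ".join(phonemes).encode("utf-8")


if __name__ == "__main__":
    audio = synthesize_low_resource(
        "Qax", toy_to_phonemes, toy_translate, toy_synthesize
    )
    print(audio)
```

Passing the stages in as callables keeps the sketch independent of any particular model or library; each toy function could be replaced by a real component without changing the pipeline function.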

Description

Technical field

[0001] The present invention relates to the technical field of artificial intelligence, in particular to a speech synthesis method, device, electronic equipment and computer-readable storage medium based on a low-resource language.

Background technique

[0002] Text-to-speech (TTS) has developed rapidly in recent years and has attracted extensive attention from academia and industry. The demand for speech synthesis in all walks of life is getting higher and higher. For example, the customer service industry can use speech synthesis to complete self-service voice services.

[0003] There are more than 6,000 languages in the world, and traditional speech synthesis methods cover only dozens of languages. The reason behind this cannot be separated from the difficulty of data set development. The development of a new language data set usually requires the hiring of professional voice actors for the language. The language customizes a large amount of high-qualit...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/08, G06F40/58, G06N20/00
CPC: G10L13/08, G06F40/58, G06N20/00
Inventors: 孙奥兰, 王健宗, 程宁
Owner: PING AN TECH (SHENZHEN) CO LTD