Speech synthesis method and speech synthesis device

A speech synthesis technology, applied in speech synthesis, speech analysis, semantic analysis, and related fields. It addresses inconsistent synthesized timbre, which harms the naturalness of the speech and the user experience, and the high cost of data collection, so as to reduce data cost, lower the difficulty of implementation, and improve the user experience.

Active Publication Date: 2019-05-03
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD
Cites: 5 · Cited by: 1

AI Technical Summary

Problems solved by technology

[0004] However, current approaches have two problems: (1) using data from a different native speaker for each language produces inconsistent synthesized timbres, which harms the naturalness of the synthesized speech and the user experience; (2) when data from a single multilingual speaker is used, most speakers are not idiomatic in languages other than their native language and speak with an accent that differs considerably from native speakers, so the pronunciation is not standard enough and the user experience suffers. Speakers whose pronunciation is standard in multiple languages are usually professionals, so the cost of data collection is high.

Detailed Description of Embodiments

[0021] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0022] Demand for multilingual speech synthesis applications has been growing in daily life. Take a news application on a mobile terminal as an example: when the user listens to news through the application's speech synthesis function, the news content, especially technology news, contains a lot of English in addition to Chinese, so this is a typical multilingual speech synthesis application. However, the naturalness, accuracy and unifor...

Abstract

The invention discloses a speech synthesis method and device. The method includes: determining the language type to which the sentence text information to be synthesized belongs, where the language types include a first language type and a second language type; determining a first basic model corresponding to the first language type and a second basic model corresponding to the second language type; determining a target timbre, performing an adaptive transformation on the first basic model and the second basic model according to the target timbre, and performing training on the sentence text information to be synthesized with the adapted first and second basic models to generate the corresponding spectral parameters and fundamental frequency parameters; adjusting the fundamental frequency parameters of the first language type and the second language type according to the target timbre; and synthesizing the target speech from the spectral parameters of the first language type, the spectral parameters of the second language type, the adjusted fundamental frequency parameters of the first language type, and the adjusted fundamental frequency parameters of the second language type.
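Two of the steps in the abstract, determining the language type of the text and adjusting the fundamental frequency toward the target timbre, can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not taken from the patent: the first and second language types are taken to be Chinese and English, language is detected by Unicode range, and the fundamental frequency is adjusted by a log-domain mean shift, which is a common technique but not necessarily the one the patent uses. The function names `split_by_language` and `adjust_f0` are hypothetical.

```python
import math
import re
from typing import List, Tuple

# Assumption: "zh" (CJK Unified Ideographs) is the first language type and
# "en" (Latin letters) is the second. Punctuation and digits are skipped.
_SEGMENT = re.compile(
    r"(?P<zh>[\u4e00-\u9fff]+)|(?P<en>[A-Za-z]+(?:[ '\-][A-Za-z]+)*)"
)


def split_by_language(text: str) -> List[Tuple[str, str]]:
    """Split mixed text into (language_tag, segment) runs, in order."""
    return [(m.lastgroup, m.group()) for m in _SEGMENT.finditer(text)]


def adjust_f0(f0_hz: List[float], target_mean_log_f0: float) -> List[float]:
    """Shift voiced F0 values so their mean log-F0 matches the target.

    Unvoiced frames are marked with 0.0 and are left untouched.
    This mean shift is an illustrative stand-in for the patent's
    unspecified adjustment "according to the target timbre".
    """
    voiced_logs = [math.log(f) for f in f0_hz if f > 0]
    if not voiced_logs:
        return list(f0_hz)
    shift = target_mean_log_f0 - sum(voiced_logs) / len(voiced_logs)
    scale = math.exp(shift)
    return [f * scale if f > 0 else 0.0 for f in f0_hz]


if __name__ == "__main__":
    for lang, segment in split_by_language("今天发布了iPhone新款"):
        print(lang, segment)
    # Move a contour toward a hypothetical target speaker mean of 100 Hz.
    print(adjust_f0([100.0, 0.0, 400.0], math.log(100.0)))
```

In a full system, each segment would be routed to the adapted model for its language, and the F0 contours from both models would be adjusted against the same target statistics so that the two languages share one pitch range before synthesis.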

Description

Technical field

[0001] The invention relates to the technical field of speech synthesis, in particular to a speech synthesis method and a speech synthesis device.

Background

[0002] With the development of speech synthesis technology and the spread of its applications, speech synthesis services are being accepted and used by more and more users. A large share of these users are bilingual or multilingual, and speech synthesis is increasingly applied to multilingual content. There is therefore a demand for multilingual speech synthesis, of which mixed reading of Chinese and English is the most common case. Users generally require multilingual speech synthesis first to be intelligible, and then to be accurate, natural and uniform in timbre. Given that current speech synthesis technology has basically solved intelligibility, how to synthesize natural, accurate and uniform multilingual speech has become a techni...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L13/047
CPC: G10L13/047, G10L13/0335, G10L13/086, G06F40/30, G10L13/00, G10L13/08
Inventors: 李昊 (Li Hao), 康永国 (Kang Yongguo)
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD