Speech synthesis method and speech synthesis device

A speech synthesis technology, applied in speech synthesis, speech analysis, semantic analysis, etc., that addresses inconsistent synthesized timbre (which harms speech naturalness and user experience) and high data collection cost, so as to reduce data cost, lower the difficulty of implementation, and improve user experience.

Active Publication Date: 2016-08-10
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

AI Technical Summary

Problems solved by technology

[0004] However, current approaches have two problems: (1) using data from different native speakers for different languages produces inconsistent synthesized timbres, which harms the naturalness of the synthesized speech and the user experience; (2) when data from a single multilingual speaker is used, most speakers are not idiomatic in languages other than their native one and speak them with an accent that differs noticeably from native speakers, so the pronunciation is not standard enough and user experience suffers. Speakers who are standard in multiple languages are usually professionals, so the cost of data collection is high.



Examples


Embodiment Construction

[0021] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0022] Multilingual speech synthesis is increasingly in demand in daily life. Take a news application on a mobile terminal as an example: when the user listens to news through the application's speech synthesis function, the news content, especially technology news, contains a lot of English in addition to Chinese, so this is a typical multilingual speech synthesis scenario. However, the naturalness, accuracy and unifor...


Abstract

The invention discloses a speech synthesis method. The method comprises the following steps: the language types of the sentence text to be synthesized are determined, wherein the language types comprise a first language type and a second language type; a first base model corresponding to the first language type and a second base model corresponding to the second language type are determined; a target timbre is determined; the first base model and the second base model are adaptively transformed according to the target timbre; the adapted first and second base models are used to generate, from the sentence text to be synthesized, the corresponding spectral parameters and fundamental frequency parameters; the fundamental frequency parameters of the first language type and the second language type are adjusted according to the target timbre; and the target speech is synthesized from the spectral parameters of the first language type, the spectral parameters of the second language type, and the adjusted fundamental frequency parameters of both language types.
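Two of the steps in the abstract can be pictured with a minimal Python sketch. It is an illustration under stated assumptions, not the patent's implementation: the first and second language types are taken to be Chinese and English, language is decided per character from the basic CJK Unified Ideographs block, and the fundamental frequency adjustment is modeled as a simple shift of the mean log-F0 toward a target value. The abstract does not specify either rule, and the names `Segment`, `split_by_language` and `adjust_f0` are hypothetical. Model adaptation and waveform synthesis are left out.

```python
import math
import re
from dataclasses import dataclass
from typing import List, Optional

# Assumption: first language type = Chinese ("zh"), second = English ("en").
# Only the basic CJK Unified Ideographs block is checked here.
_CJK = re.compile(r"[\u4e00-\u9fff]")


@dataclass
class Segment:
    lang: str  # "zh" or "en" (hypothetical labels)
    text: str


def split_by_language(sentence: str) -> List[Segment]:
    """Split a sentence into maximal runs of Chinese or English text.

    Neutral characters (digits, spaces, punctuation) are attached to the
    run that precedes them, or to the first run if none precedes them.
    """
    segments: List[Segment] = []
    pending = ""  # neutral characters seen before the first letter
    for ch in sentence:
        lang: Optional[str]
        if _CJK.match(ch):
            lang = "zh"
        elif ch.isascii() and ch.isalpha():
            lang = "en"
        else:
            lang = None

        if lang is None:
            if segments:
                segments[-1].text += ch
            else:
                pending += ch
        elif segments and segments[-1].lang == lang:
            segments[-1].text += ch
        else:
            segments.append(Segment(lang, pending + ch))
            pending = ""
    return segments


def adjust_f0(f0_hz: List[float], target_mean_log_f0: float) -> List[float]:
    """Shift voiced frames so their mean log-F0 equals the target.

    Assumption: unvoiced frames are encoded as 0.0 and are left unchanged.
    The mean-shift rule itself is an illustrative choice.
    """
    voiced = [f for f in f0_hz if f > 0]
    if not voiced:
        return list(f0_hz)
    mean_log = sum(math.log(f) for f in voiced) / len(voiced)
    scale = math.exp(target_mean_log_f0 - mean_log)
    return [f * scale if f > 0 else 0.0 for f in f0_hz]


if __name__ == "__main__":
    for seg in split_by_language("今天发布了iPhone 新品"):
        print(seg.lang, repr(seg.text))
    print(adjust_f0([100.0, 0.0, 400.0], math.log(100.0)))
```

In a full system, each segment would be routed to the base model for its language, and the F0 tracks produced for both languages would be passed through the same adjustment so that the pitch range stays consistent across language switches.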

Description

Technical field

[0001] The invention relates to the technical field of speech synthesis, in particular to a speech synthesis method and a speech synthesis device.

Background

[0002] With the development of speech synthesis technology and the spread of its applications, speech synthesis services are being accepted and used by more and more users. A large share of these users are bilingual or multilingual, and speech synthesis is increasingly applied to multilingual content. There is therefore a demand for multilingual speech synthesis, with mixed reading of Chinese and English being the most common case. Users generally require intelligibility first, followed by accuracy, naturalness and a uniform timbre. Given that current speech synthesis technology has basically solved intelligibility, how to synthesize natural, accurate and uniform multilingual speech has become a techni...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/047
CPC: G10L13/047, G10L13/0335, G10L13/086, G06F40/30, G10L13/00, G10L13/08
Inventors: 李昊, 康永国
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD