Voice synthesis method and device

A speech synthesis technology, applied in the fields of speech synthesis, speech analysis, instruments, etc., that achieves emotional expressiveness, high sound quality, and accurate acoustic characteristic parameters.

Active Publication Date: 2018-09-28
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

AI Technical Summary

Problems solved by technology

[0005] Embodiments of the present invention provide a speech synthesis method and device to solve the problem that existing speech synthesis methods cannot provide high-quality, expressive synthesized speech while meeting real-time requirements.


Embodiment Construction

[0054] Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the present invention. Rather, they are merely examples of apparatuses and methods consistent with aspects of the invention as recited in the appended claims.

[0055] The terms "comprising" and "having" and any variations thereof in the description and claims of the present invention are intended to cover a non-exclusive inclusion. For example, a process, method, system, product or device comprising a series of steps or units is not limited to the listed steps or units, but optionally also includes unlisted steps or units, or optionally further includes ...

Abstract

An embodiment of the invention provides a speech synthesis method and device. The method comprises: obtaining phoneme features and prosodic and emotional features of a text to be processed; determining the speech duration of the text from the phoneme features and the prosodic and emotional features using a pre-trained duration model, where the duration model is obtained by convolutional neural network training; determining acoustic feature parameters of the text from the phoneme features, the prosodic and emotional features, and the speech duration using a pre-trained acoustic parameter model, which is likewise obtained by convolutional neural network training; and synthesizing the speech of the text from the acoustic feature parameters. While still meeting real-time requirements, the method can provide synthesized speech with higher sound quality, better emotional expressiveness, and greater naturalness and fluency.
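The data flow in the abstract can be illustrated with a small sketch. This is not the patented implementation: the layer sizes, the two-layer convolution stacks, the softplus rounding of durations, and every name below (`conv1d_same`, `duration_model`, `acoustic_model`, the feature dimensions) are assumptions made for illustration, and the weights are random rather than trained. The final step, turning acoustic parameters into a waveform, is left out because the source text shown here does not say how it is done.

```python
import numpy as np

rng = np.random.default_rng(0)

def conv1d_same(x, w, b):
    """1-D convolution over time with zero 'same' padding.
    x: (T, C_in), w: (K, C_in, C_out) with odd K, b: (C_out,) -> (T, C_out)."""
    k = w.shape[0]
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (0, 0)))
    out = np.stack([
        np.tensordot(xp[t:t + k], w, axes=([0, 1], [0, 1]))
        for t in range(x.shape[0])
    ])
    return out + b

def init_conv(k, c_in, c_out):
    # Hypothetical untrained weights; a real system would load trained ones.
    return rng.normal(scale=0.1, size=(k, c_in, c_out)), np.zeros(c_out)

def init_model(c_in, c_hidden, c_out, k=3):
    w1, b1 = init_conv(k, c_in, c_hidden)
    w2, b2 = init_conv(k, c_hidden, c_out)
    return {"w1": w1, "b1": b1, "w2": w2, "b2": b2}

def duration_model(feats, params):
    """Per-phoneme features -> number of frames per phoneme (assumed unit)."""
    h = np.maximum(conv1d_same(feats, params["w1"], params["b1"]), 0.0)  # ReLU
    raw = conv1d_same(h, params["w2"], params["b2"])[:, 0]
    frames = np.log1p(np.exp(raw))  # softplus keeps durations positive
    return np.maximum(1, np.rint(frames).astype(int))  # at least one frame

def acoustic_model(feats, durations, params):
    """Repeat each phoneme's features for its duration, then convolve."""
    frame_feats = np.repeat(feats, durations, axis=0)  # (sum(durations), C)
    h = np.maximum(conv1d_same(frame_feats, params["w1"], params["b1"]), 0.0)
    return conv1d_same(h, params["w2"], params["b2"])

# Assumed sizes: 5 phonemes, 8 phoneme dims, 4 prosody/emotion dims, 6 acoustic dims.
N_PHONEMES, N_PHONEME_DIMS, N_PROSODY_DIMS, N_ACOUSTIC = 5, 8, 4, 6
phoneme_feats = rng.normal(size=(N_PHONEMES, N_PHONEME_DIMS))
prosody_emotion_feats = rng.normal(size=(N_PHONEMES, N_PROSODY_DIMS))
feats = np.concatenate([phoneme_feats, prosody_emotion_feats], axis=1)

dur_params = init_model(feats.shape[1], 16, 1)
ac_params = init_model(feats.shape[1], 16, N_ACOUSTIC)

durations = duration_model(feats, dur_params)          # one value per phoneme
acoustic = acoustic_model(feats, durations, ac_params)  # one row per frame
print(durations.shape, acoustic.shape)
```

The point of the sketch is the ordering: the duration model runs first on phoneme-level features, and its output decides how many frame-level rows the acoustic parameter model produces.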

Description

Technical field

[0001] Embodiments of the present invention relate to the technical field of Text To Speech (TTS for short), and in particular, to a method and apparatus for speech synthesis.

Background

[0002] With the continuous development of multimedia communication technology, speech synthesis technology, which is one of the important ways of human-computer interaction, has attracted extensive attention of researchers because of its convenience and speed. Speech synthesis is a technology that generates artificial speech by mechanical and electronic methods. It is a technology that converts text information generated by a computer or input from external sources into comprehensible and fluent spoken language output. The purpose of speech synthesis is to convert text into speech and play it to the user, and the goal is to achieve the effect of real text broadcasting.

[0003] Speech synthesis technology has been widely used, for example, speech synthesis techno...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/10, G10L13/08, G10L13/04, G10L25/30
CPC: G10L13/04, G10L13/08, G10L13/10, G10L25/30
Inventor: 李昊, 康永国, 王振宇
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD