Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech synthesis method, device and system, and storage medium

A speech synthesis and target speech technology, applied in speech synthesis, speech analysis, instruments, etc., can solve problems such as poor user experience, inability to adapt to scene changes, and affecting speech synthesis presentation effects

Pending Publication Date: 2019-04-12
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF10 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the existing speech synthesis technology cannot adapt to the change of the scene, which affects the final presentation effect of speech synthesis, and the user experience is not good

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech synthesis method, device and system, and storage medium
  • Speech synthesis method, device and system, and storage medium
  • Speech synthesis method, device and system, and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] In order to make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be described clearly and completely in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of the embodiments of the present invention, not all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0068] The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of the present invention and the above-mentioned drawings are used to distinguish similar objects, without having to use To describe a specific order or sequence. It should be understood that the data used in this way can be interchange...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a speech synthesis method, device and system, and a storage medium. The method comprises the following steps: determining current scene information; acquiring all candidate speakers in consistent with the current scene information; ranking the candidate speakers according to a preset rule to obtain a candidate speaker list; determining a target speaker according to the candidate speaker list; converting text information into a target speech according to the voice of the target speaker. Therefore, the speaker in consistent with the scene is automatically selected according to the received text and scene attribute, so that the synthesized speech can be transformed for the most suitable speaker according to different scenes, the finally synthesized speech is real, the speech synthesis effect is improved, and the user experience is excellent.

Description

Technical field [0001] The present invention relates to the technical field of speech processing, in particular to a speech synthesis method, device, system and storage medium. Background technique [0002] Speech synthesis (Text to Speech) is one of the important technologies and applications in the field of artificial intelligence speech. It is the process of converting text input by users or products into speech. The machine imitates the way humans "speak" and outputs anthropomorphic sounds. Mainly used in audio reading, human-machine dialogue, smart speakers, smart customer service and other scenarios, it is one of the main ways for humans and machines to interact naturally. [0003] At present, the existing speech synthesis is a process in which a user (or product) inputs text for text-to-speech. The input text is synthesized by a pre-selected speaker, and the speaker's timbre style is the only reference method for the speaker to select. In terms of realization, with the expa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/08G10L13/02
CPCG10L13/02G10L13/08
Inventor 杨杰
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products