
Speech synthesis method and device, computer equipment, storage medium and product

A speech synthesis technology, applied in the field of communication, that addresses the strong mechanical quality of synthesized speech, poor speech synthesis effect, and unnatural speech transitions.

Pending Publication Date: 2022-04-12
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

[0003] The waveform splicing method needs to collect a large amount of audio for each speech type to cover all pronunciation units, and because of the splicing, transitions in the synthesized speech are unnatural and the synthesis effect is poor. The statistical parameter method does not need a large amount of audio, but because it relies on a mapping, the synthesized speech sounds strongly mechanical and the synthesis effect is poor.



Detailed Description of Embodiments

[0061] The technical solutions in the embodiments of the application are described clearly and completely below with reference to the drawings in the embodiments of the application. Obviously, the described embodiments are only some of the embodiments of the application, not all of them. All other embodiments obtained by those skilled in the art based on the embodiments in this application, without creative effort, fall within the scope of protection of this application.

[0062] Embodiments of the present application provide a speech synthesis method, device, computer equipment, and computer-readable storage medium. The speech synthesis device can be integrated in computer equipment, and the computer equipment can be a server, a terminal, or another device.

[0063] The terminal may include a mobile phone, a wearable smart device, a tablet computer, a notebook computer, a personal computer (PC), and a vehicle-mounted computer.

[0...

Abstract

Embodiments of the invention disclose a speech synthesis method and device, computer equipment, a storage medium and a product. The method comprises: obtaining the text of the speech to be synthesized and determining the speech type of the speech to be synthesized; fusing the reference audio feature information corresponding to the speech type with the text units in the text to obtain text speech feature information; determining a target duration prediction network according to the speech type; predicting the audio duration information corresponding to each text unit according to the target duration prediction network and the text speech feature information; performing duration matching on the text speech feature information according to the audio duration information to obtain matched text speech feature information; and performing speech synthesis according to the matched text speech feature information to obtain the target speech. With this scheme, accurate text speech feature information can be extracted, and a duration prediction network corresponding to the speech type is used, so that the synthesized target speech keeps the tone, rhythm and other characteristics of the speech type, and the speech synthesis effect is improved.
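The steps in the abstract can be pictured with a small, purely illustrative Python sketch. Nothing in it comes from the patent itself: the names `REFERENCE_FEATURES`, `DURATION_NETWORKS`, `embed_unit`, `fuse`, `match_duration` and `synthesize_frames` are hypothetical, the toy embedding and the fixed-rule "networks" stand in for learned models, fusion is assumed to be an element-wise sum, and duration matching is assumed to mean repeating each text unit's feature vector for its predicted number of frames. The final step of turning the matched features into audio is left out.

```python
from typing import Callable, Dict, List

Vector = List[float]

# Hypothetical reference audio features, one vector per speech type.
REFERENCE_FEATURES: Dict[str, Vector] = {
    "news": [0.1, 0.0],
    "story": [0.0, 0.3],
}


def embed_unit(unit: str) -> Vector:
    """Toy text-unit embedding; a real system would use a learned encoder."""
    return [float(ord(unit[0]) % 10), float(len(unit))]


def fuse(text_units: List[str], reference: Vector) -> List[Vector]:
    """Fuse the reference features into each text unit (assumed: element-wise sum)."""
    return [[t + r for t, r in zip(embed_unit(u), reference)] for u in text_units]


def news_duration(feature: Vector) -> int:
    """Stand-in for a trained duration network: always 2 frames per unit."""
    return 2


def story_duration(feature: Vector) -> int:
    """Stand-in for a trained duration network: 3 or 4 frames per unit."""
    return 3 + int(feature[1]) % 2


# One duration predictor per speech type, selected at synthesis time.
DURATION_NETWORKS: Dict[str, Callable[[Vector], int]] = {
    "news": news_duration,
    "story": story_duration,
}


def match_duration(features: List[Vector], durations: List[int]) -> List[Vector]:
    """Repeat each unit's feature vector for its predicted number of frames."""
    expanded: List[Vector] = []
    for feature, n_frames in zip(features, durations):
        expanded.extend(list(feature) for _ in range(n_frames))
    return expanded


def synthesize_frames(text_units: List[str], speech_type: str) -> List[Vector]:
    """Run fusion, type-specific duration prediction, and duration matching."""
    fused = fuse(text_units, REFERENCE_FEATURES[speech_type])
    predictor = DURATION_NETWORKS[speech_type]
    durations = [predictor(feature) for feature in fused]
    return match_duration(fused, durations)


if __name__ == "__main__":
    frames = synthesize_frames(["a", "bb"], "story")
    print(len(frames), "frames of matched features")
```

Keeping the predictors in a dictionary keyed by speech type mirrors the idea that each speech type gets its own duration model, so the same text yields a different frame count, and therefore a different rhythm, depending on the type requested.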

Description

Technical field

[0001] The present application relates to the technical field of communication, and in particular to a speech synthesis method, device, computer equipment, storage medium and product.

Background

[0002] Speech synthesis technology, also known as Text to Speech (TTS), converts text into corresponding audio content through certain rules or model algorithms. Its function is to convert text information, whether generated by the computer itself or input externally, into intelligible, fluent speech and read it aloud. Traditional speech synthesis technology is mainly based on the waveform splicing method or the statistical parameter method. The splicing method needs to collect the waveforms corresponding to all pronunciation units in advance, and obtains the corresponding speech through waveform splicing. The statistical parameter method needs to first model the spectral characteristic parameters of existing audio, to construct a mapping relationship between ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/02, G10L13/047, G10L13/10
Inventors: 林诗伦, 蒙力, 苏文超, 李新辉, 卢鲤
Owner: TENCENT TECH (SHENZHEN) CO LTD