Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Coding method and device for speech synthesis

A technology of speech synthesis and coding method, which is applied in speech synthesis, speech analysis, instruments, etc., and can solve the problems of noise, long waiting time for synthesis, and occupying a large amount of memory to store pcm.

Pending Publication Date: 2021-09-10
SHENZHEN TONGXINGZHE TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] 1. One-time synthesis: input the text into the speech synthesis engine, obtain the encoded pcm data at one time, and pass the pcm data to the player for playback at one time; this method requires a large amount of memory to store pcm, and the waiting time for synthesis is longer , you need to wait for all the data to be synthesized before starting to play
[0004] 2. Sleep during streaming synthesis: While synthesizing pcm data, stuff it into the player for processing, sleep for a certain period of time during the synthesis process, continue to synthesize and play; the data block size of a single synthesis in this method is fixed, there are cpu fluctuations, sleep time If the sleep time is too short, the cpu may be too high. If the sleep time is too long, the player may stop broadcasting or cause noise.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Coding method and device for speech synthesis
  • Coding method and device for speech synthesis

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] In order to make the purpose, technical solutions and advantages of the present disclosure clearer, the present disclosure will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0047] In order to achieve the above object, an embodiment of the present invention provides a speech synthesis coding method, including:

[0048] Start to synthesize text data into pcm stream data, and store the pcm stream data in a buffer;

[0049] Dynamically calculate the initial buffer threshold Tstart required to start playing according to the current system load;

[0050] If the buffer buffer duration is longer than the initial buffer threshold Tstart, read the pcm stream data in the buffer to play;

[0051] Dynamically calculate the continuous buffer threshold Tblock required for continuous playback according to the current system load;

[0052] If the buffer buffer duration is longer than the continuous buf...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

One or more embodiments of the invention provide a coding method and device for speech synthesis. According to the coding method, after text data is synthesized into pcm stream data, the initial buffer threshold T(start) required for starting playing is dynamically calculated according to a current system load condition, and if the buffer duration of a buffer area is greater than the initial buffer threshold T(start), the pcm stream data in the buffer area is read and played; a continuous buffer threshold T(block) required by continuous playing is dynamically calculated according to the current system load condition; and in the process of playing, whether the text data continues to be synthesized into the pcm stream data or synthesis is paused is judged according to a relation between the buffering duration of the buffer area and the continuous buffer threshold T(block), so the stability and smoothness of playing are guaranteed, and smooth occupation of the cpu and the memory is also achieved.

Description

technical field [0001] The present invention relates to the technical field of speech synthesis methods, in particular to a speech synthesis encoding method and device. Background technique [0002] The current speech synthesis (text-to-speech) encoding playback schemes are: [0003] 1. One-time synthesis: input the text into the speech synthesis engine, obtain the encoded pcm data at one time, and pass the pcm data to the player for playback at one time; this method requires a large amount of memory to store pcm, and the waiting time for synthesis is longer , you need to wait until all the data is synthesized before starting to play. [0004] 2. Sleep during streaming synthesis: While synthesizing pcm data, stuff it into the player for processing, sleep for a certain period of time during the synthesis process, continue to synthesize and play; the data block size of a single synthesis in this method is fixed, there are cpu fluctuations, sleep time If the sleep time is too...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L13/02G10L13/04
CPCG10L13/02G10L13/04
Inventor 皮碧虹杨德文龙丁奋
Owner SHENZHEN TONGXINGZHE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products