Method, device and system for solving pulse signal generated at splicing position in speech synthesis

A speech synthesis technology for handling pulse signals, applied in speech synthesis, speech analysis, instruments, etc. It addresses the tendency of spliced speech to produce noise, and achieves smoother speech while avoiding the noise phenomenon.

Pending Publication Date: 2021-03-26
BEIJING UNISOUND INFORMATION TECH +1

AI Technical Summary

Problems solved by technology

[0007] One or more embodiments of this specification describe a method, device and system for solving the pulse signal generated at the splicing point in speech synthesis. They address the problem in current technology that a pulse signal is often generated at the splicing point of two speech segments during speech synthesis, which tends to produce noise.



Examples


Embodiment Construction

[0039] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the invention, not to limit it. It should also be noted that, for convenience of description, only the parts related to the invention are shown in the drawings.

[0040] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0041] Figure 1 shows a flow chart of a method for solving the pulse signal generated at the splicing point in speech synthesis in an embodiment. The execution subject of the method can be any apparatus, device, platform, or device cluster with computing and processing ca...


Abstract

The invention provides a method, a device and a system for solving a pulse signal generated at a splicing position in speech synthesis. The method comprises the following steps: extracting two speech segments to be spliced from a database as a first speech segment and a second speech segment respectively, and reserving N sampling points from the first speech segment and the second speech segment as overlapping parts, wherein N is greater than 256; calculating a fade-out (gradual-out) coefficient vector and a fade-in (gradual-in) coefficient vector according to the sampling points; using the fade-out coefficient vector and the fade-in coefficient vector to obtain the sampling point values of the overlapping part; and completing the splicing of the first speech segment and the second speech segment based on the obtained sampling point values of the overlapping part. The method does not concern itself with the root cause of the pulse signal generated at the splicing position during speech synthesis. Instead, it achieves a speech smoothing effect through a weighted average of the sampling point values of the preceding and following segments, which greatly reduces the negative influence of the pulse signal on the overall rhythm, tone quality and listening experience of the speech, so that the noise phenomenon can be reduced or avoided.
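The steps in the abstract can be illustrated with a minimal sketch. The abstract does not state how the fade-out (gradual-out) and fade-in (gradual-in) coefficient vectors are computed, so the linear ramps below are an assumption, as is the choice to overlap the last N samples of the first segment with the first N samples of the second. The function names `fade_coefficients` and `splice` are hypothetical and do not come from the patent.

```python
def fade_coefficients(n):
    """Return (fade_out, fade_in) coefficient vectors of length n.

    Assumption: linear ramps. The source only says the vectors are
    calculated "according to the sampling points"; any pair of weights
    that sums to 1 at each position gives a weighted average.
    """
    if n < 2:
        raise ValueError("n must be at least 2")
    fade_in = [i / (n - 1) for i in range(n)]   # rises 0.0 -> 1.0
    fade_out = [1.0 - w for w in fade_in]       # falls 1.0 -> 0.0
    return fade_out, fade_in


def splice(first, second, n):
    """Splice two sample sequences with an n-sample weighted overlap."""
    if n <= 256:
        # The abstract requires N to be greater than 256.
        raise ValueError("n must be greater than 256")
    if len(first) < n or len(second) < n:
        raise ValueError("both segments need at least n samples")

    fade_out, fade_in = fade_coefficients(n)
    tail = first[-n:]    # assumed overlap region of the first segment
    head = second[:n]    # assumed overlap region of the second segment

    # Weighted average of the two segments at each overlapping position.
    overlap = [fo * a + fi * b
               for fo, a, fi, b in zip(fade_out, tail, fade_in, head)]

    return list(first[:-n]) + overlap + list(second[n:])


# Two constant segments with very different levels: a hard cut would
# jump by 2.0 at the joint, while the overlap spreads it over n samples.
first_segment = [1.0] * 1000
second_segment = [-1.0] * 1000
joined = splice(first_segment, second_segment, 300)
largest_step = max(abs(b - a) for a, b in zip(joined, joined[1:]))
```

Because the overlapping samples are shared, the result is N samples shorter than the two segments laid end to end. Other ramp shapes (for example raised-cosine) would fit the same structure; the linear form is used here only because it is the simplest weighted average.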

Description

Technical field

[0001] One or more embodiments of the present invention relate to the technical field of natural language processing, and in particular to a method, device and system for solving pulse signals generated at splicing points in speech synthesis.

Background

[0002] This section is intended to provide a background or context for implementations of the invention that are recited in the claims. The descriptions herein may include concepts that could be explored, but not necessarily concepts that have been previously thought of or explored. Therefore, unless otherwise indicated herein, what is described in this section is not prior art to the description and claims in this application and is not admitted to be prior art by inclusion in this section.

[0003] With the rapid development of intelligent voice technology, voice interaction has become a necessary solution for human-computer interaction in many smart devices. For example, more and more enterpris...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/02, G10L13/033
CPC: G10L13/02, G10L13/033
Inventor: 高洋
Owner: BEIJING UNISOUND INFORMATION TECH