
Voice splicing method and voice splicing device

A voice splicing technology, applied in voice analysis, voice synthesis, instruments, and related fields, that addresses the problem that the naturalness of the rhythm at the splicing point cannot be guaranteed. It improves naturalness and fluency and yields a finer prosody representation.

Pending Publication Date: 2022-07-29
MIDEA GRP (SHANGHAI) CO LTD +1

AI Technical Summary

Problems solved by technology

For different clauses, traditional splicing methods simply process the voice file corresponding to each clause and then splice the files together, for example waveform-based splicing and splicing based on statistical parameter modeling. These methods require a preset voice splicing unit library or complex algorithms to smooth the waveform joints, and they cannot guarantee the naturalness of the rhythm at the joints.



Embodiment Construction

[0048] The embodiments of the present application will be described in further detail below with reference to the accompanying drawings and examples. The following examples are used to illustrate the application, but not to limit the scope of the application.

[0049] In the description of this specification, reference to the terms "one embodiment," "some embodiments," "example," "specific example," or "some examples" means that specific features, structures, materials, or characteristics described in connection with the embodiment or example are included in at least one embodiment or example of the present application. In this specification, schematic representations of the above terms are not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, those skilled in ...


Abstract

The invention relates to the field of speech synthesis and provides a speech splicing method and device. The method comprises the following steps. First, a rhythm phoneme sequence of a target text is segmented to generate a plurality of clause sequences, where the rhythm phoneme sequence comprises a plurality of phonemes corresponding to the target text and rhythm identifiers located between adjacent phonemes. Second, voice synthesis is carried out on the clause sequences to generate multiple pieces of first clause voice information, where the first clause voice information comprises a first duration corresponding to each rhythm identifier and phoneme. Third, the pieces of first clause voice information are spliced to generate the target voice, based on the first duration and on the segmentation order, within the rhythm phoneme sequence, of the clause sequence corresponding to each piece of first clause voice information. The method improves the naturalness and fluency at the joints between adjacent pieces of first clause voice information without requiring a preset voice splicing unit library and without smoothing the voice units to be spliced.
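
The three steps in the abstract can be illustrated with a small sketch. It is not the applicant's implementation. The marker names ("#1", "#3"), the choice of "#3" as the clause boundary, the fixed durations, the padding frames, and every function and class name below are hypothetical. In particular, the abstract does not say how the first duration is used during splicing; the sketch assumes it is used to trim each clause to its predicted length so that the pause at a joint comes from the rhythm identifier itself.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical rhythm identifiers; the source does not name the markers.
RHYTHM_IDS = {"#1", "#3"}
CLAUSE_BREAK = "#3"   # assumed identifier at which the sequence is segmented
PAD_FRAMES = 4        # assumed trailing padding produced by a synthesizer


@dataclass
class ClauseSpeech:
    order: int             # segmentation order of the clause in the sequence
    durations: List[int]   # "first duration" per token, in frames
    frames: List[float]    # stand-in for synthesized audio frames


def segment(sequence: List[str]) -> List[List[str]]:
    """Split the rhythm phoneme sequence after each CLAUSE_BREAK identifier."""
    clauses, current = [], []
    for token in sequence:
        current.append(token)
        if token == CLAUSE_BREAK:
            clauses.append(current)
            current = []
    if current:
        clauses.append(current)
    return clauses


def synthesize(order: int, tokens: List[str]) -> ClauseSpeech:
    """Placeholder synthesizer: fixed durations instead of a real TTS model."""
    durations = [2 if t in RHYTHM_IDS else 3 for t in tokens]
    frames: List[float] = []
    for token, duration in zip(tokens, durations):
        # Rhythm identifiers become silence (0.0), phonemes dummy 1.0 frames.
        frames.extend([0.0 if token in RHYTHM_IDS else 1.0] * duration)
    frames.extend([0.0] * PAD_FRAMES)  # simulated excess silence at the end
    return ClauseSpeech(order, durations, frames)


def splice(clauses: List[ClauseSpeech]) -> List[float]:
    """Join clause audio in segmentation order.

    Assumption: each clause is trimmed to the sum of its first durations,
    so the joint keeps only the pause predicted for the rhythm identifier.
    """
    target: List[float] = []
    for clause in sorted(clauses, key=lambda c: c.order):
        keep = sum(clause.durations)
        target.extend(clause.frames[:keep])
    return target


sequence = ["n", "i", "#1", "h", "ao", "#3", "sh", "i", "#1", "j", "ie", "#3"]
speech = [synthesize(i, tokens) for i, tokens in enumerate(segment(sequence))]
target_voice = splice(speech)
```

Because each piece of clause voice information carries its own segmentation order, the clauses could be synthesized independently (for example in parallel) and still be joined correctly afterwards.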

Description

Technical Field

[0001] The present application relates to the technical field of speech synthesis, and in particular, to a speech splicing method and a speech splicing device.

Background

[0002] Text to Speech (TTS) technology is widely used in the field of speech synthesis. For different clauses, traditional splicing methods only perform simple processing on the speech file corresponding to each clause and then splice the speech files together, for example waveform splicing and statistical parameter modeling splicing. These methods require a preset speech splicing unit library or complex algorithms to smooth the waveform joints, and they cannot guarantee the naturalness of the rhythm at the splicing point.

Summary of the Invention

[0003] The present application aims to solve at least one of the technical problems existing in the prior art. To this end, the present application proposes a speech splicing method.

[0004] The present application also proposes a ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/02, G10L13/04, G10L13/08, G10L13/10
CPC: G10L13/02, G10L13/04, G10L13/08, G10L13/10
Inventor: 高羽, 刘雪铃
Owner: MIDEA GRP (SHANGHAI) CO LTD