Corpus optimization method and device

A corpus optimization method, applicable to speech analysis, speech recognition, and speech synthesis, that addresses the high cost and complexity of existing optimization processes and achieves low cost, high pronunciation quality, and accurate pronunciation.

Active Publication Date: 2020-05-26
BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD

AI Technical Summary

Problems solved by technology

[0004] The above corpus optimization process requires improving the synthesis algorithm, which is complicated and costly.


Embodiment Construction

[0055] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on these embodiments without creative effort fall within the protection scope of the present invention.

[0056] At present, in the speech synthesis method based on waveform splicing, a corpus is established in advance. During speech synthesis, the text to be synthesized is analyzed to obtain prosodic information and the like, the prosodic information is analyzed, candidate sound segments are selected from the corpus, and the candidate sound segmen...


Abstract

Embodiments of the invention provide a method and a device for corpus optimization. For flawed speech in a first audio, a first sound segment corresponding to the flawed speech is determined from a corpus, and the first sound segment is then marked in the corpus as an illegal sound segment. In subsequent audio synthesis, candidate sound segments are selected from the legal sound segments in the corpus, the optimal sound segment is selected from the candidates, and waveform concatenation is performed on the optimal sound segment to obtain synthesized audio. Because the corpus is optimized by marking sound segments in it, no code needs to be modified, and the optimization process is simple and low in cost.
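The marking step described in the abstract can be illustrated with a minimal Python sketch. It is not the patent's implementation: the names `SoundSegment`, `Corpus`, `mark_illegal`, and `candidates`, and the idea of matching candidates by a simple text unit string, are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class SoundSegment:
    # Hypothetical record for one recorded segment in the corpus.
    segment_id: str
    text_unit: str      # the text unit this recording covers
    legal: bool = True  # set to False once the segment is marked illegal


@dataclass
class Corpus:
    # Hypothetical in-memory corpus keyed by segment id.
    segments: dict = field(default_factory=dict)

    def add(self, segment: SoundSegment) -> None:
        self.segments[segment.segment_id] = segment

    def mark_illegal(self, segment_id: str) -> None:
        # Optimization happens here: only the corpus annotation changes,
        # the synthesis code itself is left untouched.
        self.segments[segment_id].legal = False

    def candidates(self, text_unit: str) -> list:
        # Candidate selection only considers segments still marked legal.
        return [
            s for s in self.segments.values()
            if s.legal and s.text_unit == text_unit
        ]


corpus = Corpus()
corpus.add(SoundSegment("seg-001", "ni"))
corpus.add(SoundSegment("seg-002", "ni"))
corpus.add(SoundSegment("seg-003", "hao"))

# Suppose a listener found that "seg-002" produced the flawed speech.
corpus.mark_illegal("seg-002")
remaining = [s.segment_id for s in corpus.candidates("ni")]
```

After the call to `mark_illegal`, later synthesis runs skip the flawed segment without any change to the selection logic, which is the low-cost property the abstract claims.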

Description

Technical field

[0001] Embodiments of the present invention relate to the technical field of intelligent information processing, and in particular to a method and device for optimizing a corpus.

Background

[0002] Speech synthesis, also known as text-to-speech (TTS) technology, converts text information into audible sound information. The speech synthesis method based on waveform splicing is currently the mainstream speech synthesis method.

[0003] In the speech synthesis method based on waveform splicing, for each text segment of the text to be synthesized, the synthesis algorithm selects candidate sound segments from a pre-recorded and annotated corpus according to linguistic features and acoustic parameters. The optimal sound segment is then selected according to the target cost and link cost of the candidate sound segments, and then the volume or speech rate of the optimal sound c...
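Selecting optimal segments by target cost and link cost is commonly done with dynamic programming over the candidate lists. The sketch below shows that general technique, not the synthesis algorithm of this patent, which the text does not specify. The function `select_optimal`, the `(segment_id, pitch)` tuples, and both cost functions are hypothetical.

```python
def select_optimal(candidates_per_unit, target_cost, link_cost):
    """Pick one candidate per text unit, minimizing total target + link cost."""
    if not candidates_per_unit or any(not c for c in candidates_per_unit):
        raise ValueError("every text unit needs at least one candidate")

    # best[i][j] = (cheapest total cost of a path ending at candidate j
    #               of unit i, index of the predecessor candidate)
    best = [[(target_cost(0, c), None) for c in candidates_per_unit[0]]]
    for i in range(1, len(candidates_per_unit)):
        row = []
        for cand in candidates_per_unit[i]:
            joined = [
                best[i - 1][k][0] + link_cost(prev, cand)
                for k, prev in enumerate(candidates_per_unit[i - 1])
            ]
            k_min = min(range(len(joined)), key=joined.__getitem__)
            row.append((joined[k_min] + target_cost(i, cand), k_min))
        best.append(row)

    # Backtrack from the cheapest final candidate.
    j = min(range(len(best[-1])), key=lambda k: best[-1][k][0])
    path = []
    for i in range(len(best) - 1, -1, -1):
        path.append(candidates_per_unit[i][j])
        j = best[i][j][1]
    path.reverse()
    return path


# Toy data: each candidate is (segment_id, pitch); the values are invented.
units = [[("a1", 100), ("a2", 120)], [("b1", 118), ("b2", 90)]]
wanted_pitch = [110, 115]


def pitch_target_cost(i, cand):
    # Hypothetical target cost: distance from the desired pitch of unit i.
    return abs(cand[1] - wanted_pitch[i])


def pitch_link_cost(prev, cand):
    # Hypothetical link cost: pitch jump at the join between two segments.
    return abs(prev[1] - cand[1])


chosen = select_optimal(units, pitch_target_cost, pitch_link_cost)
```

With these toy values the cheapest path is `a2` followed by `b1` (total cost 15). If the candidate lists are built only from segments still marked legal, a flawed segment can never appear on the chosen path.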


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G10L13/02, G10L15/14, G06F16/68
CPC: G10L13/02, G10L15/14
Inventor: 祝晓林, 盖于涛, 周志平
Owner: BAIDU ONLINE NETWORK TECH (BEIJING) CO LTD