Unlock instant, AI-driven research and patent intelligence for your innovation.

Chinese and English mixed speech synthesis method and device

A speech synthesis, Chinese-English technology, applied in speech synthesis, speech analysis, instruments, etc., can solve the problem of difficulty in speech synthesis of Chinese-English mixed text, and achieve the effect of high speech synthesis quality

Active Publication Date: 2020-12-29
SICHUAN CHANGHONG ELECTRIC CO LTD
View PDF8 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The present invention provides a Chinese-English mixed speech synthesis method and device, which are used to solve the problem of difficulty in speech synthesis of Chinese-English mixed text in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese and English mixed speech synthesis method and device
  • Chinese and English mixed speech synthesis method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0041] see figure 1 , a Chinese-English mixed speech synthesis method, including a training phase and an inference phase, including the following steps in the training phase:

[0042] S11. Obtain multi-person Chinese and English speech training data, and extract speech acoustic features to obtain a training data set;

[0043] Optionally, the English speech synthesis data set can use LJSpeech, VCTK and other public data sets, and the Chinese speech synthesis data set can use the female voice database of Biaobei Company and the self-recorded voice database covering the voices of more than 20 people.

[0044] Understandably, Chinese and English speech training data include: Chinese speech data and corresponding Chinese text, English speech data and corresponding English text, Chinese-English mixed speech data and corresponding Chinese-English mixed text; extracted speech acoustic features Including but not limited to Mel spectral features.

[0045] S12. Standardize the English ...

Embodiment 2

[0068] A Chinese-English mixed speech synthesis device, comprising:

[0069] The text processing module is used to normalize the Chinese and English texts and convert them into a unified pinyin phoneme expression;

[0070] Optionally, the text processing module processes the mixed text in Chinese and English differently, standardizes the English text, and eliminates illegal characters; unifies the English text into ASCII code; unifies the English characters into lowercase letters; expands the English abbreviation; uses The CMU pronunciation dictionary converts each English word into a CMU pronunciation phoneme. If the word is not in the CMU dictionary key value, the sentence text and the corresponding voice are removed from the training data; a mapping dictionary between the CMU pronunciation phoneme and the Pinyin phoneme is created; through The mapping dictionary converts CMU pronunciation phonemes into Pinyin phonemes; standardizes Chinese text, screens out illegal characte...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of speech synthesis, aims to solve the speech synthesis problem of Chinese and English mixed texts, and provides a Chinese and English mixed speech synthesis method and device. The method comprises a training stage and a reasoning stage, English words are converted into CMU pronunciation phonemes, and then the CMU pronunciation phonemes are converted into pinyin phonemes; Chinese and English are unified into a representation mode of pinyin phonemes, language marks representing different languages are introduced to distinguish pronunciation characteristics of Chinese and English, and speaker recognition vectors are introduced to distinguish acoustic characteristics of different speakers, so that Chinese and English mixed speech synthesis becomespossible, and high speech synthesis quality is achieved. On the basis of a traditional speech synthesis method, the application scene of speech synthesis in Chinese and English mixing is expanded.

Description

technical field [0001] The invention relates to the technical field of speech processing, in particular to a Chinese-English mixed speech synthesis method and device. Background technique [0002] Speech synthesis is a technology for converting text information into speech information, that is, converting text information into arbitrary audible speech. It involves many disciplines such as acoustics, linguistics, and computer science. However, speech synthesis in different languages ​​is different in various aspects, such as differences in front-end processing, differences in pronunciation characteristics, differences in representation methods, etc. The existing method of synthesizing mixed-language texts is that one anchor speaks multiple languages ​​at the same time Synthesis after collection, which makes speech synthesis of mixed language texts more difficult, and over-reliance on anchors who can speak multiple languages ​​at the same time. Contents of the invention ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L13/02G10L13/08G10L19/16
CPCG10L13/02G10L13/08G10L19/16
Inventor 王昆朱海周琳珉刘书君展华益
Owner SICHUAN CHANGHONG ELECTRIC CO LTD