Chinese mandarin character pronunciation conversion method based on self-attention mechanism
Patent Information
- Authority / Receiving Office
- CN · China
- Current Assignee / Owner
- INST OF ACOUSTICS CHINESE ACAD OF SCI
- Publication Date
- 2020-05-12
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The invention relates to the field of speech synthesis, in particular to a Chinese Mandarin word-to-sound conversion method based on a self-attention mechanism. Background technique
[0002] TTS technology is widely used in e-books, voice assistants, car navigation, voice customer service and other products. In Chinese speech synthesis, whether it is a parametric or sequence-to-sequence model, the phoneme-level modeling unit is compact enough to be trained effectively. The role of phonetic conversion is to map Chinese characters to pronunciation.
[0003] At the heart of transliteration is polyphone disambiguation and tone sandhi, and in some cases, pronunciation is determined by semantics. For example, "also" reads "huan2" such as "return" when it means returning, and reads "hai2" such as "still" when it means still. There is also a part of the tone-changing tone environment, such as two consecutive three-tone readings, the former is usually pronou...