Supercharge Your Innovation With Domain-Expert AI Agents!

Word acoustic feature system and training method and system of word acoustic feature system

A feature system and acoustic feature technology, applied in speech analysis, speech synthesis, speech recognition, etc., can solve the problems of poor synthesis quality, ignoring word pronunciation, only focusing on word meaning, etc., to improve quality and accurate acoustic features of words. Effect

Active Publication Date: 2021-07-13
AISPEECH CO LTD
View PDF3 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] In order to at least solve the problem that the existing models in the existing methods only focus on the meaning of the word and ignore the pronunciation of the word, making the feature vector less effective in improving the quality of speech synthesis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Word acoustic feature system and training method and system of word acoustic feature system
  • Word acoustic feature system and training method and system of word acoustic feature system
  • Word acoustic feature system and training method and system of word acoustic feature system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0046] Such as figure 1 Shown is a schematic structural diagram of a word acoustic feature system provided by an embodiment of the present invention, and the system can be configured in a terminal.

[0047] A word acoustic feature system 10 provided in this embodiment includes: a word encoder 11 and a word-phoneme aligner 12 .

[0048] Wherein, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a training method of a word acoustic feature system. The method comprises the following steps: splicing word acoustic features output by a word acoustic feature system with a phoneme feature sequence output by a phoneme encoder to obtain a phoneme feature sequence with the word acoustic features, and splicing the phoneme feature sequence with actual rhythm features to obtain a phoneme feature sequence with rhythm and word acoustic features; adjusting the coding length, adding pitch and energy characteristics, and performing decoding to obtain a predicted Mel spectrum; and training the word acoustic feature system based on the actual Mel spectrum and the predicted Mel spectrum. The embodiment of the invention further provides a word acoustic feature system and a training system of the word acoustic feature system. According to the embodiment of the invention, the word acoustic features with word meanings and pronunciation are obtained by utilizing the trained word acoustic feature system, and the word acoustic features are more accurate by continuously training the word acoustic feature system, so that the speech synthesis quality is further improved during speech synthesis.

Description

technical field [0001] The invention relates to the field of intelligent speech, in particular to a word acoustic feature system, a training method and a system for the word acoustic feature system. Background technique [0002] End-to-end text-to-speech synthesis models with sequence-to-sequence architectures have achieved great success in generating natural speech. Word features are characterized by text analysis or word vector representations extracted from pre-trained models, and word vector encoders are then aligned and concatenated with phoneme feature sequences (output of phoneme encoders). Ways to obtain these eigenvectors include: [0003] Obtain word features through statistical methods, such as word frequency, etc., and then use text analysis methods to generate word feature vectors; [0004] Extract encoder output from common machine learning tasks (such as translation tasks) as word vectors; [0005] Use the BERT encoding layer to extract word vectors; [00...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L25/24G10L13/10G10L13/02G10L15/14
CPCG10L15/02G10L25/24G10L13/10G10L13/02G10L15/142G10L2015/025
Inventor 俞凯沈飞宇杜晨鹏
Owner AISPEECH CO LTD
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More