Word acoustic feature system and training method and system of word acoustic feature system

A word acoustic feature system and acoustic feature technology, applied in speech analysis, speech synthesis, and speech recognition. It addresses poor synthesis quality caused by models that focus only on word meaning and ignore word pronunciation, and it yields more accurate word acoustic features and improved synthesis quality.

Active Publication Date: 2021-07-13

AISPEECH CO LTD

3 Cites, 2 Cited by

Problems solved by technology

[0009] The invention aims to solve at least the following problem: the models in existing methods focus only on the meaning of a word and ignore its pronunciation, which makes the resulting feature vectors less effective at improving the quality of speech synthesis.

Embodiment Construction

[0045] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0046] Figure 1 is a schematic structural diagram of a word acoustic feature system provided by an embodiment of the present invention. The system can be configured in a terminal.

[0047] The word acoustic feature system 10 provided in this embodiment includes a word encoder 11 and a word-phoneme aligner 12.

[0048] Wherein, ...
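The role of a word-phoneme aligner can be illustrated with a minimal sketch. The paragraph describing the aligner is truncated in this listing, so the code below is not the patent's method. It assumes one common alignment scheme: each word-level vector is repeated once for every phoneme in that word, so that the result has the same length as the phoneme sequence. The function name `align_words_to_phonemes`, the feature values, and the phoneme counts are all hypothetical.

```python
from typing import List, Sequence

Vector = List[float]


def align_words_to_phonemes(
    word_features: Sequence[Vector],
    phonemes_per_word: Sequence[int],
) -> List[Vector]:
    """Repeat each word vector once per phoneme of that word (assumed scheme)."""
    if len(word_features) != len(phonemes_per_word):
        raise ValueError("need exactly one phoneme count per word")
    aligned: List[Vector] = []
    for vec, count in zip(word_features, phonemes_per_word):
        if count < 0:
            raise ValueError("phoneme counts must be non-negative")
        # Copy the vector so later edits to one frame do not affect the others.
        aligned.extend(list(vec) for _ in range(count))
    return aligned


# Hypothetical 2-dimensional word features for two words.
word_features = [[0.1, 0.2], [0.3, 0.4]]
# Assumed phoneme counts per word (for example, 4 phonemes each).
phonemes_per_word = [4, 4]

aligned = align_words_to_phonemes(word_features, phonemes_per_word)
print(len(aligned))  # one vector per phoneme
```

After this step the word-level sequence and the phoneme-level sequence have equal length, which is the precondition for concatenating them position by position.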

Abstract

An embodiment of the invention provides a training method for a word acoustic feature system. The method comprises: splicing the word acoustic features output by the word acoustic feature system with the phoneme feature sequence output by a phoneme encoder to obtain a phoneme feature sequence carrying the word acoustic features; splicing that sequence with the actual rhythm features to obtain a phoneme feature sequence carrying both rhythm and word acoustic features; adjusting the coding length, adding pitch and energy features, and decoding to obtain a predicted Mel spectrum; and training the word acoustic feature system based on the actual Mel spectrum and the predicted Mel spectrum. Embodiments of the invention further provide a word acoustic feature system and a training system for the word acoustic feature system. The trained system yields word acoustic features that capture both word meaning and pronunciation, and continued training makes these features more accurate, which in turn improves speech synthesis quality.
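The data flow of the training steps in the abstract can be sketched in plain Python. This is only an illustration under stated assumptions: the abstract does not specify the decoder, the loss function, or any dimensions, so `toy_decoder` is a placeholder rather than a real model, the mean absolute error in `l1_loss` is an assumed choice of loss, and every name and number below is hypothetical. No parameter update is performed; the sketch stops at the loss value that a training procedure would minimize.

```python
from typing import List

Vector = List[float]
Seq = List[Vector]


def splice(a: Seq, b: Seq) -> Seq:
    """Concatenate two equal-length sequences along the feature axis."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return [x + y for x, y in zip(a, b)]


def regulate_length(seq: Seq, durations: List[int]) -> Seq:
    """Adjust the coding length: repeat phoneme i for durations[i] frames."""
    if len(seq) != len(durations):
        raise ValueError("need one duration per phoneme")
    frames: Seq = []
    for vec, d in zip(seq, durations):
        frames.extend(list(vec) for _ in range(d))
    return frames


def add_pitch_energy(frames: Seq, pitch: Vector, energy: Vector) -> Seq:
    """Append a per-frame pitch and energy value to every frame vector."""
    if not (len(frames) == len(pitch) == len(energy)):
        raise ValueError("need one pitch and energy value per frame")
    return [f + [p, e] for f, p, e in zip(frames, pitch, energy)]


def toy_decoder(frames: Seq, n_mels: int = 2) -> Seq:
    """Placeholder decoder (assumption): maps each frame to n_mels values."""
    return [[sum(f) / len(f)] * n_mels for f in frames]


def l1_loss(predicted: Seq, actual: Seq) -> float:
    """Mean absolute error between two Mel spectra (assumed loss)."""
    total = sum(abs(p - a) for pf, af in zip(predicted, actual)
                for p, a in zip(pf, af))
    count = sum(len(f) for f in actual)
    return total / count


# Hypothetical inputs for a 3-phoneme utterance.
phoneme_features = [[1.0], [2.0], [3.0]]       # phoneme encoder output
word_acoustic = [[0.5], [0.5], [0.7]]          # already aligned per phoneme
rhythm_features = [[0.0], [1.0], [0.0]]        # actual rhythm features
durations = [2, 1, 3]                          # frames per phoneme

h = splice(phoneme_features, word_acoustic)    # phonemes + word acoustics
h = splice(h, rhythm_features)                 # + rhythm
frames = regulate_length(h, durations)         # phoneme level -> frame level
frames = add_pitch_energy(frames, [0.1] * 6, [0.2] * 6)
predicted_mel = toy_decoder(frames)
actual_mel = [[0.0, 0.0] for _ in range(6)]    # hypothetical target
loss = l1_loss(predicted_mel, actual_mel)
print(len(frames), len(frames[0]), loss)
```

In a real system the gradient of this loss would flow back through the decoder into the word acoustic feature system, which is how comparing predicted and actual Mel spectra can train the word-level features.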

Description

Technical field

[0001] The invention relates to the field of intelligent speech, and in particular to a word acoustic feature system and to a training method and training system for the word acoustic feature system.

Background

[0002] End-to-end text-to-speech synthesis models with sequence-to-sequence architectures have achieved great success in generating natural speech. Word features are represented either through text analysis or through word vectors extracted from pre-trained models; the word vector encoder outputs are then aligned and concatenated with the phoneme feature sequence (the output of the phoneme encoder). Ways to obtain these feature vectors include:

[0003] Obtaining word features through statistical methods, such as word frequency, and then using text analysis methods to generate word feature vectors;

[0004] Extracting the encoder output of common machine learning tasks (such as translation tasks) as word vectors;

[0005] Using the BERT encoding layer to extract word vectors; [00...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G10L15/02, G10L25/24, G10L13/10, G10L13/02, G10L15/14

CPC: G10L15/02, G10L25/24, G10L13/10, G10L13/02, G10L15/142, G10L2015/025

Inventor: 俞凯, 沈飞宇, 杜晨鹏

Owner: AISPEECH CO LTD
