Prosodic phrase recognition method, device, and electronic equipment

A prosodic phrase recognition technology, applied to prosodic phrase recognition methods, devices, and electronic equipment, that addresses inaccurate prosodic phrase recognition caused by training the model on a single type of sample. By enriching the training samples, it avoids low recognition accuracy and improves the accuracy of the recognized prosodic phrases.

Pending Publication Date: 2020-09-08
数据堂(北京)智能科技有限公司

AI Technical Summary

Problems solved by technology

[0004] However, the above scheme relies only on the prosodic labeling of the text, so the model is trained on a single type of sample, which may make the finally recognized prosodic phrases inaccurate.



Examples


Embodiment Construction

[0057] At present, there is an artificial-intelligence-based scheme for labeling prosody prediction samples. Using sample audio files and the corresponding text sequences, it obtains the text features and pronunciation duration of each word in the text sequence, and then labels the text sequence with a pre-trained prosodic phrase recognition model. Prosodic phrases are intermediate rhythmic chunks between prosodic words and intonation phrases.

[0058] The inventors of the present application have found through research that the above schemes mainly predict the boundary points of prosodic phrases through machine learning and deep learning, or realize prosodic phrase recognition through model fusion. In these schemes, however, only text is used as the training sample, so the prosodic phrase recognition model relies solely on text features to identify prosodic phrases, and there may be cases where the recognition is accu...


Abstract

The invention discloses a prosodic phrase recognition method, a device, and electronic equipment. The method comprises: obtaining target data to be recognized, which comprises at least text data and audio data corresponding to the text data, the text data comprising at least one sentence; obtaining a text feature code corresponding to the text data and an acoustic feature code corresponding to the audio data; processing the text feature code and the acoustic feature code to obtain multi-modal features relating to the alignment of text and audio; and inputting the multi-modal features into a pre-trained prosody recognition model to obtain the prosodic phrase sequence output by the model. The prosodic phrase sequence comprises a plurality of prosodic phrases, which are segmented at least by prosody symbols, and the prosody recognition model is trained on at least two sentence samples with prosodic phrase labels and the audio samples corresponding to those sentence samples.
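
The step that combines the text feature code and the acoustic feature code could be sketched as follows. This is only an illustration under stated assumptions, not the method claimed in the patent: the abstract does not say how alignment or fusion is performed, so the sketch assumes that each token comes with a known span of audio frames, mean-pools the acoustic frames inside that span, and concatenates the result with the token's text vector. The function name `align_and_fuse` and all parameter names are hypothetical.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def align_and_fuse(
    text_codes: Sequence[Vector],
    acoustic_frames: Sequence[Vector],
    token_spans: Sequence[Tuple[int, int]],
) -> List[Vector]:
    """Build one fused vector per token (hypothetical helper).

    text_codes:      one text feature vector per token.
    acoustic_frames: one acoustic feature vector per audio frame.
    token_spans:     half-open (start_frame, end_frame) per token; assumed
                     to come from some external text-audio alignment.
    """
    if len(text_codes) != len(token_spans):
        raise ValueError("need exactly one frame span per token")

    fused: List[Vector] = []
    for text_vec, (start, end) in zip(text_codes, token_spans):
        frames = acoustic_frames[start:end]
        if not frames:
            raise ValueError(f"empty frame span ({start}, {end})")
        dim = len(frames[0])
        # Mean-pool the acoustic frames that fall inside this token's span.
        pooled = [sum(frame[d] for frame in frames) / len(frames) for d in range(dim)]
        # Concatenate text and pooled acoustic features (assumed fusion rule).
        fused.append(list(text_vec) + pooled)
    return fused


# Two tokens, three audio frames; the first token covers frames 0-1.
features = align_and_fuse(
    text_codes=[[1.0, 0.0], [0.0, 1.0]],
    acoustic_frames=[[2.0], [4.0], [6.0]],
    token_spans=[(0, 2), (2, 3)],
)
```

In a real system the fused vectors would be fed to the trained prosody recognition model; pooling and concatenation are used here only because they are the simplest way to show per-token text and audio features being combined.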

Description

Technical field

[0001] The present application relates to the technical field of text recognition, in particular to a prosodic phrase recognition method, device, and electronic equipment.

Background technique

[0002] Prosody is an important element of language communication and a concept combining hearing and perception. In spoken language, some words naturally group together while others are clearly spaced or separated from each other; these natural groupings are prosodic phrases. Prosodic phrase recognition refers to determining whether there is a prosodic boundary after a given word. For example, prosodic phrase recognition of "Xiaochi Chunshui Dipping Mingxia" yields "Xiaochi #1 Chunshui #1 Dipping Mingxia #4", where "Xiaochi", "Chunshui", and "Dipping Mingxia" are the recognized prosodic phrases. The phrases are separated by the symbol "#", and the number after each "#" represents the pause level.

[0003] In current schemes for identifying prosodic phrases, sentence...
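
The "#"-plus-level annotation described in [0002] can be read and written with a few lines of code. The sketch below is illustrative only: the function names `parse_prosody` and `format_prosody` are hypothetical, and it assumes that every phrase is followed by a "#" and an integer pause level, as in the example above.

```python
import re
from typing import List, Tuple

# A phrase is any run of text followed by "#" and an integer pause level.
_PHRASE = re.compile(r"(.+?)\s*#(\d+)")


def parse_prosody(annotated: str) -> List[Tuple[str, int]]:
    """Split an annotated sentence into (phrase, pause_level) pairs.

    Text after the last "#<level>" marker, if any, is ignored.
    """
    return [(m.group(1).strip(), int(m.group(2))) for m in _PHRASE.finditer(annotated)]


def format_prosody(phrases: List[Tuple[str, int]]) -> str:
    """Inverse of parse_prosody: rebuild the annotated sentence."""
    return " ".join(f"{text} #{level}" for text, level in phrases)


annotated = "Xiaochi #1 Chunshui #1 Dipping Mingxia #4"
phrases = parse_prosody(annotated)
# phrases == [("Xiaochi", 1), ("Chunshui", 1), ("Dipping Mingxia", 4)]
```

Keeping the pause level as an integer next to each phrase makes it straightforward to filter boundaries by level or to convert the annotation into per-word boundary labels for training.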


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L13/10, G10L25/51, G06F40/289
CPC: G10L13/10, G10L25/51, G06F40/289
Inventor: 高岩, 贾晓丰, 张晰, 王大亮, 赵聃, 齐红威
Owner: 数据堂(北京)智能科技有限公司