Prosodic hierarchy annotation method and device

A prosody-level, long-short-term memory technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of limited expansion range of contextual features and wrong transmission, and achieve the effect of solving limited expansion range of contextual features and avoiding wrong transmission

Active Publication Date: 2015-12-23
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF5 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For this reason, an object of the present invention is to propose a prosodic level labeling method, which is based on a two-way long-short-term memory model to label the prosodic level, effectively solvin...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Prosodic hierarchy annotation method and device
  • Prosodic hierarchy annotation method and device
  • Prosodic hierarchy annotation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0015] The embodiments of the present invention are described in detail below. Examples of the embodiments are shown in the accompanying drawings, in which the same or similar reference numerals indicate the same or similar elements or elements with the same or similar functions. The embodiments described below with reference to the accompanying drawings are exemplary, and are intended to explain the present invention, but should not be construed as limiting the present invention.

[0016] The following describes the prosodic level labeling method and device according to the embodiments of the present invention with reference to the drawings.

[0017] figure 2 It is a flowchart of a prosodic level labeling method according to an embodiment of the present invention.

[0018] Such as figure 2 As shown, the prosodic level labeling method may include:

[0019] S1. Get the text sequence.

[0020] For example, the text sequence is "The collision of old and new ideas was fierce."

[0021] S2...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a prosodic hierarchy annotation method and a prosodic hierarchy annotation device. The prosodic hierarchy annotation method comprises the steps of: S1, acquiring a text sequence; S2, segmenting the text sequence into a plurality of participles, and extracting features of the participles; S3, regarding the features as input, and acquiring corresponding output results based on a two-way long/short-term memory model; and S4, annotating prosodic hierarchies of the text sequence according to the output results. According to the prosodic hierarchy annotation method and the prosodic hierarchy annotation device disclosed by the embodiment of the invention, the prosodic hierarchies are annotated based on the two-way long/short-term memory model, the problem of limited extension range of contextual features of the participles in the text sequence is effectively solved, and the prosodic hierarchies are annotated at one time, thus the problem of error transfer in annotation can be avoided.

Description

Technical field [0001] The present invention relates to the technical field of text-to-speech conversion, and in particular to a method and device for prosodic level marking. Background technique [0002] Speech synthesis, also known as text-to-speech technology, is a technology that can convert text information into speech and read it aloud. The main evaluation indexes of speech synthesis system performance mainly include intelligibility and fluency. The existing speech synthesis system has basically matured in terms of intelligibility, but there is still a certain gap between the fluency and the real pronunciation of people. The key factor affecting the fluency of the speech synthesis system is the accuracy of prosodic level prediction. The method of prosody-level prediction mainly uses the characteristics of people's pause in pronunciation, and divides the prosody into different prosody levels according to the length of the pause. The prosodic hierarchy usually includes pro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/22G10L15/187
Inventor 付晓寅李秀林康永国徐扬凯陈志杰
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products