Text processing method and device, electronic equipment and storage medium

A text processing and text technology, applied in the fields of electronic equipment, devices, text processing methods, and computer-readable storage media, can solve the problems of complex and laborious construction and maintenance, and obstacles to the deployment of speech synthesis systems

Active Publication Date: 2021-07-30
BEIJING CENTURY TAL EDUCATION TECH CO LTD
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In this way, polyphone disambiguation and prosodic boundary prediction are handled as separate tasks in the front-end module, and the front-end module becomes a long pipeline, and the construction and maintenance of the front-end module becomes a complex and laborious task. work, the storage and computation of various models in the front-end module also impede the deployment of speech synthesis systems on mobile devices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text processing method and device, electronic equipment and storage medium
  • Text processing method and device, electronic equipment and storage medium
  • Text processing method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0028] refer to Figure 1A , shows a flow chart of the steps of the text processing method according to the first exemplary embodiment of the present disclosure.

[0029] Specifically, the text processing method provided by the present disclosure includes the following steps:

[0030] In step S101, a grammar tree of the text to be processed is generated by a grammar tool.

[0031] In this embodiment, the grammatical tool can be a SyntaxNet toolkit, the text to be processed can be a Chinese text to be speech-synthesized, and the grammatical tree can be understood as a tree representation describing the language dependency between words in the text , which is conducive to understanding the level of the grammatical structure of the text. Simply put, a syntax tree is a tree representation formed when deriving according to preset rules. In the tree structure, directly related words in the text are connected, while others are not directly connected.

[0032] In some optional embodi...

Embodiment 2

[0071] figure 2 It shows a schematic structural diagram of a text processing device according to the second exemplary embodiment of the present disclosure, see figure 2 , the device consists of:

[0072] A generation module 201, configured to generate a syntax tree of the text to be processed through a syntax tool;

[0073] A conversion module 202, configured to convert the syntax tree of the text to be processed to obtain a syntax graph of the text to be processed;

[0074] A grammatical relationship encoding module 203, configured to encode the grammatical relationship of the characters in the text to be processed based on the grammatical graph of the text to be processed, so as to obtain the features of the grammatical relationship of the characters in the text to be processed data;

[0075] Grammatical enhancement processing module 204, configured to perform grammatical enhancement processing on the grammatical relationship of the character based on the grammatical re...

Embodiment 3

[0089] Exemplary embodiments of the present disclosure also provide an electronic device, including: at least one processor; and a memory communicatively connected to the at least one processor. The memory stores a computer program executable by the at least one processor, and when the computer program is executed by the at least one processor, the electronic device executes the text processing method according to the embodiment of the present disclosure.

[0090]Exemplary embodiments of the present disclosure also provide a non-transitory computer-readable storage medium storing a computer program, wherein, when the computer program is executed by a processor of a computer, the computer is used to cause the computer to execute the Text processing method.

[0091] Exemplary embodiments of the present disclosure also provide a computer program product, including a computer program, wherein the computer program is configured to cause the computer to execute the text processing m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a text processing method and device, electronic equipment and a storage medium. The method comprises the steps of generating a syntax tree of a to-be-processed text through a syntax tool; converting the syntax tree to obtain a syntax graph of the text to be processed; on the basis of the syntax graph, grammatical relations of characters in a to-be-processed text being coded, and obtaining grammatical relation feature data of the characters in the to-be-processed text; based on the grammatical relationship feature data of the characters and the obtained semantic feature data of the characters, performing grammatical enhancement processing on the grammatical relationship of the characters to obtain grammatical enhancement feature data of the characters; and performing rhythm boundary prediction and polyphone disambiguation on the characters based on the grammar enhancement feature data of the characters to obtain a rhythm boundary prediction result and a polyphone disambiguation result of the characters. According to the method and the device, while the accuracy of rhythm boundary prediction and polyphone disambiguation is considered, the text processing of the front-end module can be effectively simplified.

Description

technical field [0001] The present invention relates to the technical field of speech synthesis, in particular to a text processing method, device, electronic equipment, and computer-readable storage medium. Background technique [0002] The speech synthesis system is mainly composed of a front-end module and a back-end module. The front-end module is used for text analysis, and the back-end module is used for speech generation. The front-end module lays the foundation for the speech synthesis of the back-end module to ensure smooth speech synthesis. In the Chinese speech synthesis system, the front-end module contains at least two main parts, namely prosodic boundary prediction and polyphone disambiguation. There is a phenomenon of polysemy in Chinese, that is, polyphonic characters. The pronunciation of polyphonic characters will vary greatly in different contexts, and even affect the meaning expressed in the entire sentence. Therefore, in order to ensure the accuracy of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/253G06F40/232G06F40/126G06F40/30
CPCG06F40/126G06F40/232G06F40/253G06F40/30
Inventor 陈帅婷陈昌滨郭少彤
Owner BEIJING CENTURY TAL EDUCATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products