Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Punctuation predicting, labeling and voice processing method, device, equipment and program product

A prediction processing and punctuation technology, applied in the computer field, can solve the problems of low punctuation prediction accuracy, inability to explicitly model punctuation, slow decoding speed, etc., and achieve the effect of improving the punctuation prediction rate

Pending Publication Date: 2022-06-17
ALIBABA GRP HLDG LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The advantage of the sequence tagging framework is its simple structure, but its disadvantage is that it cannot explicitly model the structural relationship of the output punctuation, resulting in a low accuracy rate of punctuation prediction.
The advantage of the sequence-to-sequence framework is that the structural relationship of the output punctuation is explicitly modeled, and the accuracy rate is high. The disadvantage is that the decoder adopts an autoregressive form, and the decoding speed of the decoder will be affected by the length of the input text sequence, resulting in a slower decoding speed.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Punctuation predicting, labeling and voice processing method, device, equipment and program product
  • Punctuation predicting, labeling and voice processing method, device, equipment and program product
  • Punctuation predicting, labeling and voice processing method, device, equipment and program product

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0054] like image 3 As shown, it is a schematic flowchart of a punctuation prediction processing method according to an embodiment of the present invention. The method can be applied to a tool for punctuation processing of text, and can also be applied to a speech recognition model. Voice punctuation. Specifically, it can be applied to the server side or the terminal side, and the method can include:

[0055] S101: Acquire text feature data after semantic encoding of the text to be processed. The text to be processed may be any text that has not been punctuated, or of course, text that has been partially punctuated, and the length of the text to be processed is not limited. The punctuation prediction processing method in the embodiment of the present invention can be implemented by using the structure of an encoder and a decoder, and specifically, an RNN structure can be used to implement the encoder and the decoder respectively, or an attention mechanism-based Transfo...

Embodiment 2

[0069] like Figure 4 As shown, it is a schematic structural diagram of a punctuation prediction processing device according to an embodiment of the present invention. The device can be applied to a tool for punctuation processing of text, and can also be applied to a speech recognition model for identifying Voice punctuation. Specifically, it can be applied to the server side or the terminal side, and the device can include:

[0070] The text feature data acquisition module 11 is used for acquiring the text feature data after semantic encoding of the text to be processed. The text to be processed may be any text that has not been punctuated, or of course, text that has been partially punctuated, and the length of the text to be processed is not limited. The punctuation prediction processing apparatus according to the embodiment of the present invention can be implemented by using the structure of an encoder and a decoder. Specifically, an RNN structure can be used to i...

Embodiment 3

[0083] Embodiments of the present invention also provide a speech recognition method, which can be applied to speech recognition scenarios, such as Figure 5 and Image 6 As shown, it is a schematic diagram of an application scenario of the speech recognition method according to the embodiment of the present invention. Figure 5 and Image 6 The schematic diagram to illustrate the processing process of the above-mentioned speech recognition method, the processing process includes:

[0084] S201: Recognize input speech, and generate speech recognition text. like Figure 5 and Image 6 As shown in the figure, the input speech may be collected from some APPs with speech recognition function, and then the collected input speech is recognized by the speech recognition model to generate speech recognition text. Among them, speech recognition can be done on the client side, such as Figure 5 As shown in the figure, the speech recognition model is deployed on the clie...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to punctuation prediction, labeling and voice processing methods and devices, equipment and a program product. The method comprises the steps of obtaining text feature data after semantic coding is performed on a to-be-processed text; predicting punctuations and punctuation positions in the text to be processed according to the text feature data or according to the text feature data and the identified punctuations and punctuation positions; and outputting the punctuation and the punctuation position of the to-be-processed text. According to the embodiment of the invention, the punctuations and the punctuation positions in the to-be-processed text are predicted according to the text feature data and the identified punctuations and the punctuation positions, and the punctuation prediction processing is not influenced by the length of the input text any more, so that the length of a punctuation prediction structure is greatly reduced, and the effective punctuation prediction rate is greatly reduced.

Description

technical field [0001] The present application relates to a method, device, device and program product for punctuation prediction, labeling, and speech processing, and belongs to the field of computer technology. Background technique [0002] The punctuation prediction system is used to punctuate the text and can be applied in the ASR system. The output text of the original ASR (Automatic Speech Recognition, automatic speech recognition technology) system does not contain punctuation marks, which will greatly affect the readability of the ASR output text and the accuracy of downstream natural language understanding tasks. Adding a punctuation prediction system can solve this problem. [0003] Punctuation prediction systems in the prior art are generally based on two frameworks, one is a sequence labeling framework and the other is a sequence-to-sequence framework. The advantage of the sequence annotation framework is its simple structure, but the disadvantage is that it ca...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/22G06F40/30G06N3/04G06N3/08G10L15/16G10L15/26G10L15/30
CPCG10L15/22G10L15/30G10L15/16G06F40/30G06N3/04G06N3/08
Inventor 陈谦
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products