Method, system, device and medium for automatically adding punctuation marks to text

A punctuation mark and automatic addition technology, applied in the field of speech recognition, can solve problems such as difficulty in meeting the needs of different fields, and achieve the effect of taking into account the effect and processing speed

Active Publication Date: 2022-04-26
GUANGZHOU YUNCONG INFORMATION TECH CO LTD
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

It is difficult for one model to meet the needs of different fields

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, system, device and medium for automatically adding punctuation marks to text
  • Method, system, device and medium for automatically adding punctuation marks to text
  • Method, system, device and medium for automatically adding punctuation marks to text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] Embodiments of the present invention are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other different specific implementation modes, and various modifications or changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present invention. It should be noted that, in the case of no conflict, the following embodiments and features in the embodiments can be combined with each other.

[0065] It should be noted that the diagrams provided in the following embodiments are only schematically illustrating the basic ideas of the present invention, and only the components related to the present invention are shown in the diagrams rather than the number, shape and shape of the compo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A method, system, device and medium for automatically adding punctuation to text, obtaining text by recognizing audio; converting corresponding text into multiple index value sequences, and inputting the multiple index value sequences into a deep neural network model, Obtain the probability distribution of each index value sequence; determine the maximum probability distribution value corresponding to each word in the index value sequence based on the probability distribution of each index value sequence, as the index of the punctuation mark to be added after the word; pass the index Obtain the corresponding punctuation marks from the predetermined punctuation mark index table, and automatically add them to the text sequence to complete the addition of punctuation marks to the text; if the index corresponds to a blank label, the current word is skipped and no punctuation is added to the current word symbol. The invention can realize functions such as automatic punctuation, cross-domain transfer learning and radical adjustment, and can also change the radical of the deep neural network model to meet the requirements of accuracy and recall in different scenarios.

Description

technical field [0001] The invention relates to the technical field of voice recognition, in particular to a method, system equipment and media for automatically adding punctuation marks to text. Background technique [0002] Speech recognition can transcribe speech into corresponding text, but since punctuation marks themselves do not have pronunciation, the transcription result of speech recognition is often a text without punctuation marks. Adding punctuation to transcription results with additional tools can increase the readability of transcription results. Especially in the transcription scenario of long audio, punctuation marks are more critical for humans to understand the content of long texts. Common automatic punctuation tools are implemented by training a deep neural network model. [0003] However, the existing automatic punctuation tools often have the following defects: [0004] 1) The independent punctuation model is often not aimed at the single scene of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/117G06F16/31G06F16/33G06N3/04G06N3/08G06F40/295
CPCG06F40/117G06F16/31G06F16/3346G06N3/04G06N3/08G06F40/295
Inventor 邱实杨学锐
Owner GUANGZHOU YUNCONG INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products