Unlock instant, AI-driven research and patent intelligence for your innovation.

A data processing method, device, equipment and storage medium

A data processing and data technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problems of difficult parallelization, long training time, information loss, etc., and achieve the effect of improving the speech recognition rate and reducing the loss of speech feature information

Active Publication Date: 2022-07-12
TENCENT TECH (SHENZHEN) CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For the probabilistic model method, the model cannot use the context information of each frame, that is, it cannot use historical information to assist the current task; for the deep learning method, although the model can achieve a better convergence effect, due to the recurrent neural network (Recurrent Neural Network , RNN) itself has a loop structure, more RNN units make the training time longer, and it is difficult to parallelize; and the current self-attention mechanism overcomes the above-mentioned problems to a certain extent, but it is affected by the voice in this method. The time windowing technique of the signal will lead to the loss of information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A data processing method, device, equipment and storage medium
  • A data processing method, device, equipment and storage medium
  • A data processing method, device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0037] Artificial intelligence technology is a comprehensive discipline, involving a wide range of fields, including both hardware-level technology and software-level technology. The basic technologies of artificial intelligence generally include technologies such as sensors, special artificial intelligence chips, cloud computing, distributed storage, big data processing technology, operation / interaction systems, and mechatronics. Ar...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the present invention disclose a data processing method, device, device, and storage medium, wherein the method includes: a server acquires a sequence of speech frames according to a preset time window, determines characteristic information of the sequence of speech frames, and determines according to the characteristic information The input information of the first layer of time-truncated self-attention network, for any layer of time-truncated self-attention network, input the input information and the output of the previous layer of time-truncated self-attention network into any layer of time-truncated self-attention network. Self-attention network to train the speech recognition model and get the trained speech recognition model. Through the above embodiment, the input information of the time-truncated self-attention network of the first layer is input into the time-truncated self-attention network of each layer for training, thereby reducing the loss of speech feature information during the training of the speech recognition model. , to improve the speech recognition rate of the speech recognition model.

Description

technical field [0001] The present application relates to the technical field of speech recognition based on artificial intelligence, and in particular, to a data processing method, apparatus, device and storage medium. Background technique [0002] Speech recognition technology, the purpose of which is to receive human speech signals and make machines responsible for converting the speech signals into text. For speech processing, the entire process can be divided into four parts: front-end processing, acoustic model modeling, language model and dictionary modeling, and decoding. [0003] With the research and development of artificial intelligence technology, especially deep learning, the current speech recognition is divided into three types, one is the probability model method, the second is the deep learning method, and the third is the application of self-attention mechanism. For the probabilistic model method, the model cannot use the context information of each frame...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06G10L15/16G10L15/18G10L15/183G10L15/22
CPCG10L15/22G10L15/16G10L15/063G10L15/18G10L15/183
Inventor 曹松军马龙
Owner TENCENT TECH (SHENZHEN) CO LTD