Unlock instant, AI-driven research and patent intelligence for your innovation.

Voice processing method and device based on neural network model, and electronic equipment

A neural network model and speech processing technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problem of large consumption of computing resources and achieve the effect of reducing the amount of computation

Pending Publication Date: 2021-06-11
BIGO TECH PTE LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a speech processing method, device and electronic equipment based on a neural network model, so as to solve the problem of large consumption of computing resources in the process of speech recognition to a certain extent

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice processing method and device based on neural network model, and electronic equipment
  • Voice processing method and device based on neural network model, and electronic equipment
  • Voice processing method and device based on neural network model, and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of this application.

[0030] The terms "first", "second" and the like in the specification and claims of the present application are used to distinguish similar objects, and are not used to describe a specific sequence or sequence. It should be understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments of the application can be practiced in sequences other than those illustrated or described herein, and that references to "first," "second," et...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides a voice processing method and device based on a neural network model, and electronic equipment, and relates to the technical field of voice recognition. The method comprises the following steps: acquiring a voice signal to be processed; selecting a first time point fragment in the voice signal; intercepting a target fragment of the voice signal through a first window by taking the first time point fragment as a reference; and according to the target segment, acquiring a voice recognition character related to the voice signal. According to the method, the perception domain of the encoder core component MHA can be reduced, that is, the unit of each hidden layer only needs to perceive part of voice segments corresponding to the upper layer, and therefore the calculation amount can be reduced.

Description

technical field [0001] The invention relates to the technical field of speech recognition, in particular to a speech processing method, device and electronic equipment based on a neural network model. Background technique [0002] For the current live broadcast software, it is often necessary to supervise the content of the anchors in a large number of live broadcast rooms, including images and sounds. As for the sound, the sound in the live broadcast is mainly the voice spoken by the anchors. For the supervision of voice content, one of the methods is to recognize the voice, convert it into text content, and then screen the text content. [0003] In the process of speech recognition, it is necessary to use the end-to-end deep neural network to model the segmented speech. Among them, the more commonly used loss function has an encoder encoder and attention decoding in the seq2seq structure of the neural network model. There are two parts: attention-decoder, the encoder enco...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/16G10L15/04G10L15/22G10L15/26
CPCG10L15/16G10L15/04G10L15/22G10L15/26
Inventor 唐浩雨
Owner BIGO TECH PTE LTD