Voice processing method and device based on neural network model, and electronic equipment

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A neural network model and speech processing technology, applied in speech analysis, speech recognition, instruments, etc., can solve the problem of large consumption of computing resources and achieve the effect of reducing the amount of computation

Pending Publication Date: 2021-06-11

BIGO TECH PTE LTD

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The present invention provides a speech processing method, device and electronic equipment based on a neural network model, so as to solve the problem of large consumption of computing resources in the process of speech recognition to a certain extent

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0029] The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of this application.

[0030] The terms "first", "second" and the like in the specification and claims of the present application are used to distinguish similar objects, and are not used to describe a specific sequence or sequence. It should be understood that the terms so used are interchangeable under appropriate circumstances such that the embodiments of the application can be practiced in sequences other than those illustrated or described herein, and that references to "first," "second," et...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention provides a voice processing method and device based on a neural network model, and electronic equipment, and relates to the technical field of voice recognition. The method comprises the following steps: acquiring a voice signal to be processed; selecting a first time point fragment in the voice signal; intercepting a target fragment of the voice signal through a first window by taking the first time point fragment as a reference; and according to the target segment, acquiring a voice recognition character related to the voice signal. According to the method, the perception domain of the encoder core component MHA can be reduced, that is, the unit of each hidden layer only needs to perceive part of voice segments corresponding to the upper layer, and therefore the calculation amount can be reduced.

Description

technical field [0001] The invention relates to the technical field of speech recognition, in particular to a speech processing method, device and electronic equipment based on a neural network model. Background technique [0002] For the current live broadcast software, it is often necessary to supervise the content of the anchors in a large number of live broadcast rooms, including images and sounds. As for the sound, the sound in the live broadcast is mainly the voice spoken by the anchors. For the supervision of voice content, one of the methods is to recognize the voice, convert it into text content, and then screen the text content. [0003] In the process of speech recognition, it is necessary to use the end-to-end deep neural network to model the segmented speech. Among them, the more commonly used loss function has an encoder encoder and attention decoding in the seq2seq structure of the neural network model. There are two parts: attention-decoder, the encoder enco...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/16G10L15/04G10L15/22G10L15/26

CPCG10L15/16G10L15/04G10L15/22G10L15/26

Inventor 唐浩雨

Owner BIGO TECH PTE LTD

Voice processing method and device based on neural network model, and electronic equipment

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology