Speech recognition method, device and electronic equipment

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of voice recognition and voice files, which is applied in the field of data processing, can solve problems such as signal distortion, and achieve the effect of improving the accuracy of time stamps

Active Publication Date: 2022-01-21

BEIJING YOUZHUJU NETWORK TECH CO LTD

View PDF7 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

The signal processing flow is as follows: Collect and sample signals: use a microphone or various radio devices to collect analog voice signals, and then use an ADC device (such as an analog-to-digital conversion card) to convert the analog signal into a digital signal, and then according to the Nyquist theory For sampling, if it does not conform to the theory, it will cause signal distortion

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0050] Embodiments of the present disclosure will be described in detail below in conjunction with the accompanying drawings.

[0051] Embodiments of the present disclosure are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present disclosure from the contents disclosed in this specification. Apparently, the described embodiments are only some of the embodiments of the present disclosure, not all of them. The present disclosure can also be implemented or applied through different specific implementation modes, and various modifications or changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present disclosure. It should be noted that, in the case of no conflict, the following embodiments and features in the embodiments can be combined with each other. Based on the embodiments in the present disclosure, a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

An embodiment of the present disclosure provides a speech recognition method, device, and electronic equipment, which belong to the field of data processing technology. The method includes: using pre-set training samples to randomly initialize and train the LSTM network model; based on the set and CTC loss function The related first training parameter and the second training parameter related to KL divergence utilize the training result of the LSTM network to form a loss function for training the BLSTM network; in the process of training the BLSTM network, gradually increasing While increasing the value of the first training parameter, gradually reduce the value of the second training parameter; when the performance index output by the BLSTM network meets the preset requirements, stop the training of the BLSTM network, so that Use the BLSTM network to perform real-time text prediction on the input sound file. Through the processing solution disclosed in the present disclosure, the accuracy of the time stamp predicted by the acoustic network model can be improved.

Description

technical field [0001] The present disclosure relates to the technical field of data processing, and in particular to an acoustic network model training method, device and electronic equipment. Background technique [0002] Speech processing (Speech processing), also known as voice signal processing, human voice processing, its purpose is to make the desired signal, further do voice recognition, apply to the mobile phone interface and even ordinary life, so that people and computers can communicate. [0003] In the process of speech processing, the audio-like sound signal received by a microphone or other devices can be used to process the data through an analog-to-digital conversion device, and finally output through a digital-to-analog conversion device. Therefore, the processing is for digital signals, and the speech signal is a discrete time signal. The signal processing flow is as follows: Collect and sample signals: use a microphone or various radio devices to collect...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L15/06

CPCG10L15/063G10L2015/0631

Inventor 张骏黄露

Owner BEIJING YOUZHUJU NETWORK TECH CO LTD

Speech recognition method, device and electronic equipment

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology