Speech recognition method, device and electronic equipment

A speech recognition technology for voice files, applied in the field of data processing, that addresses problems such as signal distortion and improves the accuracy of predicted time stamps.

Active Publication Date: 2022-01-21
BEIJING YOUZHUJU NETWORK TECH CO LTD

AI Technical Summary

Problems solved by technology

The signal processing flow is as follows. Collect and sample signals: a microphone or other sound-receiving device collects the analog voice signal, an ADC device (such as an analog-to-digital conversion card) converts the analog signal into a digital signal, and sampling is performed according to the Nyquist theory. If the sampling does not conform to the theory, signal distortion results.
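The Nyquist condition mentioned above can be illustrated with a short check. This is a minimal sketch, not part of the patent: the function names `satisfies_nyquist` and `alias_frequency` are hypothetical, and the example frequencies are chosen only for illustration. It assumes the standard statement of the theorem, that the sampling rate must exceed twice the highest frequency in the signal.

```python
def satisfies_nyquist(sample_rate_hz, max_signal_hz):
    """Return True if the sampling rate exceeds twice the highest signal frequency."""
    return sample_rate_hz > 2 * max_signal_hz


def alias_frequency(signal_hz, sample_rate_hz):
    """Frequency at which a pure tone appears after sampling.

    A tone above half the sampling rate folds back into the range
    [0, sample_rate_hz / 2], which is the distortion the text refers to.
    """
    nearest_multiple = sample_rate_hz * round(signal_hz / sample_rate_hz)
    return abs(signal_hz - nearest_multiple)


# Illustrative values only (assumptions, not from the source).
ok = satisfies_nyquist(16000, 7000)      # 16 kHz sampling, 7 kHz content
bad = satisfies_nyquist(8000, 5000)      # 8 kHz sampling, 5 kHz content
folded = alias_frequency(5000, 8000)     # the 5 kHz tone shows up at 3 kHz
```

In the second case the 5 kHz component cannot be distinguished from a 3 kHz component once sampled, so the recorded signal no longer matches the original.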

Examples

Embodiment Construction

[0050] Embodiments of the present disclosure will be described in detail below in conjunction with the accompanying drawings.

[0051] Embodiments of the present disclosure are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present disclosure from the contents disclosed in this specification. Obviously, the described embodiments are only some of the embodiments of the present disclosure, not all of them. The present disclosure can also be implemented or applied through other specific implementations, and the details in this specification can be modified or changed from different viewpoints and for different applications without departing from the spirit of the present disclosure. It should be noted that, where no conflict arises, the following embodiments and the features in them can be combined with each other. Based on the embodiments in the present disclosure, a...

Abstract

An embodiment of the present disclosure provides a speech recognition method, device, and electronic equipment, which belong to the field of data processing technology. The method includes: randomly initializing an LSTM network model and training it with preset training samples; based on a preset first training parameter related to the CTC loss function and a preset second training parameter related to KL divergence, using the training result of the LSTM network to form a loss function for training the BLSTM network; in the process of training the BLSTM network, gradually increasing the value of the first training parameter while gradually reducing the value of the second training parameter; and, when the performance index output by the BLSTM network meets the preset requirements, stopping the training of the BLSTM network, so that the BLSTM network can be used to perform real-time text prediction on an input sound file. The processing solution of the present disclosure can improve the accuracy of the time stamps predicted by the acoustic network model.
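The weighting scheme described in the abstract can be sketched as follows. This is an illustrative sketch under stated assumptions, not the patent's implementation: the abstract does not give the form of the loss or of the schedule, so the weighted sum, the linear schedule, the start and end values, and the names `kl_divergence`, `schedule`, and `combined_loss` are all hypothetical. Here `alpha` stands for the first training parameter (CTC-related) and `beta` for the second (KL-related).

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions given as equal-length lists."""
    return sum(pi * math.log(pi / max(qi, eps)) for pi, qi in zip(p, q) if pi > 0)


def schedule(step, total_steps, alpha_start=0.1, alpha_end=1.0,
             beta_start=1.0, beta_end=0.0):
    """Assumed linear schedule: alpha (CTC weight) rises while beta (KL weight) falls."""
    t = min(max(step / total_steps, 0.0), 1.0)
    alpha = alpha_start + t * (alpha_end - alpha_start)
    beta = beta_start + t * (beta_end - beta_start)
    return alpha, beta


def combined_loss(ctc_loss, kl_loss, alpha, beta):
    """Assumed weighted sum of a CTC loss value and a KL divergence value."""
    return alpha * ctc_loss + beta * kl_loss


# Toy per-frame posteriors (made-up numbers): one from the already trained
# network, one from the network currently being trained.
reference_posterior = [0.7, 0.2, 0.1]
current_posterior = [0.5, 0.3, 0.2]
kl = kl_divergence(reference_posterior, current_posterior)

# The CTC loss value is a placeholder; in practice it comes from the network.
losses = [combined_loss(2.0, kl, *schedule(s, 4)) for s in range(5)]
```

Early in training the KL term dominates, pulling the outputs of the network being trained toward those of the trained network; as `alpha` grows and `beta` shrinks, the CTC term takes over.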

Description

Technical field

[0001] The present disclosure relates to the technical field of data processing, and in particular to an acoustic network model training method, device and electronic equipment.

Background

[0002] Speech processing, also known as voice signal processing or human voice processing, aims to produce a desired signal and then perform speech recognition on it, for use in mobile phone interfaces and even everyday life, so that people and computers can communicate.

[0003] In speech processing, the analog sound signal received by a microphone or other device is converted by an analog-to-digital conversion device, processed as data, and finally output through a digital-to-analog conversion device. The processing therefore operates on digital signals, and the speech signal is a discrete-time signal. The signal processing flow is as follows: Collect and sample signals: use a microphone or various radio devices to collect...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G10L15/06
CPC: G10L15/063, G10L2015/0631
Inventor: 张骏, 黄露
Owner: BEIJING YOUZHUJU NETWORK TECH CO LTD