Voice recognition method and device and computer readable storage medium

A speech recognition method and speech recognition model technology, applied in speech recognition, speech analysis, instruments, etc. It addresses two problems: computer equipment often cannot run multiple speech recognition models in parallel, and running them serially slows speech recognition. It achieves the effect of improving recognition efficiency and accuracy.

Pending Publication Date: 2021-07-23
GUANGZHOU YUNCONG INFORMATION TECH CO LTD

AI Technical Summary

Problems solved by technology

In practical applications, due to limitations of equipment configuration and cost, computer equipment equipped with speech recognition models often cannot run multiple speech recognition models in parallel.
If the models are instead run serially, one after another, and the final recognition result is then determined by a comprehensive analysis of the individual recognition results, the speed of speech recognition is greatly reduced.

Embodiment Construction

[0069] Some embodiments of the present invention are described below with reference to the accompanying drawings. Those skilled in the art should understand that these embodiments are only used to explain the technical principles of the present invention, and are not intended to limit the protection scope of the present invention.

[0070] In the description of the present invention, "module" and "processor" may include hardware, software or a combination of both. A module may include hardware circuits, various suitable sensors, communication ports, memory, and may also include software parts, such as program codes, or a combination of software and hardware. The processor may be a central processing unit, a microprocessor, an image processor, a digital signal processor or any other suitable processor. The processor has data and/or signal processing functions. The processor can be implemented in software, hardware or a combination of both. The non-transitory computer readabl...

Abstract

The invention relates to the technical field of speech processing. It provides a speech recognition method, a device, and a computer-readable storage medium, and aims to solve the technical problem of how to perform speech recognition accurately and efficiently. In the method provided by the embodiments of the invention, a knowledge distillation algorithm is used so that a plurality of trained first speech recognition models guide the training of a second speech recognition model. After training, the second model's phoneme recognition and decoding capabilities on input speech are close to those of the plurality of first models. As a result, only the single second speech recognition model needs to run on the computer equipment to achieve the recognition effect of running the multiple first models in parallel, which significantly improves speech recognition efficiency and accuracy.
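
The distillation idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how the first (teacher) models' outputs are combined or which loss is used, so everything here is an assumption: the teachers' temperature-softened phoneme distributions are averaged, the second (student) model is scored with a KL divergence against that average, and the names `softmax`, `average_teacher_distribution`, and `distillation_loss` as well as the example logits are hypothetical. The sketch covers one frame-level phoneme term only and says nothing about decoding.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def average_teacher_distribution(teacher_logits, temperature):
    """Average the softened distributions of several teachers.

    Averaging is an assumed combination rule, not one stated in the source.
    """
    dists = [softmax(logits, temperature) for logits in teacher_logits]
    count = len(dists)
    return [sum(d[i] for d in dists) / count for i in range(len(dists[0]))]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(averaged teachers || student) for one frame, scaled by T^2."""
    target = average_teacher_distribution(teacher_logits, temperature)
    student = softmax(student_logits, temperature)
    kl = sum(p * math.log(p / q) for p, q in zip(target, student) if p > 0)
    return kl * temperature ** 2


# Hypothetical logits over three phoneme classes for a single speech frame.
teachers = [[2.0, 0.5, -1.0], [1.5, 1.0, -0.5]]
close_student = [1.8, 0.7, -0.8]   # roughly agrees with the teachers
far_student = [-1.0, 0.5, 2.0]     # ranks the classes in the opposite order

loss_close = distillation_loss(close_student, teachers)
loss_far = distillation_loss(far_student, teachers)
print(loss_close < loss_far)
```

In this sketch, a student whose phoneme posteriors track the teachers' average receives a smaller loss than one that disagrees, which is the sense in which a training signal of this kind would pull the single second model toward the behavior of the multiple first models.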

Description

Technical field

[0001] The present invention relates to the technical field of speech processing, in particular to a speech recognition method, device and computer-readable storage medium.

Background

[0002] Speech recognition refers to the semantic analysis of speech signals to obtain the text information they contain, for example converting speech signals into Chinese text. Conventional speech recognition methods mainly use training samples to train a speech recognition model so that the trained model acquires speech recognition capability and can then be used to recognize the speech to be recognized. At present, in addition to using a single speech recognition model for speech recognition, multiple speech recognition models can also be used for speech recognition at the same time, and then comprehensively analyze each speech recognition result to de...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/06, G10L15/02, G10L15/197
CPC: G10L15/063, G10L15/02, G10L15/197, G10L2015/025
Inventor: 王金超
Owner: GUANGZHOU YUNCONG INFORMATION TECH CO LTD