Supercharge Your Innovation With Domain-Expert AI Agents!

Speech recognition method and device thereof and storage medium

A technology of speech recognition and recognition results, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of reduced recognition rate, difficult to use directly, limited effect improvement, etc., so as to improve the recognition accuracy of hot words and improve the stimulation effect of hot words. , Improve the effect of hot word recognition

Active Publication Date: 2021-05-07
UNIV OF SCI & TECH OF CHINA +1
View PDF8 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] In the field of speech recognition, due to the low score of low-frequency words output by the end-to-end model, the effect of the traditional hot word score incentive method is limited.
Google’s CLAS (Contextual Listen, Attend and Spell, CLAS) encourages hot words from the model level, and has achieved good results, but the method is too simple, and it is easy to misidentify sentences that do not contain hot words as hot words. words, leading to a decline in the overall recognition rate, and it is difficult to use them directly in the actual system. Therefore, the problem of how to improve the accuracy of hot word recognition needs to be solved urgently

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition method and device thereof and storage medium
  • Speech recognition method and device thereof and storage medium
  • Speech recognition method and device thereof and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In order to enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiment of the application will be clearly and completely described below in conjunction with the accompanying drawings in the embodiment of the application. Obviously, the described embodiment is only It is a part of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of this application.

[0031] The terms "first", "second" and the like in the specification and claims of the present application and the above drawings are used to distinguish different objects, rather than to describe a specific order. Furthermore, the terms "include" and "have", as well as any variations thereof, are intended to cover a non-exclusive inclusion. For example, a process, method...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a voice recognition method and a device thereof and a storage medium, and the method comprises the steps: carrying out the coding of to-be-recognized voice data, and obtaining a first feature vector sequence; encoding each hot word in a preset hot word library to obtain a second feature vector sequence; encoding the audio clip of each hot word in the preset hot word library to obtain a third feature vector sequence; performing first attention operation on the first feature vector sequence and the third feature vector sequence to obtain a fourth feature vector sequence; and performing decoding operation according to the second feature vector sequence, the third feature vector sequence and the fourth feature vector sequence to obtain an identification result. By adopting the embodiment of the invention, the hot word recognition precision can be improved.

Description

technical field [0001] The present application relates to the technical field of speech recognition, in particular to a speech recognition method, device and storage medium. Background technique [0002] In the field of speech recognition, due to the low score of low-frequency words output by the end-to-end model, the effect of the traditional hot word score incentive method is limited. Google’s CLAS (Contextual Listen, Attend and Spell, CLAS) encourages hot words from the model level, and has achieved good results, but the method is too simple, and it is easy to misidentify sentences that do not contain hot words as hot words. Words lead to a decline in the overall recognition rate, and it is difficult to use them directly in the actual system. Therefore, the problem of how to improve the accuracy of hot word recognition needs to be solved urgently. Contents of the invention [0003] Embodiments of the present application provide a voice recognition method, device, and s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/02G10L15/26G10L25/24G10L15/16
CPCG10L15/02G10L15/26G10L25/24G10L15/16G10L2015/088
Inventor 方昕吴明辉马志强刘俊华
Owner UNIV OF SCI & TECH OF CHINA
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More