Unlock instant, AI-driven research and patent intelligence for your innovation.

Chinese and English mixed offline voice keyword recognition method in specific scene and system implementation thereof

A keyword, Chinese and English technology, applied in the field of offline voice keyword recognition method and its system, can solve the problem of poor recognition effect of non-specific people, and achieve the effect of improving the rejection rate, low calculation amount, and high recognition accuracy

Pending Publication Date: 2022-05-24
BEIHANG UNIV
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Voice keyword recognition based on embedding learning is to train a neural network feature extractor (for example: LSTM feature extractor), convert the voice to be recognized and the keyword template voice into feature vectors of the same length through the feature extractor, and then calculate Vector distance, this implementation method has a high recognition accuracy rate when identifying a single keyword, so it is widely used in the field of smart device wake-up, but as the number of keywords to be identified increases, the recognition accuracy rate will drop sharply. Although it is only necessary to collect a small amount of keyword template corpus to replace the keywords to be recognized, this method also brings about the problem of poor recognition of non-specific people.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese and English mixed offline voice keyword recognition method in specific scene and system implementation thereof
  • Chinese and English mixed offline voice keyword recognition method in specific scene and system implementation thereof
  • Chinese and English mixed offline voice keyword recognition method in specific scene and system implementation thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While certain embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided for a thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are only for exemplary purposes, and are not intended to limit the protection scope of the present disclosure.

[0026] In addition, it should be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings. The embodiments of this disclosure and features of the embodiments may be combined with each other without conflict.

[0027] It should be noted that concepts ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a Chinese and English mixed offline voice keyword recognition method and system. A specific embodiment of the method comprises the following steps: acquiring a voice digital signal, and performing voice activity detection on the voice digital signal to obtain a voice segment to be recognized; defining a self-adaptive keyword matching window, and segmenting a voice segment to be recognized; performing feature extraction on the voice segment to obtain a Mel-frequency cepstrum coefficient embedded feature vector; analyzing the user-defined keyword list, and obtaining a Chinese decoding network space and an English decoding network space in combination with a pre-trained phoneme whitening model; sequentially inputting the Mel-frequency cepstrum coefficient embedded feature vectors into the decoding network space to obtain an identification result; and performing post-processing on the identification result to generate a target identification result. According to the embodiment, the calculation amount is low, offline recognition can be achieved, the recognition accuracy is high, the response speed is high, Chinese and English mixed recognition is supported, and the keyword list can be flexibly replaced so that the method can adapt to application in different scenes.

Description

technical field [0001] The embodiments of the present disclosure relate to the technical field of speech recognition, and in particular, to a method and a system for offline speech keyword recognition in Chinese and English. Background technique [0002] Speech keyword recognition technology is a research branch in the field of automatic speech recognition. Automatic speech recognition technology requires complete decoding and conversion of the collected continuous speech streams, and requires higher hardware computing capabilities. Online recognition of data uploading to cloud computing is often used. When the network is offline, the recognition effect drops sharply, and online recognition also has the risk of privacy leakage when data is uploaded to the cloud. Speech keyword recognition only pays attention to the keyword part, which is less dependent on hardware. It can complete the decoding calculation locally, so as to realize offline recognition. It has broad applicatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/00G10L15/02G10L15/08G10L15/14G10L15/183G10L19/00G10L25/24
CPCG10L15/142G10L15/183G10L15/02G10L15/005G10L15/08G10L25/24G10L2015/025G10L2015/088
Inventor 陆峰薛玉敏
Owner BEIHANG UNIV