Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A keyword detection method and system based on unlabeled keyword data

A keyword detection and unlabeled technology, which is applied in audio data retrieval, digital data information retrieval, metadata audio data retrieval, etc., can solve the problems affecting the accuracy of the training model, large manpower, material resources and time, and low classification accuracy and other issues to achieve the effects of saving manpower and time costs, low false negative rate, and high accuracy

Active Publication Date: 2021-11-05
北京快鱼电子股份公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The training of the deep learning model requires sufficiently large labeled audio data, so the above-mentioned traditional data labeling method consumes a lot of manpower, material resources and time, and the quality of the audio data labeling directly affects the accuracy of the training model; Save the manpower and time cost required for labeling. At present, unsupervised big data audio classification methods are used, such as using spectral segments to classify data, that is, audio classification is performed from the perspective of spectral feature values. First, spectral feature data is extracted from audio data. According to Spectrum features find the optimal classification spectrum matrix, and finally classify the spectral data by frequency bands; although this method considers the characteristics of different frequency bands of audio to classify audio from the perspective of frequency domain, it does not make full use of the comprehensive characteristics of audio , especially time-domain features, there is a problem of low classification accuracy and easy to cause misclassification; at the same time, in the traditional method, the final classification model obtained by inputting the marked audio data set into the deep learning model for training has only one model, through a single The detection of keywords by the classification model is also prone to misclassification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A keyword detection method and system based on unlabeled keyword data
  • A keyword detection method and system based on unlabeled keyword data
  • A keyword detection method and system based on unlabeled keyword data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0055] Such as figure 1 and figure 2 As shown, this embodiment discloses a keyword detection method based on unlabeled keyword data, including the following steps:

[0056] S100: Collect a large amount of unmarked audio data, add preset wake-up word audio and non-wake-up word audio to the unmarked audio data, and form a pre-processing audio library;

[0057] Such as image 3 As shown, add preset wake-up word audio and non-wake-up word audio to the unlabeled audio data. The types of wake-up word audio and non-wake-up word audio to be added are set according to the specific situation, which can be one type or multiple types ;Add N1 audio for each type of wake-up word, add N2 audio for non-awakening word, N1 and N2 are set according to the specific situation, for example, set N1 to 50~200, and set N2 to 0~100.

[0058] S200: Classify the audio data in the preprocessed audio library based on an unsupervised deep learning classification method;

[0059] Set the total number of...

Embodiment 2

[0141] Such as Figure 5 As shown, the present invention provides a keyword detection system based on unlabeled keyword data, including a preprocessing module, a classification module, a feature extraction module, a model training module and a keyword detection module;

[0142] The preprocessing module is used to collect a large amount of unmarked audio data, and adds preset wake-up word audio and non-wake-up word audio to the unmarked audio data to form a pre-processing audio library;

[0143] The classification module is used to classify the audio data in the preprocessed audio library based on an unsupervised deep learning classification method;

[0144] The feature extraction module is used to extract features from the classified audio data to generate feature data;

[0145] The model training module is used to input feature data into different types of neural network models for training to obtain multiple different keyword detection models;

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a keyword detection method and system based on unmarked keyword data, which includes collecting a large amount of unmarked audio data, adding preset wake-up word audio and non-wake-up word audio to the unmarked audio data to form a preset Process the audio library; classify the audio data in the pre-processed audio library based on the unsupervised deep learning classification method; extract features from the classified audio data to generate feature data; input the feature data into different types of neural network models for further processing Training to obtain multiple different keyword detection models; based on multiple different keyword detection models, the audio to be predicted is detected to obtain the final detection result; this method performs comprehensive comparison and classification training on audio data, and the classification accuracy is higher.

Description

technical field [0001] The invention relates to the technical field of big data speech classification methods, in particular to a keyword detection method and system based on unmarked keyword data. Background technique [0002] Keyword wake-up technology is relatively common in daily life. For example, smart speaker devices such as Tmall Genie, when people call out the wake-up word to the device, the device can wake up and then interact with people; the realization of the keyword wake-up function requires the wake-up system The device has the characteristics of low power consumption, low false alarm rate, high accuracy rate, and low false negative rate. In order to achieve this goal, a model based on deep learning is usually used. The training of the model in the traditional method requires a large number of labeled key word data to achieve the desired effect; most of the keyword data collected through smart speakers or microphones are unlabeled data. After accumulating a la...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/65G06F16/683G06F16/68G06K9/62G06N3/04G06N3/08
CPCG06F16/65G06F16/683G06F16/686G06N3/084G06N3/088G06N3/045G06F18/214
Inventor 阮晓辉魏庆凯
Owner 北京快鱼电子股份公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products