Keyword detection method and system based on unlabeled keyword data

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of keyword detection and detection method, applied in audio data retrieval, digital data information retrieval, metadata audio data retrieval, etc., can solve problems such as misclassification, affecting the accuracy of training models, large manpower, material resources and time.

Active Publication Date: 2021-08-20

北京快鱼电子股份公司

View PDF6 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] The training of the deep learning model requires sufficiently large labeled audio data, so the above-mentioned traditional data labeling method consumes a lot of manpower, material resources and time, and the quality of the audio data labeling directly affects the accuracy of the training model; Save the manpower and time cost required for labeling. At present, unsupervised big data audio classification methods are used, such as using spectral segments to classify data, that is, audio classification is performed from the perspective of spectral feature values. First, spectral feature data is extracted from audio data. According to Spectrum features find the optimal classification spectrum matrix, and finally classify the spectral data by frequency bands; although this method considers the characteristics of different frequency bands of audio to classify audio from the perspective of frequency domain, it does not make full use of the comprehensive characteristics of audio , especially time-domain features, there is a problem of low classification accuracy and easy to cause misclassification; at the same time, in the traditional method, the final classification model obtained by inputting the marked audio data set into the deep learning model for training has only one model, through a single The detection of keywords by the classification model is also prone to misclassification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0055] like figure 1 and figure 2 As shown, this embodiment discloses a keyword detection method based on unlabeled keyword data, including the following steps:

[0056] S100: Collect a large amount of unmarked audio data, add preset wake-up word audio and non-wake-up word audio to the unmarked audio data, and form a pre-processing audio library;

[0057] like image 3 As shown, add preset wake-up word audio and non-wake-up word audio to the unlabeled audio data. The types of wake-up word audio and non-wake-up word audio to be added are set according to the specific situation, which can be one type or multiple types ;Add N1 audio for each type of wake-up word, add N2 audio for non-awakening word, N1 and N2 are set according to the specific situation, for example, set N1 to 50~200, and set N2 to 0~100.

[0058] S200: Classify the audio data in the preprocessed audio library based on an unsupervised deep learning classification method;

[0059] Set the total number of unsup...

Embodiment 2

[0141] like Figure 5 As shown, the present invention provides a keyword detection system based on unlabeled keyword data, including a preprocessing module, a classification module, a feature extraction module, a model training module and a keyword detection module;

[0142] The preprocessing module is used to collect a large amount of unmarked audio data, and adds preset wake-up word audio and non-wake-up word audio to the unmarked audio data to form a pre-processing audio library;

[0143] The classification module is used to classify the audio data in the preprocessed audio library based on an unsupervised deep learning classification method;

[0144] The feature extraction module is used to extract features from the classified audio data to generate feature data;

[0145] The model training module is used to input feature data into different types of neural network models for training to obtain multiple different keyword detection models;

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a keyword detection method and system based on unlabeled keyword data, and the method comprises the steps: collecting a large amount of unlabeled audio data, adding preset wake-up word audio and non-wake-up word audio in the unlabeled audio data, and forming a preprocessing audio library; classifying the audio data in the preprocessed audio library based on an unsupervised deep learning classification method; extracting features from the classified audio data to generate feature data; inputting the feature data into different types of neural network models for training to obtain a plurality of different keyword detection models; and detecting the to-be-predicted audio based on the plurality of different keyword detection models to obtain a final detection result. According to the method, comprehensive comparison classification training is carried out on the audio data, and the classification accuracy is higher.

Description

technical field [0001] The invention relates to the technical field of big data speech classification methods, in particular to a keyword detection method and system based on unmarked keyword data. Background technique [0002] Keyword wake-up technology is relatively common in daily life. For example, smart speaker devices such as Tmall Genie, when people call out the wake-up word to the device, the device can wake up and then interact with people; the realization of the keyword wake-up function requires the wake-up system The device has the characteristics of low power consumption, low false alarm rate, high accuracy rate, and low false negative rate. In order to achieve this goal, a model based on deep learning is usually used. The training of the model in the traditional method requires a large number of labeled key word data to achieve the desired effect; most of the keyword data collected through smart speakers or microphones are unlabeled data. After accumulating a la...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06F16/65G06F16/683G06F16/68G06K9/62G06N3/04G06N3/08

CPCG06F16/65G06F16/683G06F16/686G06N3/084G06N3/088G06N3/045G06F18/214

Inventor 阮晓辉魏庆凯

Owner 北京快鱼电子股份公司

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Keyword detection method and system based on unlabeled keyword data

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology