Training method of recognition model, recognition method, electronic device, and storage medium

By processing the text data of wake words using TTS to generate speech samples and training a recognition model using CNN and GRU models, the problem of high cost and low efficiency in wake word detection in existing technologies is solved, achieving low-cost and high-efficiency wake word recognition.

CN114267342BActive Publication Date: 2026-06-12BEIJING BAIDU NETCOM SCI & TECH CO LTD

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
BEIJING BAIDU NETCOM SCI & TECH CO LTD
Filing Date
2021-12-21
Publication Date
2026-06-12

Smart Images

  • Figure CN114267342B_ABST
    Figure CN114267342B_ABST
Patent Text Reader

Abstract

The present disclosure provides a training method of a recognition model, a recognition method, an electronic device and a storage medium, relates to the field of artificial intelligence, and particularly relates to the technical field of speech recognition, deep learning and the like. The specific implementation scheme is as follows: a first speech sample containing a wake-up word, a second speech sample not containing the wake-up word, a first label of the first speech sample, and a second label of the second speech sample are obtained; wherein the first speech sample comprises speech data obtained by performing text-to-speech (TTS) processing on text data containing the wake-up word; a first acoustic feature of the first speech sample and a second acoustic feature of the second speech sample are obtained; and a recognition model is trained by using the first acoustic feature, the second acoustic feature, the first label and the second label. According to the present disclosure, the collection time and cost of sample data can be reduced.
Need to check novelty before this filing date? Find Prior Art