End-to-end voice awakening method and device and computer equipment

Voice wake-up technology, applied in speech analysis, voice recognition, instruments, etc. It addresses the problem of false wake-up, reducing the false wake-up rate and improving the wake-up effect.

Pending Publication Date: 2020-02-11
BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD

AI Technical Summary

Problems solved by technology

However, the existing method can cause false wake-ups. For example, the device also wakes up when the user says "gou you search".

Detailed Description of Embodiments

[0054] In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.

[0055] In view of the false wake-up problem in existing voice wake-up schemes, the embodiment of the present invention provides an end-to-end voice wake-up method and device. Using the probability of each target pronunciation unit output by the acoustic model, the method calculates, frame by frame, the cumulative probability of each target pronunciation unit together with its start position and end position. It then calculates the length and average probability of each target pronunciation unit from the cumulative probability, start position, and end position, and uses the length and average probability of each target pronunciation unit to perform voice wake-up, thereby avoiding wake-up by m...
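As a rough illustration of the frame-by-frame bookkeeping described in [0055], the following Python sketch accumulates a per-unit probability and records start and end frames, then derives length and average probability. The rule used here to find the boundaries (a segment is a run of consecutive frames in which the same unit has the highest probability) is an assumption made for illustration; the visible text does not state how the positions are determined. The names `UnitSegment` and `accumulate_segments` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class UnitSegment:
    unit: int        # index of the pronunciation unit
    start: int       # first frame of the segment (inclusive)
    end: int         # last frame of the segment (inclusive)
    cum_prob: float  # sum of the unit's per-frame probabilities

    @property
    def length(self) -> int:
        # Number of frames covered by the segment.
        return self.end - self.start + 1

    @property
    def avg_prob(self) -> float:
        # Cumulative probability divided by segment length.
        return self.cum_prob / self.length


def accumulate_segments(posteriors):
    """Scan frames in order and group consecutive frames by top-scoring unit.

    posteriors: list of frames, each a list of per-unit probabilities.
    Assumed boundary rule: a segment ends when the top-scoring unit changes.
    """
    segments = []
    current = None
    for t, frame in enumerate(posteriors):
        best = max(range(len(frame)), key=frame.__getitem__)
        if current is not None and current.unit == best:
            # Same unit as the previous frame: extend the segment.
            current.end = t
            current.cum_prob += frame[best]
        else:
            # Unit changed: close the old segment and open a new one.
            if current is not None:
                segments.append(current)
            current = UnitSegment(best, t, t, frame[best])
    if current is not None:
        segments.append(current)
    return segments
```

For instance, calling `accumulate_segments` on three frames where unit 0 wins the first two and unit 1 wins the third yields two segments, the first with length 2 and the second with length 1.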

Abstract

The invention discloses an end-to-end voice wake-up method, device, and computer equipment. The method comprises the following steps: receiving the voice to be detected; sequentially extracting the acoustic features of each voice frame; inputting the extracted acoustic features into an acoustic model to obtain the probability of each target pronunciation unit in each voice frame, as output by the acoustic model; calculating the cumulative probability of each target pronunciation unit frame by frame, and determining the start position and end position of each target pronunciation unit; calculating the length and average probability of each target pronunciation unit from the cumulative probability, the start position, and the end position; and performing a wake-up operation according to the length and/or average probability of each target pronunciation unit. The invention can reduce the false wake-up rate and improve the wake-up effect.
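The final step of the abstract, deciding whether to wake based on the length and/or average probability of each target pronunciation unit, might look something like the Python sketch below. The abstract does not say how these quantities are tested, so the threshold check, the threshold values, and the function name `should_wake` are all assumptions for illustration.

```python
def should_wake(segments, target_units,
                min_len=3, max_len=50, min_avg_prob=0.6):
    """Decide whether to wake from per-unit length and average probability.

    segments: list of (unit, length, avg_prob) tuples in time order.
    target_units: the wake word as an ordered list of unit indices.
    min_len, max_len, min_avg_prob: hypothetical thresholds, not from the source.
    """
    idx = 0  # position of the next target unit we are waiting for
    for unit, length, avg_prob in segments:
        if idx == len(target_units):
            break
        if unit != target_units[idx]:
            continue
        # Accept the unit only if its duration and confidence are plausible.
        if min_len <= length <= max_len and avg_prob >= min_avg_prob:
            idx += 1
    # Wake only if every target unit was accepted, in order.
    return idx == len(target_units)
```

Under these assumed thresholds, a candidate whose units all appear in order is still rejected if one unit lasts a single frame or has a low average probability, which is the kind of case that could otherwise produce a false wake-up. This sketch matches target units as an in-order subsequence and does not require them to be adjacent; a stricter matcher would be a reasonable variation.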

Description

Technical field

[0001] The invention relates to the technical field of voice wake-up, and in particular to an end-to-end voice wake-up method, device, and computer equipment.

Background

[0002] Voice wake-up refers to the real-time detection of speaker-specific segments in a continuous speech stream, with the purpose of switching the device from a dormant state to an active state. Voice wake-up has a wide range of applications, such as robots, speakers, and cars. The main indicators for evaluating the voice wake-up effect are the wake-up rate and the false wake-up rate, and the performance of the decoder in the speech recognition process plays a key role in both.

[0003] For end-to-end voice wake-up, each acoustic model in the decoder corresponds to a pronunciation unit, which can be a word, character, or syllable. The acoustic model is mainly used to calculate the likelihood between the speech features and each pronunciation template; the input is the s...

Application Information

IPC(8): G10L15/26, G10L15/22, G10L15/06, G10L17/00
CPC: G10L15/26, G10L15/06, G10L15/22, G10L17/00, Y02D30/70
Inventor: 姚光超
Owner: BEIJING SOGOU TECHNOLOGY DEVELOPMENT CO LTD