Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Voice keyword recognition method based on end-to-end, device thereof and equipment

A recognition method and keyword technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of mismatched recognition scenarios, low processing efficiency, high false alarms, etc., to improve recognition effects, strengthen utilization capabilities, reduce The effect of false alarms

Pending Publication Date: 2020-07-17
合肥讯飞数码科技有限公司
View PDF10 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the current solution for speech keyword recognition relies on the processing effects of many independent tasks, which have a large mutual influence and are easy to magnify and superimpose errors. From feature selection to recognition modeling, the simple phoneme classification idea and real speech The keyword recognition scene does not match, and can only rely on pronunciation similarity for target recognition
Due to the above defects, the current technical means for speech keyword recognition have problems such as low processing efficiency, poor overall performance, too many false alarms and unsatisfactory recognition results.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice keyword recognition method based on end-to-end, device thereof and equipment
  • Voice keyword recognition method based on end-to-end, device thereof and equipment
  • Voice keyword recognition method based on end-to-end, device thereof and equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0118] Based on the above embodiments and their preferred solutions, those skilled in the art can understand that, in actual operation, the present invention is applicable to various implementation modes, and the present invention uses the following carrier as a schematic illustration:

[0119] (1) A device based on end-to-end voice keyword recognition, which may include:

[0120] one or more processors, memory, and one or more computer programs, wherein the one or more computer programs are stored in the memory, the one or more computer programs include instructions, when the instructions are When the device described above is executed, the device is made to perform the steps / functions of the foregoing embodiments or equivalent implementation manners.

[0121] Figure 5 It is a schematic structural diagram of an embodiment of an end-to-end speech keyword recognition device based on the present invention, wherein the device may be an electronic device or a circuit device buil...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a voice keyword recognition method based on end-to-end, a device thereof and equipment. The concept of the invention is to combine with an end-to-end thought; according to themethod, a pre-built keyword recognition network is directly fitted from the features to the target, so that the recognition process is simpler and more efficient, the superposition effect of adverse effects can be avoided, meanwhile, the keyword recognition network is easier to achieve global optimization, the development cost can be effectively reduced, and therefore, the method has a higher practical value in an actual business scene. According to the invention, an acquisition strategy of identification features is improved; therefore, the pronunciation characteristics adapting to the business scene can be fully represented; therefore, more potential key information can be captured; in addition, the keyword recognition network architecture provided by the invention can utilize the context information from the acoustic perspective, so that the defect that the existing scheme only performs recognition through an isolated pronunciation sample is fundamentally overcome, and the processing effect of locking the keywords from the audio is further obviously improved.

Description

technical field [0001] The present invention relates to the field of speech processing, in particular to an end-to-end speech keyword recognition method, device and equipment. Background technique [0002] The speech key word recognition (Keyword Spotting) that the present invention pays close attention to specifically refers to the technology of identifying and judging the content and position of a specific keyword in a given speech through pronunciation similarity under the condition of no corresponding language speech recognizer, That is to say, the technology of transcribing speech into text by a speech recognizer and then retrieving keywords from the text is not a concept. Speech keyword recognition technology is a technology between continuous speech recognition and isolated word recognition. In a given continuous, unrestricted natural speech stream, it can be recognized from the acoustic level whether a speech contains a given word or not. keywords, and give the star...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/02G10L15/06G10L15/08G10L15/183G10L15/26
CPCG10L15/02G10L15/08G10L15/183G10L15/26G10L15/063G10L15/083G10L2015/088
Inventor 周振昆方磊吴明辉杨帆夏静雯
Owner 合肥讯飞数码科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products