Auditory selection method and device based on memory and attention model

Attention-model and memory technology, applied to neural learning methods, biological neural network models, speech analysis, and related fields, that addresses the problems of an uncertain number of aliased speakers and fixed memory-unit dimensions.

Active Publication Date: 2021-07-06
INST OF AUTOMATION CHINESE ACAD OF SCI
Cites: 15; Cited by: 0

AI Technical Summary

Problems solved by technology

[0005] To solve the above problems in the prior art, namely the arrangement (permutation) of supervised labels, the uncertain number of aliased speakers, and the fixed dimension of the memory unit, one aspect of the present invention provides an auditory selection method based on a memory and attention model, including:



Embodiment Construction

[0053] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some of the embodiments of the present invention, not all of them. All other embodiments obtained by persons of ordinary skill in the art on the basis of these embodiments, without creative effort, fall within the protection scope of the present invention.

[0054] Preferred embodiments of the present invention are described below with reference to the accompanying drawings. Those skilled in the art should understand that these embodiments are only used to explain the technical principles of the present invention, and are not intended to limit the protection scope of the present invention.

[005...


Abstract

The invention belongs to the technical field of speech separation, and in particular relates to an auditory selection method and device based on a memory and attention model. It aims to solve three problems in the prior art: the arrangement (permutation) of supervised labels, the uncertain number of aliased speakers, and the fixed dimension of memory units. The method encodes the original speech signal into a time-frequency matrix, encodes and transforms that matrix into speech vectors, uses a long-term memory unit to store speakers and their corresponding speech vectors, obtains the speech vector of the target speaker, and separates the target speech from the original speech signal through an attention selection model. The method can separate the target speech from the original speech signal without fixing or specifying the number of speakers.
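The steps named in the abstract can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not the patented implementation: a fixed random projection (`weight`, `bias`) stands in for the learned encoder, a sigmoid of a dot product stands in for the attention selection model, and every name here (`to_time_frequency`, `embed_bins`, `SpeakerMemory`, `attend`) is hypothetical.

```python
import numpy as np


def to_time_frequency(signal, frame_len=256, hop=128):
    """Encode a 1-D waveform as a (frames, frequency bins) magnitude matrix."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack([signal[i * hop:i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.abs(np.fft.rfft(frames, axis=1))


def embed_bins(tf_matrix, weight, bias):
    """Map every time-frequency bin to a D-dimensional vector.

    Assumption: a fixed random projection replaces the trained encoder.
    """
    return np.tanh(np.log1p(tf_matrix)[..., None] * weight + bias)


class SpeakerMemory:
    """Long-term memory keyed by speaker; it grows as new speakers appear,
    so the number of speakers is never fixed in advance."""

    def __init__(self):
        self._slots = {}

    def write(self, speaker, vector, rate=0.5):
        vector = vector / (np.linalg.norm(vector) + 1e-8)
        if speaker in self._slots:
            # Blend new evidence into the stored vector, then renormalize.
            mixed = (1.0 - rate) * self._slots[speaker] + rate * vector
            vector = mixed / (np.linalg.norm(mixed) + 1e-8)
        self._slots[speaker] = vector

    def read(self, speaker):
        return self._slots[speaker]

    def __len__(self):
        return len(self._slots)


def attend(mixture_tf, speaker_vector, weight, bias):
    """Score each bin against the target speaker vector and apply a soft mask."""
    scores = embed_bins(mixture_tf, weight, bias) @ speaker_vector
    mask = 1.0 / (1.0 + np.exp(-scores))
    return mask * mixture_tf, mask


rng = np.random.default_rng(0)
dim = 8
weight, bias = rng.normal(size=dim), rng.normal(size=dim)

# Two synthetic "speakers" (pure tones) stand in for real speech.
t = np.arange(4000) / 8000.0
voice_a = np.sin(2 * np.pi * 220 * t)
voice_b = np.sin(2 * np.pi * 1300 * t)

memory = SpeakerMemory()
for name, clip in (("speaker_a", voice_a), ("speaker_b", voice_b)):
    clip_vectors = embed_bins(to_time_frequency(clip), weight, bias)
    memory.write(name, clip_vectors.mean(axis=(0, 1)))

mixture_tf = to_time_frequency(voice_a + voice_b)
target_tf, mask = attend(mixture_tf, memory.read("speaker_a"), weight, bias)
```

Because the memory is a dictionary rather than a fixed-size array, registering a third speaker only adds one more slot; nothing else in the sketch depends on the speaker count. With untrained weights the mask is not expected to isolate the target; the sketch shows only the data flow.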

Description

Technical field

[0001] The invention belongs to the technical field of speech separation, and in particular relates to an auditory selection method and device based on a memory and attention model.

Background

[0002] In recent years, with the rapid development of electronic equipment and artificial intelligence, human-computer voice interaction has become an increasingly important part of the field of artificial intelligence and is widely used in real life. In human-computer voice interaction, the machine recognizes and analyzes a voice signal to extract its semantic feature information, compares that information with the semantic features in a standard information base, and outputs the corresponding text or converts it into the desired output. However, in practical applications, there is a great deal of interference in the real environment, and the process of machine recognition, analysis and extraction of semantic fea...


Application Information

Patent Type & Authority: Patents (China)
IPC (8): G10L15/22, G10L19/00, G10L21/0208, G10L21/0272, G10L25/30
CPC: G10L15/22, G10L19/0017, G10L21/0208, G10L21/0272, G10L25/30, G10L2021/02087, G06N3/08, G06N3/044, G06F17/16, G06N3/049
Inventor: 许家铭, 石晶, 徐波
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI