Method and Network for Time-Frequency Channel Attention Weight Calculation and Vectorization

A technology of weight calculation and attention, applied in speech analysis, instruments, etc., can solve the problem of unable to capture healthy individuals and depressed patients

Active Publication Date: 2021-05-25
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Many current methods treat all frequencies and time periods of raw speech indiscriminately to predict levels and fail to target discriminatively relevant frequency bands and time periods to capture differential cues between healthy individuals and depressed patients

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and Network for Time-Frequency Channel Attention Weight Calculation and Vectorization
  • Method and Network for Time-Frequency Channel Attention Weight Calculation and Vectorization
  • Method and Network for Time-Frequency Channel Attention Weight Calculation and Vectorization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0086] In order to make the purposes, technical solutions and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments of the present application will be clearly and completely described below in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments It is a part of the embodiments of this application, but not all of them. Based on the embodiments in the present application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present application.

[0087] see figure 1 , the time-frequency channel attention weight calculation and vectorization method provided by the embodiment of the present application, including:

[0088] S10: Collect a voice file, the voice file includes a long-term voice, and extract a logarithmic Fourier amplitude spectrum of the long-time voice.

[0089] In...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

This application relates to the method and network of time-frequency channel attention weight calculation and vectorization, including: using spherical embedding normalization to preprocess the logarithmic Fourier magnitude spectrum; segmenting the logarithmic Fourier magnitude spectrum preprocessing Process data into short-term spectral segments to generate multi-channel tensors; use the attention mechanism to calculate the attention weight of each channel in the time direction and frequency direction; obtain time-frequency attention by matrix multiplication of the attention weights in the time direction and frequency direction Weight coefficient tensor; use the attention mechanism to calculate the attention weight tensor of each channel in the time-frequency attention weight coefficient tensor; use the attention weight tensor of each channel to obtain vectorization in the time and frequency directions through one-dimensional convolution Results; concatenate the vectorized results of the time direction and frequency direction of at least one channel, and use one-dimensional convolution to obtain the channel vectorized results; calculate the average value of the channel vectorized results of the short-term spectrum segment and use it as the entire long-term The results of the time-to-speech correspondence.

Description

technical field [0001] This application relates to the field of artificial intelligence, in particular to a method and network for time-frequency channel attention weight calculation and vectorization. Background technique [0002] The purpose of automatic depression detection is to explore the changes in the speech of healthy individuals and depressed patients, and propose corresponding models and methods to establish the ability of the machine to capture clues of depression, enhance its diagnostic ability, and improve diagnostic efficiency. Automatic depression detection is an important research direction in the field of human-computer interaction and artificial intelligence, involving many fields such as intelligence science, mathematics, psychology, and physiological science. [0003] Physiological and psychological studies have shown that different frequency segments and time segments of speech have different effects on depression detection. In other words, it is neces...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G10L25/63G10L25/18
CPCG10L25/18G10L25/63
Inventor 陶建华牛明月刘斌李永伟
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products