Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

56 results about "Attentional network" patented technology

The Attentional Network theory proposes three independent cognitive concepts: physiological state, and prepares the organism for fast reactions. Orienting involves selective allocation of attention to a source of signals in space.

Multi-label eye fundus image recognition method based on GACNN

The invention relates to the technical field of image processing, in particular to a GACNN-based multi-label eye fundus image recognition method, which comprises the following steps of: acquiring an original eye fundus image and preprocessing the original eye fundus image; constructing a GACNN model, using the preprocessed original fundus image with the label to perform training, wherein the GACNN model comprises a convolutional neural network, a graph attention network and a fusion layer, and the convolutional neural network is used for extracting image features, the graph attention network is used for modeling a relationship among fundus multiple labels, regarding each label of a fundus image as a group of mutually dependent nodes, and training by using historical data to obtain a multi-label classifier, and the fusion layer fuses the features obtained by the convolutional neural network and the graph attention network to obtain a final classification result; and inputting a to-be-detected original fundus image into the trained GACNN model, and outputting an identification result with a label; According to the method, the correlation between the tags is fully considered when the multiple tags in the fundus image are identified, and the identification accuracy of the fundus image is improved.
Owner:CHONGQING UNIV OF POSTS & TELECOMM

Method for solving video question-answer problems by using multi-granularity convolutional network self-attention context network mechanism

The invention discloses a method for solving video question-answer problems by using a multi-granularity convolutional network self-attention context network mechanism. The method mainly comprises thefollowing steps that: 1) for a group of videos, frame-level and segment-level video expressions are obtained by using a pre-trained VGG network and a 3D-Conv network; 2) for question word embedding and answer word embedding of dialogue histories and new questions, the multi-granularity convolution self-attention network mechanism and a sentence-level context attention mechanism are adopted to obtain joint expressions related to the questions; and 3) a question-level time attention mechanism and a fused attention network mechanism are adopted to obtain joint video expressions related to the questions and generate answers to the questions asked about the videos. Compared with a common video question-answer solution, the method is advantageous in that the multi-granularity convolutional self-attention network is utilized, so that visible information and dialogue historical information can be combined to generate answers which better meet requirements. The method achieves a better effectin the video question-answer problems compared with a traditional method.
Owner:ZHEJIANG UNIV

Graph convolutional neural network action recognition method based on attention mechanism

The invention discloses a graph convolutional neural network action recognition method based on an attention mechanism, and relates to the field of human-computer interaction action recognition. The method comprises the steps of labelling N attention joints with the highest motion completion participation degree by a residual attention network, N can be 16, and setting other numerical values according to the actual situation; constructing a three-dimensional skeleton space-time diagram, and carrying out space-time feature coding on the attention joints; and learning the three-dimensional skeleton space-time graph through a graph convolutional neural network GCN to perform action recognition. The joints with high participation degree for completing the specific action are selected based on the residual attention network, so that the information processing redundancy can be reduced, and joint information which does not contribute to action recognition is abandoned; based on space-time constraints between the joints, space-time feature codes about the attention joints are constructed to more effectively represent space-time features of the attention joints; based on human body space structure natural graph representation, a graph convolutional neural network is utilized to obtain depth representation about a three-dimensional skeleton space-time graph so as to effectively recognize actions.
Owner:ZHEJIANG SCI-TECH UNIV

Dual-channel semantic positioning multi-granularity attention mutual enhancement video question answering method and system

The invention discloses a dual-channel semantic positioning multi-granularity attention mutual enhancement video question answering method and system. The method comprises the following steps: constructing a feature extraction coding module to extract features; constructing a coarse and fine granularity global retrieval module to perform attention calculation on the features; constructing a graph attention network module to enhance key information between the features; constructing a local positioning module to position key feature information of different channels; and constructing a fusion attention module to perform multi-level fusion on the enhanced features, and finally obtaining answer index information through an answer prediction module. According to the method, feature information of different granularities in a video is defined as visual semantic and text semantic channels through multi-module design, an auxiliary positioning mechanism is designed for different channels, and feature information most related to a problem is obtained by enhancing sharing representation. Correlation mutual attention is carried out on global and local information by utilizing a graph attention network, so that characteristics related to questions under the same time and space are optimally expressed, and the difficulty of current video question answering is relieved.
Owner:SUN YAT SEN UNIV

Point cloud local feature extraction method and device, equipment and storage medium

ActiveCN113435461AAccurate Semantic PredictionPreserve local geometric detailsCharacter and pattern recognitionNeural architecturesFeature extractionPoint cloud
The invention discloses a point cloud local feature extraction method, device and equipment and a storage medium. The method comprises the following steps: firstly, providing a new graph attention network for point cloud local feature extraction, and enabling the network to rapidly and accurately carry out the semantic segmentation of a three-dimensional point cloud; secondly, constructing a local expansion graph area of each point by using an expanded K nearest neighbor search algorithm, and performing local feature expression by using Euclidean distance geometric correlation between a central point and a neighborhood thereof; finally, applying an attention mechanism to a designed network layer, called as a graph attention layer, dynamically learning the context attention features on the local expansion graph by giving proper weights to adjacent edges of a central point, and better reserving local geometric details of the point cloud through attention pooling operation. Compared with an existing point cloud local feature extraction method, the method achieves better performance in three-dimensional point cloud shape classification and segmentation tasks.
Owner:深圳市规划和自然资源数据管理中心(深圳市空间地理信息中心) +1

Short video classification method based on multi-modal feature complete representation

The invention discloses a short video classification method based on multi-modal feature complete representation, and the method comprises the steps: for the content information of a short video, providing constructing four subspaces from the perspective of modal missing mainly based on a visual modal feature, and obtaining potential feature representations respectively, further fusing the potential feature representations of the four subspaces by using an automatic coding and decoding network to ensure that more robust and effective public potential representations are learned; for label information, using inverse covariance estimation and a graph attention network to explore correlation between labels and update label representation to obtain label vector representation corresponding to the short video; providing a multi-head cross-modal fusion scheme based on multi-head attention for the public potential representation and the label vector representation, wherein the multi-head cross-modal fusion scheme is used for obtaining a label prediction score of the short video, wherein the overall loss function of the model is composed of traditional multi-label classification loss and reconstruction loss of an automatic coding and decoding network and is used for measuring the difference between a network output value and an actual value and guiding the network to find an optimal solution of the model.
Owner:TIANJIN UNIV

Traditional Chinese medicine syndrome type identification method based on graph attention network

The invention discloses a traditional Chinese medicine syndrome type identification method based on a graph attention network, and the method comprises the steps: standardizing a plurality of medical case data, and constructing a training set, namely corpus data; respectively establishing a symptom set and a syndrome set for all non-repeated symptoms and syndromes in the corpus; every two symptoms in each medical case data belonging to the training set being connected to serve as nodes, calculating point mutual information between the two symptoms in the symptom set, and updating nodes in a graph by using a graph attention network; weighting the updated nodes by using an attention mechanism to obtain feature vectors of symptoms; inputting the feature vectors into a linear layer for classification to obtain the probability of each syndrome; calculating a loss function in combination with the probability of each syndrome type and the real condition in the training set; performing back propagation according to the loss function, and completing model iteration; and inputting symptom information to be identified into the trained model to obtain a symptom type identification result. According to the invention, the accuracy of identification of the syndrome type is effectively improved.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Traffic flow prediction method fusing space-time attention neural network and traffic model

The invention provides a traffic flow prediction method fusing a space-time attention neural network and a traffic model, and the method comprises the steps: dividing feature data according to time slices, carrying out the GAT operation of the data on each time slice, and obtaining a new representation of a flow feature and a speed feature, the new representation of the speed characteristics is input into a traffic simulation model Greenshifts parabola model for transformation to obtain another new representation of the flow characteristics, then the two new representations of the flow characteristics are respectively subjected to gated loop unit network GRU processing, and then the two obtained flow representations are spliced to obtain input of a full connection layer; and performing full connection layer processing on the spliced feature data to obtain a final prediction result, and finally training a neural network model based on a deep learning theory. And obtaining a prediction result on the test set by using the trained network model. According to the method, the traffic flow prediction in the future time period can be realized under the condition that the traffic network and the flow characteristic and speed characteristic data thereof are known.
Owner:NANJING UNIV OF TECH

Method for extracting fault-tolerant information of contract document based on graph attention network

The invention provides a contract document fault-tolerant information extraction method based on a graph attention network, and relates to the technical field of computers and information processing. The method comprises the following steps: firstly, performing character recognition on a contract through an OCR (Optical Character Recognition) engine to obtain text content and corresponding position coordinates; text information features are extracted, wherein the text information features comprise position vectors of text information and word embedding expressions of text character strings; taking features extracted from the contract document as graph node features, and constructing a fault-tolerant contract text relation graph; then setting each layer structure and an activation function of the graph attention network; inputting the training set into the constructed graph attention network for training until the loss function converges; and finally, modeling a contract to be recognized into a text relation graph, inputting the text relation graph into the trained graph attention network, and finally obtaining the category of text information. According to the method, the dislocation information extraction of the contract document is realized, and the method has higher recognition efficiency and accuracy compared with the existing information extraction technology after OCR, and is beneficial to office intelligentization.
Owner:四川国路安数据技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products