Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Attention-based model training method and device and electronic equipment

A model training and attention technology, applied in the field of artificial intelligence, can solve problems such as consuming hardware resources, and achieve the effect of optimizing network parameters, high accuracy, and optimizing network structure

Pending Publication Date: 2022-05-13
BEIJING BAIDU NETCOM SCI & TECH CO LTD
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, technicians need to have a lot of experience in neural network structure design and parameter adjustment, and consume a lot of hardware resources to obtain the neural network structure through repeated replacement and experimentation with different structures of neural networks.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Attention-based model training method and device and electronic equipment
  • Attention-based model training method and device and electronic equipment
  • Attention-based model training method and device and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] Exemplary embodiments of the present disclosure are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present disclosure to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the disclosure. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0030] The present disclosure relates to the field of artificial intelligence technology, specifically the field of deep learning and computer vision technology, and can be applied to scenes such as image processing and image detection, such as face recognition, image recognition, behavior comparison and other scenes. The solutions provided by the present disclosure are...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an attention-based model training method and device and electronic equipment, relates to the technical field of artificial intelligence, in particular to the technical field of deep learning and computer vision, and can be applied to scenes such as image processing and image detection. The specific implementation scheme is as follows: obtaining an attention output matrix of an attention module in a neural network model, performing dimension reduction calculation on a sample dimension and a data block dimension of the attention output matrix based on a pooling layer of the neural network model, and determining a pooled first output matrix; carrying out convolution operation on the first output matrix based on a convolution layer of a neural network model, determining a second output matrix after convolution, and carrying out normalization processing and weighting processing on an output value of each header in the second output matrix to obtain an updated second output matrix; and obtaining an updated attention output matrix based on the updated second output matrix, and training the neural network model based on the updated attention output matrix.

Description

technical field [0001] The present disclosure relates to the field of artificial intelligence technology, specifically the field of deep learning and computer vision technology, which can be applied to image processing, image detection and other scenarios, and specifically relates to an attention-based model training method, device and electronic equipment. Background technique [0002] With the continuous development of computer technology, various neural network models have been widely used in fields such as images, texts, and voices. For example, Convolutional Neural Network (CNN) is a feedforward neural network with a deep structure , which extracts features through convolution calculations, captures features from local to global by deepening the network structure, and realizes the superposition of multi-dimensional features by adding channels. At present, technicians need to have a lot of experience in neural network structure design and parameter adjustment, and consum...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06V10/82G06N3/04G06N3/08
CPCG06N3/08G06N3/047G06N3/045
Inventor 王健韩钧宇
Owner BEIJING BAIDU NETCOM SCI & TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products