Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Image-text fusion classification method and device, equipment, medium and product

A classification method, graphic technology, applied in the direction of neural learning methods, instruments, biological neural network models, etc., can solve the problems of difficult classification model classification effect, difficult training convergence, long training time, etc., to achieve better learning effect , easy to train to the effect of convergence

Pending Publication Date: 2021-12-24
GUANGZHOU HUADUO NETWORK TECH
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The reason is that in the various neural network models in the prior art that assist e-commerce platforms to classify commodity objects, the comprehensive vectors they rely on for classification have not truly realized the deep semantics of image information and text information. Therefore, it is always difficult for the classification effect of the corresponding classification model to break through its own inherent threshold, and it is prone to missed recognition, misclassification, difficult training convergence, training Various unfavorable situations such as a long time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Image-text fusion classification method and device, equipment, medium and product
  • Image-text fusion classification method and device, equipment, medium and product
  • Image-text fusion classification method and device, equipment, medium and product

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] Embodiments of the present application are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present application, and are not construed as limiting the present application.

[0058] Those skilled in the art will understand that unless otherwise stated, the singular forms "a", "an", "said" and "the" used herein may also include plural forms. It should be further understood that the word "comprising" used in the specification of the present application refers to the presence of the features, integers, steps, operations, elements and / or components, but does not exclude the presence or addition of one or more other features, Integers, steps, operations, elements, components, and / or groups thereof. It will be under...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an image-text fusion classification method and device, equipment, a medium and a product, and the method comprises the steps: carrying out the feature extraction of a commodity image of a commodity object, and obtaining a corresponding image coding vector, wherein the picture coding vector comprises a single-row vector corresponding to a plurality of primitives formed by segmenting the commodity picture; performing feature extraction on the abstract text of the commodity object to obtain a corresponding text coding vector; performing multi-level coding and decoding on an image-text splicing vector formed by splicing the image coding vector and the text coding vector based on a multi-head attention mechanism to obtain an image-text fusion vector; and carrying out classification according to the image-text fusion vector, and judging a classification label of the commodity object according to a classification result. According to the method, feature-level deep semantic interaction can be carried out according to the commodity picture and the abstract text of the commodity object, classification is carried out after fusion of two types of information is deepened, and the classification accuracy can be improved.

Description

technical field [0001] The present application relates to the field of e-commerce information technology, and in particular to a graphic-text fusion classification method and corresponding devices, computer equipment, computer-readable storage media, and computer program products. Background technique [0002] In the application scenario of the e-commerce platform, the classification of product objects based on product information occurs frequently. For example, it is necessary to identify different types of security properties of commodity objects launched by merchant instances, and identify whether they are not for sale. Or sometimes it is necessary to use classification means to perform commodity identification on the commodity information of the commodity object to identify whether the commodity therein belongs to the target item. Such downstream tasks will rely on the deep semantic feature extraction of the product information of the product object, and on this basis, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/34G06K9/62G06N3/04G06N3/08G06F40/242G06F40/126
CPCG06N3/08G06F40/242G06F40/126G06N3/047G06F18/25G06F18/2415
Inventor 郑彦
Owner GUANGZHOU HUADUO NETWORK TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products