Commodity title generation method based on multi-mode GPT2 model
A multi-modal and commodity technology, applied in image data processing, instrumentation, electrical digital data processing, etc., can solve problems such as difficult-to-control generated content
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0023] The present invention will be further described below in conjunction with accompanying drawing and exemplary embodiment:
[0024] Such as figure 1 As shown, the present invention discloses a product title generation scheme based on multimodal GPT2, including a unified pre-processing and post-processing process.
[0025] GPT is an NLP (Natural Language Processing) model. GPT-2 is an upgraded version of GPT. The biggest difference is that it has more scale and more training data. GPT is a 12-layer transformer, and the deepest BERT is a 24-layer transformer. GPT -2 can be 48 layers, and its training data is a WebText dataset, which has done some simple data cleaning and covers a very wide area.
[0026] Step 1: Preprocess the corpus, count the attribute dictionary and special markers, and obtain the desensitized product title;
[0027] Step 2: Coding the product content, including encoding pictures with ResNet, encoding category names with GPT2, and encoding attributes w...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 

