Automatic generation method of product description text based on pre-training model

An automatic generation and pre-training technology, applied in neural learning methods, text database query, text database indexing and other directions, can solve the problems of insufficient information, small amount of data, reducing consumers' desire to read, etc., to achieve convenient use and simple methods. Effect

Active Publication Date: 2020-08-11
EAST CHINA NORMAL UNIV
View PDF15 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] 1) A small amount of data can easily lead to underfitting and affect the quality of text generation. The acquisition of data, especially the acquisition of high-quality data, requires cost. How to generate smooth, readable and informative texts with a small amount of data? product description for
[0009] 2) Text length control, how to generate a product description with a controllable length, the information expressed by too short text length is insufficient, and too long text length will reduce consumers' desire to read

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic generation method of product description text based on pre-training model
  • Automatic generation method of product description text based on pre-training model
  • Automatic generation method of product description text based on pre-training model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0051] The present invention automatically generates a fluent, readable, and informative product description text according to the given product function attribute characteristics, and its specific generation steps are as follows:

[0052] (1) Construction of the dictionary

[0053] Step A1: adopt Jieba word segmentation tool to carry out word segmentation to the data of corpus, the air-conditioning data crawled on 58 air-conditioning nets are shown in the following table 1 example:

[0054] Table 1 Example of conditioning corpus

[0055]

[0056] Step A2: Load the stop vocabulary list, perform word frequency statistics for each word obtained, and remove words with a word frequency of 1 or appearing in the stop vocabulary list. See the example in Table 2 below for the constructed dictionary:

[0057] Table 2 Air conditioner dictionary example

[0058]

[0059] (2) Construction of co-occurrence relationship database

[0060] Step B1: segment each product description in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an automatic generation method of a product description text based on a pre-training model. The method is characterized in that a framework pre-training model method is adopted, an auxiliary database and a pre-trained BERT model weight are introduced, a product description text is automatically generated according to given product function class attribute characteristics. The method specifically comprises the steps of constructing a dictionary, a co-occurrence relation library and a retrieval database, and generating a model and a text. Compared with the prior art, themethod has the advantages that a section of product description text which is smooth, readable and rich in information amount is generated according to the given product function type attribute characteristics, the three problems of small data size, generated text length control and function type attribute characteristic controllability are effectively solved, and the generation quality and text diversity of the product description text are improved.

Description

technical field [0001] The invention relates to the technical field of natural language processing, in particular to a method for automatically generating product description text based on a pre-trained model. Background technique [0002] The automatic generation of product description is a technology that automatically generates a piece of descriptive text that contains given feature information, is fluent and readable, and has a controllable text length based on given product attribute characteristics. Taking an air conditioner as an example, given the attribute characteristics "quiet, defrost, 5 horsepower", a description text containing the above characteristics will be automatically generated. Writing product descriptions is a tedious and time-consuming job for people, but product descriptions are undoubtedly important, because it is an important way for consumers to understand products, and to a certain extent determines whether consumers buy products or not. no. Au...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/216G06F40/242G06F40/289G06K9/62G06N3/08G06F16/332G06F16/33G06F16/31
CPCG06F40/216G06F40/289G06F40/242G06F16/325G06F16/334G06F16/332G06F16/319G06N3/08G06F18/214
Inventor 高明高宝丽
Owner EAST CHINA NORMAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products