
A Generative Model for Small-Sample Multi-round Dialogue for E-commerce Live Scenarios

The invention concerns generative models trained with small samples, applied to digital data processing, computing, and related fields. It addresses the problem that existing pre-training datasets do not cover e-commerce live broadcast scenarios, so existing generative models handle dialogue tasks in those scenarios poorly.

Active Publication Date: 2022-08-02
北京中科深智科技有限公司
Cites: 8 | Cited by: 0

AI Technical Summary

Problems solved by technology

[0003] Existing open-domain pre-trained generative models are trained on datasets such as persona-chat, DailyDialog, and Wizard of Internet (WiT), which contain no data from e-commerce live broadcast scenarios. As a result, these pre-trained generative models cannot handle dialogue tasks in e-commerce live broadcast scenarios well.




Embodiment Construction

[0033] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0034] Referring to Figure 1, the present invention provides a small-sample multi-round dialogue generation model for e-commerce live broadcast scenarios. A unigram language model is used to construct a Chinese vocabulary containing characters and words, and jieba segments the input text with reference to this vocabulary. The characters and words obtained from segmentation represent the input; the sum of the character-or-word, role, round number and position embedding...
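The embedding sum described above can be illustrated with a minimal Python sketch. It is not the patent's implementation: the table sizes, the dimension `DIM`, the random initialisation, and the example ids are all assumptions made for illustration, and the segmentation step is omitted (the token ids are assumed to come from it).

```python
import random

def make_table(num_rows, dim, rng):
    """Build a randomly initialised embedding table (num_rows x dim)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(dim)] for _ in range(num_rows)]

def embed_inputs(token_ids, role_ids, round_ids, tables):
    """Return one vector per token: token + role + round + position embedding."""
    assert len(token_ids) == len(role_ids) == len(round_ids)
    embedded = []
    for pos, (tok, role, rnd) in enumerate(zip(token_ids, role_ids, round_ids)):
        rows = (
            tables["token"][tok],
            tables["role"][role],
            tables["round"][rnd],
            tables["position"][pos],
        )
        # Element-wise sum of the four embedding vectors.
        embedded.append([sum(values) for values in zip(*rows)])
    return embedded

rng = random.Random(0)
DIM = 8  # hypothetical embedding size, chosen only for the example
tables = {
    "token": make_table(100, DIM, rng),     # hypothetical vocabulary of 100 entries
    "role": make_table(2, DIM, rng),        # e.g. 0 = viewer, 1 = host (assumed roles)
    "round": make_table(4, DIM, rng),       # up to 4 dialogue rounds (assumed)
    "position": make_table(16, DIM, rng),   # up to 16 positions (assumed)
}

# Hypothetical ids for a five-token input spanning two rounds.
token_ids = [5, 17, 42, 8, 63]
role_ids = [0, 0, 0, 1, 1]
round_ids = [0, 0, 0, 1, 1]

embedded = embed_inputs(token_ids, role_ids, round_ids, tables)
print(len(embedded), len(embedded[0]))
```

Each position receives a single vector, so the role and round information travels with every token rather than being supplied as separate inputs.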



Abstract

The invention discloses a small-sample multi-round dialogue generation model for e-commerce live broadcast scenarios. A unigram language model is used to construct a Chinese vocabulary containing characters and words, and jieba segments the input text with reference to this vocabulary; the characters and words obtained from segmentation represent the input. The sum of the character-or-word, role, round-number and position embeddings is fed to the model as the embedded representation. The model contains 12 Transformer blocks in total; each block fuses the encoder and decoder so that context understanding and response generation share parameters. Two self-attention masks are used in each block to control which context words the current word can access: words in context positions can see all context words, while words in reply positions can see only the words before them. The last layer outputs the hidden state corresponding to each word. The invention uses real dialogues from e-commerce live broadcasts and adopts a prompt method to build the dialogue system from a dataset with only a small number of samples.
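The masking rule in the abstract can be sketched in a few lines of Python. This is an illustrative reading, not the patent's code: the function name `build_attention_mask` is hypothetical, the two masks are merged into one boolean matrix for brevity, and letting a reply position attend to itself as well as to earlier positions is an assumption.

```python
def build_attention_mask(context_len, reply_len):
    """Return mask where mask[i][j] is True if position i may attend to position j."""
    total = context_len + reply_len
    mask = [[False] * total for _ in range(total)]
    for i in range(total):
        for j in range(total):
            if i < context_len:
                # Context position: sees every context token, no reply token.
                mask[i][j] = j < context_len
            else:
                # Reply position: sees the whole context and reply tokens
                # up to and including itself (assumption), never later ones.
                mask[i][j] = j <= i
    return mask

mask = build_attention_mask(context_len=3, reply_len=2)
for row in mask:
    print(" ".join("1" if allowed else "0" for allowed in row))
```

With three context tokens and two reply tokens, the first three rows allow only the three context columns, and the last two rows form a lower-triangular pattern over the reply columns. Because the same block parameters serve both the bidirectional and the left-to-right pattern, one stack of blocks can act as encoder and decoder.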

Description

Technical field

[0001] The invention belongs to the technical field of dialogue systems, and more particularly relates to a generative model for small-sample multi-round dialogue in e-commerce live broadcast scenarios.

Background technique

[0002] Current dialogue systems, whether chat systems (e.g., Blender-Bot) or task-oriented dialogue systems (e.g., MinTL), require large dialogue datasets to fine-tune language generation models. Fine-tuning these generative models on large datasets is expensive, and collecting large domain-specific datasets consumes considerable human and material resources, computing resources, and time. To avoid the need for many training samples and the overhead of fine-tuning, we adopt a prompt-based learning method that requires no gradient-based fine-tuning and instead supplies a small number of samples as context to the generative model.

[0003] Exi...
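As a rough illustration of the prompt-based approach in [0002], the sketch below assembles a few example dialogues and the current conversation into one context string for a generative model. The prompt layout, the role names `viewer` and `host`, the helper names, and the sample dialogues are all invented for this example; the source does not specify a format.

```python
def format_dialogue(turns):
    """Render a list of (role, text) turns as one 'role: text' line per turn."""
    return "\n".join(f"{role}: {text}" for role, text in turns)

def build_prompt(examples, history, reply_role="host"):
    """Concatenate few-shot example dialogues, then the live history,
    ending with an open turn for the model to complete."""
    blocks = [format_dialogue(example) for example in examples]
    blocks.append(format_dialogue(history) + f"\n{reply_role}:")
    return "\n\n".join(blocks)

# Invented sample dialogues standing in for the small set of real examples.
examples = [
    [("viewer", "Is this jacket waterproof?"),
     ("host", "Yes, it has a waterproof coating.")],
    [("viewer", "How long does shipping take?"),
     ("host", "Usually three to five days.")],
]
history = [("viewer", "Does it come in blue?")]

prompt = build_prompt(examples, history)
print(prompt)
```

No model weights change in this setup: the examples only occupy part of the input context, which is why the number of samples is bounded by the context length rather than by training cost.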


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F40/35, G06F40/289, G06F40/242, G06F40/237
CPC: G06F40/289, G06F40/35, G06F40/242, G06F40/237
Inventor: 宫明
Owner: 北京中科深智科技有限公司