Unlock instant, AI-driven research and patent intelligence for your innovation.

Generation model for small sample multi-round dialogues in e-commerce live broadcast scene

A technology for generating models and small samples, which is applied in the fields of electronic digital data processing, instruments, calculations, etc. It can solve the problems that e-commerce live broadcast scenes are not included, and the generation model cannot well solve the dialogue tasks of e-commerce live broadcast scenes.

Active Publication Date: 2022-04-29
北京中科深智科技有限公司
View PDF8 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Existing open-domain pre-trained generative models are trained based on datasets such as persona-chat, DailyDialog, and Wizard of Internet (WiT), which do not contain data from e-commerce live broadcast scenarios, resulting in the inability of these pre-trained generative models to Good solution to dialogue tasks in e-commerce live broadcast scenarios

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Generation model for small sample multi-round dialogues in e-commerce live broadcast scene
  • Generation model for small sample multi-round dialogues in e-commerce live broadcast scene
  • Generation model for small sample multi-round dialogues in e-commerce live broadcast scene

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the embodiments of the present invention. Apparently, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0034] refer to figure 1 , the present invention provides a small-sample multi-round dialogue generation model for e-commerce live broadcast scenarios, using a unary language model to construct a Chinese vocabulary containing words and words, using jieba to segment the input text with reference to the vocabulary, and using The words and words obtained after word segmentation are used to characterize the input; words or words, roles, round numbers and positions are embedded an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a small sample multi-round dialogue generation model for an e-commerce live broadcast scene, which comprises the following steps of: constructing a Chinese word list containing characters and words by using a unary language model, carrying out word segmentation on an input text by referring to the word list by using jieba, and representing the input by using the characters and the words obtained after word segmentation; the sum of the characters or words, the roles, the number of rounds and the positions is used as an embedding representation to be input into the model; the model totally comprises 12 Transform blocks, and a decoder and an encoder are fused together in each block, so that context understanding and reply generation can be realized, and parameter sharing can be realized; controlling the access of the current word to the context word in each block by using two self-attention mask modes; the words in the context position can see all the words in the context, and the words in the reply position can only see the previous words; and outputting the hidden state corresponding to each word in the last layer. The dialogue system is realized on a data set based on a small number of samples by using dialogues of a real scene in an e-commerce live broadcast process and adopting a prompt mode.

Description

technical field [0001] The invention belongs to the technical field of dialog systems, and more specifically relates to a generation model of small-sample multi-round dialogs used in e-commerce live broadcast scenarios. Background technique [0002] Current dialogue systems, whether chat systems (e.g., Blender-Bot) or task-oriented dialogue systems (e.g., MinTL), require large dialogue datasets for fine-tuning on language generation models. Fine-tuning these generative models with large datasets is expensive, and collecting large domain-specific datasets takes a lot of manpower and material resources, and also requires large computing resources and a lot of time. In order to solve the problem of using a large number of training samples and the overhead caused by fine-tuning, we have adopted a learning method that does not require fine-tuning the gradient, but uses a small number of samples as the context of the generated model, that is, using a small number of samples. Prom...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/35G06F40/289G06F40/242G06F40/237
CPCG06F40/289G06F40/35G06F40/242G06F40/237
Inventor 宫明
Owner 北京中科深智科技有限公司