Chat corpus labeling method and device, electronic equipment and storage medium

A chat corpus labeling technology, applied in the field of information processing, addresses the problem that a chatbot is limited by the quality and quantity of its FAQ library, where a small or low-quality library degrades the user experience. It reduces the burden of manual labeling, expands the corpus content, and makes the corpus richer and more forward-looking.

Pending Publication Date: 2020-05-08
TENCENT TECH (SHENZHEN) CO LTD

AI Technical Summary

Problems solved by technology

Therefore, the intelligence of a chatbot is limited by the quality and quantity of its FAQ library; a small or low-quality FAQ library degrades the user experience.

Examples

Embodiment Construction

[0066] To make the purpose, technical solutions, and advantages of the present invention clearer, the invention is described in further detail below with reference to the accompanying drawings. The described embodiments should not be regarded as limiting the invention; all other embodiments obtained by those of ordinary skill in the art without creative effort fall within the protection scope of the present invention.

[0067] In the following description, references to "some embodiments" describe a subset of all possible embodiments. It is understood that "some embodiments" may be the same subset or different subsets of all possible embodiments, and that these can be combined with each other where no conflict arises.

[0068] Before further describing the embodiments of the present invention in detail, the nouns and terms involved in the embodiments of the present invention are described, and the nouns and terms involved in the...

Abstract

The invention provides a chat corpus labeling method comprising: obtaining a question text set matched with a chat corpus, the question text set comprising at least one question text for which no corresponding reply statement has been obtained; expanding the question texts in the question text set through a question text expansion model network in a chat corpus annotation model to obtain corresponding question text pairs; in response to the obtained question text pairs, determining the reply statements corresponding to the question texts in the question text set through a question-answer model network in the chat corpus annotation model; and correcting the question text pairs and the reply statements, and establishing an association between each question text pair and its reply statement. The invention further provides a chat corpus labeling device, electronic equipment, and a storage medium. The method and device enable the labeling of chat corpora.
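The four steps in the abstract can be sketched as a small pipeline. This is only an illustrative sketch: the source does not specify the model interfaces, so the callables `expand`, `answer`, and `correct`, the `LabeledEntry` type, and the toy stand-in functions are all hypothetical placeholders for the question text expansion model network, the question-answer model network, and the correction step.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

QuestionPair = Tuple[str, str]  # (original question, expanded question)


@dataclass
class LabeledEntry:
    """One labeled item: a question text pair associated with a reply."""
    question_pair: QuestionPair
    reply: str


def label_chat_corpus(
    questions: List[str],
    expand: Callable[[str], str],
    answer: Callable[[str, str], str],
    correct: Callable[[QuestionPair, str], Tuple[QuestionPair, str]],
) -> List[LabeledEntry]:
    """Run the expand -> answer -> correct -> associate steps per question."""
    labeled = []
    for question in questions:
        # Step 2: expand the unanswered question into a question text pair.
        pair = (question, expand(question))
        # Step 3: determine a reply statement for the pair.
        reply = answer(pair[0], pair[1])
        # Step 4: correct both, then associate the pair with the reply.
        pair, reply = correct(pair, reply)
        labeled.append(LabeledEntry(question_pair=pair, reply=reply))
    return labeled


# Toy stand-ins (assumptions, not the patent's models).
def toy_expand(question: str) -> str:
    return question.rstrip("?") + ", please?"


def toy_answer(question: str, paraphrase: str) -> str:
    return f"[draft reply to: {question}]"


def toy_correct(pair: QuestionPair, reply: str) -> Tuple[QuestionPair, str]:
    # Stands in for a correction pass; here it only trims whitespace.
    return (pair[0].strip(), pair[1].strip()), reply.strip()


entries = label_chat_corpus(
    ["How do I reset my password?"], toy_expand, toy_answer, toy_correct
)
```

Passing the models in as callables keeps the sketch independent of any particular network architecture; in practice each callable would wrap a trained model or a human review step.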

Description

Technical field

[0001] The invention relates to information processing technology, in particular to a chat corpus labeling method, device, electronic equipment and storage medium.

Background

[0002] Human-computer interaction (HCI) refers to the process of information exchange between humans and computers, carried out in a certain interactive way using a certain dialogue language. With the development of human-computer interaction technology, more and more intelligent products based on it have emerged, such as chatbots. These products can chat with users and generate answers to users' questions. However, traditional techniques typically use a database of predefined responses and some form of heuristic reasoning to select an appropriate response based on the input and context. ...
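A minimal sketch of the traditional approach described above, assuming a token-overlap (Jaccard) heuristic over a small predefined FAQ. The heuristic, the threshold, and the FAQ entries are illustrative assumptions and are not taken from the patent; the point is only that a query outside the library's coverage gets no reply.

```python
from typing import Dict, Optional, Set


def tokenize(text: str) -> Set[str]:
    """Lowercase, drop question marks, and split on whitespace."""
    return set(text.lower().replace("?", "").split())


def select_response(
    query: str, faq: Dict[str, str], threshold: float = 0.5
) -> Optional[str]:
    """Return the reply whose FAQ question best overlaps the query.

    Returns None when no stored question reaches the similarity threshold.
    """
    query_tokens = tokenize(query)
    best_reply, best_score = None, 0.0
    for question, reply in faq.items():
        candidate_tokens = tokenize(question)
        union = query_tokens | candidate_tokens
        # Jaccard similarity: shared tokens over all tokens.
        score = len(query_tokens & candidate_tokens) / len(union) if union else 0.0
        if score > best_score:
            best_reply, best_score = reply, score
    return best_reply if best_score >= threshold else None


# Hypothetical FAQ library.
FAQ = {
    "how do i reset my password": "Use the 'Forgot password' link.",
    "what are your opening hours": "We are open 9am to 5pm.",
}

covered = select_response("How do I reset my password?", FAQ)
uncovered = select_response("Can I change my username?", FAQ)
```

Here `covered` is the stored password reply, while `uncovered` is `None` because no FAQ entry is close enough to the query.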

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/332, G06F16/35, G06N3/02, G06N3/08
CPC: G06F16/3329, G06F16/35, G06N3/02, G06N3/088
Inventor: 李勤, 曹云波, 周昊, 黄民烈
Owner: TENCENT TECH (SHENZHEN) CO LTD