Unlock instant, AI-driven research and patent intelligence for your innovation.

Quality control method and device for corpus processing

A quality control method and control device technology, applied in the direction of electronic digital data processing, special data processing applications, semantic tool creation, etc., can solve problems such as the inability to ensure the quality of corpus

Active Publication Date: 2021-07-23
BEIJING LAIYE NETWORK TECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The main purpose of this application is to provide a quality control method and device for corpus processing to solve the problem that the quality of corpus cannot be guaranteed after obtaining relevant corpus for natural language generation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Quality control method and device for corpus processing
  • Quality control method and device for corpus processing
  • Quality control method and device for corpus processing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In order to enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiment of the application will be clearly and completely described below in conjunction with the accompanying drawings in the embodiment of the application. Obviously, the described embodiment is only It is an embodiment of a part of the application, but not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the scope of protection of this application.

[0031] It should be noted that the terms "first" and "second" in the description and claims of the present application and the above drawings are used to distinguish similar objects, but not necessarily used to describe a specific sequence or sequence. It should be understood that the data so used may be interchanged under appropriate circumstances for...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application discloses a quality control method and device for corpus processing. The method includes receiving a paraphrase corpus obtained through a crowdsourcing task; judging whether the paraphrase corpus satisfies a first quality control condition; if it is judged that the paraphrase corpus satisfies the first quality control condition, putting the paraphrase corpus into If it is judged that the paraphrase corpus does not meet the first quality control condition, then triggering the release of the verification crowdsourcing task; judging whether the verification crowdsourcing task satisfies the second quality control condition; and if it is judged that the verification crowdsourcing task If the package task satisfies the second quality control condition, the paraphrase corpus is put into the database. The present application solves the technical problem that the quality of the corpus cannot be ensured after obtaining the relevant corpus for natural language generation. Through this application, the crowdsourcing quality control method can be integrated, and the NLP natural language processing technology and various indicators can be used to monitor the correctness, diversity and naturalness of the published crowdsourcing task results.

Description

technical field [0001] The present application relates to the field of natural language generation, in particular, to a quality control method and device for corpus processing. Background technique [0002] Natural Language Generation (English full name: Natural Language Generation, abbreviation: NLG) is one of the important components of the task-oriented dialogue system. [0003] The inventors found that the quality of the corpus cannot be ensured after obtaining relevant corpus for natural language generation, which further affects the collection of high-quality corpus data. [0004] Aiming at the problem in related technologies that the quality of the corpus cannot be guaranteed after obtaining the relevant corpus for natural language generation, no effective solution has been proposed yet. Contents of the invention [0005] The main purpose of the present application is to provide a quality control method and device for corpus processing, so as to solve the problem t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/332G06F16/36
Inventor 周义廷汪冠春胡一川张海雷
Owner BEIJING LAIYE NETWORK TECH CO LTD