Unlock instant, AI-driven research and patent intelligence for your innovation.

Data set acquisition method, classification method, device, equipment and storage medium

An acquisition method and data set technology, applied in the field of data processing, can solve the problems of low efficiency, low efficiency of manual quality inspection, missing text content, etc., and achieve the effect of improving the accuracy rate

Active Publication Date: 2021-02-23
PING AN TECH (SHENZHEN) CO LTD
View PDF12 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The method of random extraction and manual analysis is obviously not efficient. On the one hand, if the data in the dialogue text is very large, in order to detect as many irregularities in the dialogue text as possible, the extracted text content will also increase. The content of manual quality inspection will also increase, and the efficiency of manual quality inspection is very low; regular place

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data set acquisition method, classification method, device, equipment and storage medium
  • Data set acquisition method, classification method, device, equipment and storage medium
  • Data set acquisition method, classification method, device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0024] It should be understood that when used in this specification and the appended claims, the terms "comprising" and "comprises" indicate the presence of described features, integers, steps, operations, elements and / or components, but do not exclude one or Presence or addition of multiple other features, integers, steps, operations, elements, components and / or collections thereof. It should also be understood that the term "and / or" used in the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Embodiments of the present invention provide a data set acquisition method, a data set classification method, a device, a computer device, and a storage medium. Wherein, the method for acquiring a data set includes: acquiring message-level dialogue text data and performing preprocessing; according to preset quality inspection points and rules corresponding to quality inspection points, using a full-text search engine to search from the preprocessed Querying the quality inspection points matching the rules in the dialog text data and marking them to obtain the quality inspection results; integrating the marked dialog text data including the quality inspection points into session-level conversation text data including the quality inspection points; The quality inspection result is updated according to the modification request of the user for the quality inspection point in the conversation text data; and the data set is extracted from the updated data according to a preset format. The embodiments of the present invention can extract accurate data sets, and use the extracted accurate data sets for classification, which can improve the classification accuracy of the classification model.

Description

technical field [0001] The present invention relates to the technical field of data processing, and in particular to a data set acquisition method, a method for classifying data sets, a device, computer equipment and a storage medium. Background technique [0002] During the agent sales process, a large number of dialogue texts may be generated with customers, and these dialogue texts will be stored in the agent sales platform. The current method is to randomly extract a certain number of text content, and then analyze it manually, such as finding out the non-compliance in the dialogue text (also called the illegal place, that is, the place where there is an error), and Improve the non-compliance or come to train the agents. The method of random extraction and manual analysis is obviously not efficient. On the one hand, if the data in the dialogue text is very large, in order to detect as many irregularities in the dialogue text as possible, the extracted text content will ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/31G06F16/33G06F16/35G06F40/289
CPCG06F40/289G06F16/35G06F40/20
Inventor 张雨嘉
Owner PING AN TECH (SHENZHEN) CO LTD