Data labeling method, data labeling device and data labeling equipment

A data and data labeling technology, applied in the direction of digital data processing, special data processing applications, natural language data processing, etc., can solve the problems of data classification and classification, difficulty in enumerating data categories, and large workload, etc. The effect of manually labeling workload and improving accuracy

Active Publication Date: 2018-11-13
ADVANCED NEW TECH CO LTD
View PDF5 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, a large number of tables and fields relying on the traditional manual labeling method will bring a lot of workload. In addition, the data security personnel of the general company give priority to investment, and the understanding of the business is relatively limited, so it is difficult to enumerate all data categories, resulting in It is difficult to guarantee the quality of labeling under the large amount of data, which brings great troubles to data classification and classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data labeling method, data labeling device and data labeling equipment
  • Data labeling method, data labeling device and data labeling equipment
  • Data labeling method, data labeling device and data labeling equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The embodiment of this specification provides a data labeling method, device and equipment.

[0033] In order to enable those skilled in the art to better understand the technical solutions in this specification, the technical solutions in the embodiments of this specification will be clearly and completely described below in conjunction with the drawings in the embodiments of this specification. Obviously, the described The embodiments are only some of the embodiments of the present application, but not all of them. Based on the embodiments of this specification, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the scope of protection of this application.

[0034] The embodiment of this specification provides a data labeling scheme based on the density clustering algorithm and "partial labeling" combined with "automatic diffusion labeling", which can reduce the workload of manual labeling, improve the rel...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the specification discloses a data labeling method, a data labeling device and data labeling equipment. A solution includes: obtaining feature vectors of all to-be-labeled data; using a density clustering algorithm to carry out clustering on all the feature vectors to obtain a plurality of class clusters; carrying out screening in points contained by the class clusters to obtaina core object set and a non-core object set according to density situations of the class clusters; selecting partial core objects in the core object set of the class clusters and partial non-core objects in the non-core object set of the class clusters, and carrying out labeling; and carrying out automatic diffusion labeling on at least partial other points in the class clusters according to a labeling result.

Description

technical field [0001] This description relates to the technical field of computer software, in particular to a data labeling method, device and equipment. Background technique [0002] Data classification and classification is particularly important as the basic capability of big data security work. However, a large number of tables and fields relying on the traditional manual labeling method will bring a lot of workload. In addition, the data security personnel of the general company give priority to investment, and the understanding of the business is relatively limited, so it is difficult to enumerate all data categories, resulting in It is difficult to guarantee the quality of labeling under the large amount of data, which brings great troubles to data classification and classification. [0003] Based on this, a more effective data labeling scheme is needed. Contents of the invention [0004] The embodiments of this specification provide a data labeling method, devi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06F17/24
CPCG06F40/169
Inventor 侯辉超王心刚许志凯蔡佳良
Owner ADVANCED NEW TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products