Method and system for label-free data classification and predication

A data classification and unlabeled technology, which is applied in data processing applications, character and pattern recognition, instruments, etc., can solve problems such as consuming a lot of manpower and time, is not suitable, and cannot be applied to unlabeled data prediction, so as to achieve accurate optimization and prediction speed, saving energy and time

Inactive Publication Date: 2018-02-09
CHENGDU SEFON SOFTWARE CO LTD
View PDF7 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The usage scenario of the second detection method is similar to that of the first one. It is not suitable for products with many types of products and complex technologies. This method can count the proportion of qualified (normal) tested products, but it will let go of a certain proportion of risks. Detected product
Therefore, the first two detection methods mentioned above require a lot of manpower and time and are not suitable.
The third way to detect the risk of each tested product from the data requires historical label data, but due to various reasons, many systems have not stored the label data, so this method has the technical problem of low prediction accuracy, and, due to its It relies heavily on the labels that have been marked in historical data, and cannot be applied to the environment of unlabeled data prediction, nor can it be used in business scenarios for anomaly detection

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for label-free data classification and predication
  • Method and system for label-free data classification and predication
  • Method and system for label-free data classification and predication

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] In the following, the present invention will be further described in detail in conjunction with the accompanying drawings and embodiments, so as to make the purpose, technical solutions and advantages of the present invention more clear. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0029] figure 1 A method for classifying and predicting unlabeled data according to an embodiment of the present invention is shown, and the method is described in detail below by taking its application to false trade detection in customs business as an example.

[0030] Step 101: Input business process data and obtain multiple business scenario data in the business process

[0031] For example, for the whole process of customs clearance, the customs clearance business scenarios are divided into: on-site customs receipt of customs declarations, general customs centralized examination...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and system for label-free data classification and predication. History data is utilized and rigid requirements on history data labels are removed. Also, experience information database accumulated in a service process is combined and an aim of automatically improving predication precision can be achieved through subsequence continuous optimization of real data. Themethod includes inputting service flow data and acquiring service scene data in a service flow; inputting service content data and grouping the service content data according to the service scene data; constructing characteristic indexes of service content data according to the service scene data and a corresponding service content database; cleaning the characteristic indexes of the service content data; clustering the service content data subjected to characteristic index cleaning and determining the class centers of different classes; calculating the sampling weights of the different classes; sampling the service content data according to the sampling weights and marking predication labels on the sampling results.

Description

technical field [0001] The present invention relates to the technical field of data classification prediction, in particular to a method and system for unlabeled data classification prediction. Background technique [0002] Risk detection is a commonly used quality detection method. This method is widely used in business analysis in various industries to detect potential risks in business for early detection and control. For general enterprises or regulatory agencies, risk detection methods are mainly divided into three types: one is to use quality inspectors to inspect the tested objects one by one to discover the risks of the tested objects; The risk of the tested product; the third is to predict the probability of the risk of each tested object through the information data and historical data of the production of the product, and then conduct actual sampling inspections on the higher risk tested products. [0003] Among the three risk detection methods described above, t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q10/06G06K9/62
CPCG06Q10/0635G06Q10/06393G06Q10/06395G06F18/23G06F18/24155
Inventor 田斌王纯斌赵红军覃进学
Owner CHENGDU SEFON SOFTWARE CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products