Unified classification method for multiple types of data and system

A classification method and data type technology, applied in the field of data processing, can solve problems such as large amount of calculation, and achieve the effect of reducing the amount of calculation

Inactive Publication Date: 2016-01-20
SHENZHEN UNIV
View PDF0 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is to provide a unified classification method and system for multi-type data in the metric space based on the support point space model, aiming to solve the problem of huge amount of calculation in the complex data type in the era of big data in the prior art question

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unified classification method for multiple types of data and system
  • Unified classification method for multiple types of data and system
  • Unified classification method for multiple types of data and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0059] A metric space is an abstraction of data types with a wide range of coverage. It abstracts complex data objects into points in metric space, and utilizes the triangular inequality of user-defined distance functions to remove irrelevant data and reduce the number of direct distance calculations. The biggest advantage of the metric space classification algorithm is its high general applicability. However, at the same time this is also its disadvantage. The data is abstracted into points in the metric space. Although the generality is improved, the coordinate information is lost at the same t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention is suitable for data processing and provides a unified classification method for data in multiple types. The method comprises steps that, A, in a metric space, original data sets are converted by utilizing a space model to generate data sets having a unified data type; and B, the data sets are classified according a classification algorithm. Compared with the prior art, in the metric space, only distance information of data in different types is considered, other attributes are not considered, so complex data types are simplified and are easy to operate, moreover, through supporting point space model conversion, data in the different types is converted into data in the unified data type, and thereby a classification algorithm use scope is enlarged to a certain degree.

Description

technical field [0001] The invention belongs to the field of data processing, in particular to a method and system for unified classification of multiple data types in a metric space based on a support point space model. Background technique [0002] Classification algorithm is an important data mining algorithm, which widely exists in applications such as pattern recognition, machine learning and data mining. Most of the existing classification algorithms are aimed at multidimensional data. Although there are some methods that can be used for non-multidimensional data, the weaknesses of these methods are often obvious. In the era of big data, there are more and more complex data types, and massive and complex data based on complex data objects (spatial data, text, images, audio, video, time-space sequences, etc.) Development has also produced a large amount of complex biological data (gene sequences, protein profiles, etc.) Classification has become a basic requirement for...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/241
Inventor 毛睿李萍陆敏华刘刚李荣华王毅罗秋明
Owner SHENZHEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products