Boundary data partitioning method and device

A boundary data and data technology, applied in the computer field, can solve problems such as inability to divide boundary data, and achieve the effect of accurate classification
CN107516101AActive Publication Date: 2017-12-26ALIBABA GRP HLDG LTD

Patent Information

Authority / Receiving Office
CN ยท China
Current Assignee / Owner
ALIBABA GRP HLDG LTD
Publication Date
2017-12-26

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

An aim of the application is to provide a boundary data partitioning method and device. Specifically, a correlation high-density zone of boundary data is obtained via undisputed data of a correlation clustering group in a clustering result; centralized data in the correlation high-density zone is cut out from the undisputed data of the correlation clustering group, similarity between the boundary data and the centralized data in the correlation high-density zone is analyzed, the boundary data is partitioned based on the similarity, and the boundary data can be accurately classified in a lossless manner.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The present application relates to the field of computers, and in particular to a technology for boundary data division. Background technique

[0002] At present, the K-Means clustering algorithm is often used in clustering-related data mining work to cluster and divide the data. The K-Means clustering algorithm mainly divides the data into multiple different points based on the distance between two points. clustering group. In the K-Means clustering algorithm, the closer the distance between a certain data and the center point of the cluster group, the higher the similarity between the data and other data in the cluster group. In other words, the K-Means clustering algorithm divides the data into the cluster group to which the center point closest to the data belongs. However, in actual scenarios, when a piece of data is consistent with the distance between the center points of two or more clustering groups, the data will not be classified into the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More