Clustering center determining method, determining system and clustering method
What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technique for determining the method and clustering center, applied to instruments, character and pattern recognition, computer components, etc., to overcome time-consuming clustering and inaccurate clustering results
Inactive Publication Date: 2017-12-15
HUBEI UNIV OF TECH +1
View PDF0 Cites 3 Cited by
Summary
Abstract
Description
Claims
Application Information
AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology
Problems solved by technology
[0003] However, the initial cluster center of the traditional K-means clustering algorithm is randomly selected
Method used
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more
Image
Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
Click on the blue label to locate the original text in one second.
Reading with bidirectional positioning of images and text.
Smart Image
Examples
Experimental program
Comparison scheme
Effect test
Embodiment 2
[0116] Such as image 3 As shown, a system for determining cluster centers includes:
[0117] The water wave group construction module 201 is configured to construct a water wave group including a plurality of water waves, and randomly initialize the position, wave height, and wavelength of each water wave, wherein each water wave includes m cluster centers;
[0118] The propagation module 202 is configured to perform propagation processing on each of the water waves in the water wave group;
[0119] The first judgment module 203 is configured to respectively judge whether the fitness value of each water wave after the propagation processing is greater than the fitness value of the water wave before the propagation processing, and obtain a first judgment result;
[0120] The replacement processing module 204 is configured to replace the propagation processing in the water wave group with the water wave after the propagation processing if the first determination result indicates that th...
Embodiment 3
[0126] Such as Figure 4 As shown, a clustering method includes:
[0127] Step 301: Obtain a data set to be clustered and an optimal cluster center. The optimal cluster center is the optimal cluster center determined according to the determination method in Embodiment 1, wherein the data set to be clustered contains n data sets, the number of clusters is k;
[0128] Step 302: Perform cluster division on each data in the data set according to the closest distance criterion;
[0129] Step 303: Determine whether the termination condition is met;
[0130] If yes, go to step 304;
[0131] Otherwise, return to step 302;
[0132] Step 304: Output the optimal clustering result.
[0133] In this embodiment, the termination condition in step 303 may be set as: the current iteration number reaches the set maximum iteration number.
[0134] In the K-means clustering process, the closest distance criterion criterion is used to divide samples, that is, when the cluster center is determined, each sample...
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more
PUM
Login to view more
Abstract
The invention discloses a clustering center determining method and a clustering center determining system. The method includes the following steps that: a water wave group including a plurality of water waves is constructed; propagation processing is performed on each water wave in the water wave group; and whether the fitness value of each water wave obtained after the propagation processing is performed is greater than the fitness value of the corresponding water wave obtained before the propagation processing is performed judged; if the fitness value of each water wave obtained after the propagation processing is performed is greater than the fitness value of the corresponding water wave obtained before the propagation processing is performed, the water waves obtained after the propagation processing are adopted to replace the corresponding water waves which are obtained before the propagation processing; if the fitness value of each water wave obtained after the propagation processing is performed is not greater than the fitness value of the corresponding water wave obtained before the propagation processing, the wave height and wave length of the water waves in the water wave group which are obtained before the propagation processing are updated, and the locations of the water waves which are obtained before the propagation processing are kept unchanged; the wave group and the number of iterations are updated; whether the current number of iterations is smaller than a set evolution algebra is judged; if the current number of iterations is smaller than the set evolution algebra, the method shifts to previous steps, propagation processing is performed on the water waves in the updated water wave group; and if the current number of iterations is not smaller than the set evolution algebra, water waves in the updated water wave group, which have the largest fitness value, are selected as optimal water waves. According to the method and system provided by the invention, from the perspective of optimization search, a clustering center is optimized from generation to generation by means of propagation processing, and the clustering center can be approximate to an optimal clustering center.
Description
Technical field [0001] The present invention relates to the field of big data mining, in particular to a clustering method and system. Background technique [0002] With the development of big data technology, the amount of data generated has increased sharply. Traditional data processing methods can no longer meet the requirements. Cluster analysis as a big data mining technology has once again become a research hotspot. Cluster analysis is an important unsupervised learning method. The purpose of cluster analysis is to find the structure hidden in the data, and according to a certain similarity measure, as far as possible to put the data with the same nature into the same category. [0003] However, the initial cluster centers of the traditional K-means clustering algorithm are randomly selected. For a clustering algorithm with multiple iterations, the cluster center of this iteration is obtained by updating the center of the previous iteration. Therefore, as the calculation bas...
Claims
the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more
Application Information
Patent Timeline
Application Date:The date an application was filed.
Publication Date:The date a patent or application was officially published.
First Publication Date:The earliest publication date of a patent with the same application number.
Issue Date:Publication date of the patent grant document.
PCT Entry Date:The Entry date of PCT National Phase.
Estimated Expiry Date:The statutory expiry date of a patent right according to the Patent Law, and it is the longest term of protection that the patent right can achieve without the termination of the patent right due to other reasons(Term extension factor has been taken into account ).
Invalid Date:Actual expiry date is based on effective date or publication date of legal transaction data of invalid patent.