Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Big data classification method, device and equipment based on hard clustering algorithm

A classification method and clustering algorithm technology, applied in the field of data analysis, can solve the problems of poor clustering effect and inaccurate classification results, and achieve the effect of good effect, good distinction and high accuracy

Pending Publication Date: 2019-03-08
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The main purpose is to solve the technical problem that the existing k-means method randomly selects the clustering center. If the clustering center is not selected properly, the clustering effect is not good and the classification results obtained are not accurate enough.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Big data classification method, device and equipment based on hard clustering algorithm
  • Big data classification method, device and equipment based on hard clustering algorithm
  • Big data classification method, device and equipment based on hard clustering algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0025] The embodiment of the present application provides a big data classification method based on a hard clustering algorithm. By performing two or more hard clustering methods on the data information, the cluster centers obtained are more accurate, so that the The classification of categories by the center is more accurate.

[0026] like figure 1 As shown, the embodiment of the present application provides a big data classifica...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a large data classification method, device and equipment based on a hard clustering algorithm, wherein the method comprises the following steps of acquiring the data information, dividing the data information into N sample data; carrying out the initial hard clustering analysis for each sample data, and determining N*K1 initial clustering centers; based on the secondary hard clustering analysis of N*K1 primary clustering centers, identifying K2 secondary clustering centers; according to the K2 secondary clustering centers, dividing the data information into K2 classification items, and storing each classification item and corresponding data information in a database. Through the scheme, the accuracy of the obtained secondary clustering center is higher, so that theclassification effect based on the secondary clustering center is better, and each classification item obtained can have more distinctive characteristics, and accordingly users can better distinguishthe various classification items, and cannot be confused.

Description

technical field [0001] The present application relates to the technical field of data analysis, in particular to a method, device and equipment for classifying big data based on hard clustering algorithms. Background technique [0002] Some companies are developing more and more rapidly, and the company has more and more employees. For these companies with a relatively large number of employees, it is necessary to conduct group analysis of employees and classify employees into categories. [0003] At present, clustering algorithms are usually used to classify the obtained crowd data, classify the characteristics of different categories of personnel, and analyze the crowd according to the classification results. For example, it can help market analysts distinguish different consumer groups from the consumer database. groups, and summarize the consumption patterns or habits of each type of consumers. [0004] The most commonly used clustering algorithm is the K-means algorith...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/23213Y02D10/00
Inventor 金戈徐亮肖京
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products