A clustering method for check-in location data considering location repetition and density peak points

A density peak and data clustering technology, applied in character and pattern recognition, instrumentation, computing, etc., can solve problems such as reduced reliability of results, difficulty in ensuring the continuity and integrity of density clusters, and failure to consider the connectivity of peaks and core points. , to achieve the effect of good space applicability and accurate peak value

Inactive Publication Date: 2019-03-12
FUZHOU UNIVERSITY
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most of these methods directly use the spatial distance of point objects as the similarity measure for clustering, and do not consider the repeatability of elements in spatial position.
Directly using the above method to cluster the check-in data is likely to cluster all objects in a certain repeated position into one category, too few points are not conducive to the spatial expression of active hot spots
Moreover, this type of method regards the inside of the density cluster as having a uniform density, and cannot obtain important information such as the density peak, so it is impossible to understand the central tendency of the check-in behavior
[0005] In 2014, Rodriguez et al. proposed a fast search and find density peak clustering algorithm (CFSFDP), but for check-in data with repeated positions, it is easy to select outliers with high position repeatability as density peaks, resulting in inconsistent results. Reliability declines; in addition, since the clustering process has a division method based on the density threshold, the connectivity between the peak and the core point is not considered, and it is difficult to ensure the continuity and integrity of the density cluster

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A clustering method for check-in location data considering location repetition and density peak points
  • A clustering method for check-in location data considering location repetition and density peak points
  • A clustering method for check-in location data considering location repetition and density peak points

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0048] Please refer to figure 1 , the present invention provides a check-in location data clustering method taking into account location repetition and density peak points, characterized in that it includes the following steps:

[0049] Step S1: Extract the number and latitude and longitude information of each piece of check-in information from the check-in text to form the original check-in object, and perform preprocessing on all the original check-in objects to form a set O; the preprocessing includes:

[0050] (1) Convert the latitude and longitude information of the original check-in object into Mercator plane coordinates; this step is to facilitate the calculation and representation of the Euclidean space distance between objects in the subsequent steps. The conversion method is as follows:

[0051]

[0052] Y=L·K

[0053] Where B is the lat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention relates to a check-in location data clustering method that takes into account location repetition and density peak points, including the following steps: 1. Constructing the original check-in object and performing preprocessing; 2. Counting the number of original check-in objects at each location to construct 3. Calculate the cut-off distance dc; 4. Calculate the local density ρi and the high-density nearest neighbor distance δi of each FPi; 5. Calculate the thresholds ρ0 and δ0 and filter out the density peak points; 6. Get The core point cluster connected with each peak point density; 7. Search the boundary points of each core point cluster and add them to the core point cluster to form a peak density cluster to complete the entire clustering process. The invention fully considers the problem of repeated check-in locations, effectively avoids the situation that outliers with a high number of repeated locations are selected as peaks and clustered, and the clustering results are more accurate and reliable, and can better reflect the spatial aggregation and activities of urban residents Changes have high practical value.

Description

technical field [0001] The invention relates to a method for clustering check-in location data in consideration of location repetition and density peak points. Background technique [0002] With the popularity of mobile devices with location-based service functions such as smartphones and tablet computers, the location-based social network LBSN continues to grow, providing a good data source for exploring urban business districts and solving problems such as urban transportation and resource allocation. Location check-in is a representative function in LBSN, which means that users use LBS-enabled devices to record their current location, facial expressions, photos and other information and post them on social networks. Because it is very difficult to obtain the accurate location of the user, the existing LBSN generally has a location candidate module, which lists the known locations that the user may be in for the user to choose. Therefore, when different check-in behaviors...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/62
CPCG06F18/2321
Inventor 邬群勇刘萌
Owner FUZHOU UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products