Travel network community division method based on the Simhash algorithm
A community division technique, based on the Simhash algorithm, applied to the field of tourism complex network community division. It addresses problems such as sensitivity to isolated points, which distorts clustering results, and a large amount of calculation. Its effects are improved division efficiency, a simple and convenient algorithm, and reduced storage space.
Examples
Embodiment 1
[0024] Taking Sina Weibo as an example, the travel network community division method based on the Simhash algorithm of the present invention is shown in Figure 1 and is implemented by the following steps:
[0025] (1) Crawl user IDs and text data from the travel network and store them in a database, specifically including the following steps:
[0026] (1.1) Apply for a Sina App Key;
[0027] (1.2) Following the API documentation provided by Sina, look up the URL, HTTP request method, and parameters of the required interface, then request the user ID, the user's registered address address1, the user's microblog content text, and the address address2 from which the user published the microblog; the interface returns the data in JSON format;
[0028] (1.3) Use a Java program to process the JSON data returned by Weibo, and judge whether the user's registered address address1 is the same as the address address2 from which the user published the text content; if not, determine that the text...
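The address comparison in step (1.3) can be sketched in Java as follows. This is only a minimal illustration under stated assumptions, not the implementation of the invention: the class `AddressFilter`, the holder `Post`, and both method names are hypothetical. It assumes the JSON has already been parsed into plain fields, and it treats a record with a missing address as not matching the condition. The comparison is a trimmed, case-insensitive string match, which is also an assumption. What is done with the selected text afterwards is not specified here.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: selects records whose registered address differs
// from the address the microblog was published from.
class AddressFilter {

    // Hypothetical holder for the fields requested in step (1.2).
    static final class Post {
        final String userId;
        final String address1; // registered address of the user
        final String address2; // address the microblog was published from
        final String text;     // microblog content

        Post(String userId, String address1, String address2, String text) {
            this.userId = userId;
            this.address1 = address1;
            this.address2 = address2;
            this.text = text;
        }
    }

    // Returns true only when both addresses are present and differ.
    // Assumption: a missing address means the record cannot be judged.
    static boolean addressesDiffer(Post p) {
        if (p.address1 == null || p.address2 == null) {
            return false;
        }
        return !p.address1.trim().equalsIgnoreCase(p.address2.trim());
    }

    // Keeps only the posts for which address1 and address2 are not the same.
    static List<Post> selectPostsWithDifferentAddress(List<Post> posts) {
        List<Post> selected = new ArrayList<>();
        for (Post p : posts) {
            if (addressesDiffer(p)) {
                selected.add(p);
            }
        }
        return selected;
    }
}
```

In a real crawler the string match would likely need to be replaced by a normalized comparison at a fixed granularity (for example city level), since registered and published addresses may be written in different formats.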