Web data clustering method based on Mashup service function feature representation and density peak detection

A density peak, clustering method technology, applied in text database clustering/classification, special data processing applications, electrical digital data processing, etc. Clustering center points, etc.

Active Publication Date: 2020-07-31
ZHEJIANG UNIV OF TECH
View PDF2 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although the idea of ​​the DPC algorithm is concise and efficient, there are still some problems in practical applications: (1) the clustering effect is very dependent on the selection of the cut-off distance; (2) when the amount of data is large, it may not be easy to pick out Appropriate cluster center point

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Web data clustering method based on Mashup service function feature representation and density peak detection
  • Web data clustering method based on Mashup service function feature representation and density peak detection

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0085] The present invention will be further described below in conjunction with the accompanying drawings.

[0086] refer to figure 1 and figure 2 , a clustering method based on Mashup service feature representation and density peak detection, comprising the following steps:

[0087] Include the following steps:

[0088] The first step is to preprocess all Mashup service data that requires feature representation;

[0089] The second step is to extract functional nouns based on the preprocessed Mashup service data;

[0090] Step 3: For the functional noun set FS of each Mashup service, perform semantic association calculation on the semantic weight of each functional noun;

[0091] The fourth step is based on the semantic weight calculation results of the third step, combined with the TF-IDF algorithm and the Word2Vec model, to represent the Mashup semantic feature vector;

[0092] The fifth step is to calculate the density information for all Mashup semantic feature vec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a clustering method based on Mashup service function feature representation and density peak detection. The clustering method comprises the following steps: 1, preprocessing all Mashup service data needing feature representation; 2, carrying out function noun extraction operation; 3, performing semantic association calculation on the semantic weight of each functional noun;4, in combination with a TF-IDF algorithm and a Word2Vec model, performing expression of Mashup semantic feature vectors; 5, calculating density information of all Mashup semantic feature vectors participating in clustering; 6, screening out candidate points of the clustering center from all the Mashup semantic feature vectors; and 7, further screening out the most suitable K initial clustering centers, and carrying out K-means clustering. According to the method, the functional characteristics of the Mashup service can be effectively expressed, and the clustering performance of the Mashup service is enhanced.

Description

technical field [0001] The invention relates to the field of Mashup service data clustering in the Web environment, in particular to a clustering method based on Mashup service function feature representation and density peak detection. Background technique [0002] As one of the core technologies of the Web 2.0 era, Mashup technology can realize the integration of heterogeneous resources by combining WebAPI services with different functions. Once this convenient and efficient development technology came out, it has been favored by the majority of software developers, and many organizations have also released their own developed Mashup services and data resources to the Internet for users to call. However, with the continuous growth of mashup service resources on the Internet, how to help users quickly locate mashup services that meet their own needs has become an urgent problem to be solved. In addition, because most of the current Mashup services lack normative WSDL docum...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/35G06F40/289G06F40/30G06F40/247G06K9/62
CPCG06F16/3335G06F16/3344G06F16/35G06F18/23213
Inventor 陆佳炜吴涵赵伟马超治程振波徐俊肖刚
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products