Massive data aggregation method and system based on cloud computing platform

A cloud computing platform and massive data technology, applied in the field of massive data aggregation methods and systems, can solve the problem that data classification cannot meet practical applications, and achieve the effect of efficient clustering

Active Publication Date: 2011-04-13
CHINA TELECOM CORP LTD
View PDF3 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, the aggregation and classification of data is more focused on the comparison of keywords. The metho...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Massive data aggregation method and system based on cloud computing platform
  • Massive data aggregation method and system based on cloud computing platform
  • Massive data aggregation method and system based on cloud computing platform

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The present invention will be described more fully hereinafter with reference to the accompanying drawings, in which exemplary embodiments of the invention are illustrated.

[0048] figure 1 A flowchart showing an embodiment of the cloud computing platform-based massive data aggregation method of the present invention.

[0049] Such as figure 1 As shown, in step 102, keywords of the network application are extracted from the data of the network application. Based on the network application, sort out the keyword information in the application database to obtain the keywords of the network application.

[0050] In step 104, the semantic similarity between the keyword of the network application and the ontology in the ontology database is calculated, and the similar ontology of the network application in the ontology database is determined. Semantic similarity can be obtained through semantic distance calculation. There are many algorithms for semantic distance calculat...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a massive data aggregation method and system based on a cloud computing platform, and the method comprises the following steps: extracting key words of a network application from data of the network application; computing the semantic similarity between the key words of the network application and an ontology in an ontology base, and determining the similar ontology of thenetwork application in the ontology base; marking the data of the network application which is similar to the ontology in the ontology base through RDF (resource description framework) description; and storing the data of the network application in network resource storage nodes under the similar ontology of the ontology base. The invention provides the method for aggregating massive data of SAAS(software-as-a-service) applications, Internet applications and other network applications, semantic information is adopted for carrying out cluster analysis on the extracted data, and the data processing is more accurate and reliable.

Description

technical field [0001] The present invention relates to data processing technology, in particular to a massive data aggregation method and system based on a cloud computing platform. Background technique [0002] Network applications such as SaaS (Software-as-a-service, software as a service) applications and Internet applications accumulate a large amount of hosted heterogeneous data, and the mining and utilization of this information will become a new application growth point. How to cluster and organize the massive data of network applications is a necessary work before data mining. [0003] At present, the aggregation and classification of data is more focused on the comparison of keywords, the method is relatively simple, and the data classification of network applications cannot meet the needs of practical applications. Contents of the invention [0004] A technical problem to be solved by the present invention is to provide a data aggregation method to realize effi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 顾茜赵鹏杨明川广小明谭国权
Owner CHINA TELECOM CORP LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products