Semantic net based large scale offline data analysis framework

A data analysis and semantic web technology, applied in database update, electronic digital data processing, structured data retrieval, etc., can solve problems such as difficulty in information exchange, inability to reuse, and data is not standardized for storage and processing, so as to improve organizational capabilities. Effect

Inactive Publication Date: 2017-04-19
TONGJI UNIV
View PDF5 Cites 39 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the existence of various data acquisition devices in the same field, the original data is stored in different physical addresses in space. At the same time, the same type of data is not stored in a standardized manner, which brings great difficulties to data retrieval and calculation. ;
[0007] (3) Data traceability is difficult
Data collection and integration processes are diverse, and different collection departments only focus on their own data needs. Therefore, data collected by different departments in the same field has strong semantic ambiguity, which makes information exchange difficult and cannot be reused.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Semantic net based large scale offline data analysis framework
  • Semantic net based large scale offline data analysis framework
  • Semantic net based large scale offline data analysis framework

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0027] Centering on the data storage layer, the analysis framework is divided into upper and lower parts. The lower layer is the data storage and integration part, which is mainly responsible for ontology construction, RDF generation, and big data storage. The upper layer is the application implementation part, which is mainly for application requests, big data Calculation, semantic query, and result display. The upper part contains a total of 18 main processes, and the lower part contains a total of 9 main processes. The serial numbers in the figure represent the order in which actions occur.

[0028] In this embodiment, the Hadoop platform is taken as an example. Data storage is divided into RDF data storage and source data storage. RDF data is stored in the Hbase database by instance data, unstructured data, and semi-structured data generated by ontology. The form of a large table is stored in the Hive database. The call flow chart of the data analysis module based on the Se...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a semantic net based large scale offline data analysis framework. The large scale offline data analysis framework includes a data acquisition layer, a body layer, a data storage layer, a semantic layer, a data analysis layer and an application layer. A data source includes dynamic data and static data, and the static data includes data and database internal logic semantic and structure type. The static data is established into a body model in the analysis framework; the static data is extracted and modeled, and then the static data orients a user or an upper analysis task in a semantic service manner. The large scale offline data analysis framework can effectively improve the ability to organizing multi-source heterogeneous offline data and has a uniform interface to upper data; and application users or data analysis workers can access a lower data source through a semantic interface without knowing all the information of different data sources, and relevant data information is acquired. The large scale offline data analysis framework can effectively update the whole data source from a global perspective by correction of the body structure having changed content and update and inference service built in an application tool.

Description

technical field [0001] The invention relates to a large-scale offline data analysis framework based on semantic web. Background technique [0002] With the maturity of the new generation of information technology, the concepts of the Internet of Things, mobile Internet, and cloud computing are gradually accepted by individuals and enterprises. A large amount of data and information is growing at the PB level every day. How to process, analyze and obtain big data Valuable knowledge has become the ultimate focus of research by major institutions and companies. Big data is an important resource for enterprises to reconstruct the value chain, tap potential economic growth points, and drive independent innovation in the new era. Big data research also involves various fields, such as transportation, medical care, finance, Internet, public management, industry, service industry and scientific research in universities. [0003] Similarly, the processing and analysis of big data r...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/23
Inventor 王坚凌卫青程进
Owner TONGJI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products