Relational database for managing heterogeneous unstructured data, and methods for creating and querying the description information of that unstructured data

A technology for unstructured and heterogeneous data, applied in electrical digital data processing, special data processing applications, instruments, etc. It addresses the problems that existing external storage supports only a single protocol, cannot support multiple kinds of remote storage, and cannot adequately support big data. Its effects are enhanced integrity and high scalability.

Status: Inactive. Publication Date: 2013-04-10
TIANJIN NANKAI UNIV GENERAL DATA TECH
Cites: 2; Cited by: 9

AI Technical Summary

Problems solved by technology

[0003] Existing relational databases often use BLOB, TEXT, and similar field types, which cannot adequately support big data. Both Oracle and MS SQL Server offer BLOB-type fields whose data is stored outside the database: BFILE in Oracle and FileStream in MS SQL Server. In both cases the database stores the file name of the data and reads the data from disk through that file name.
The disadvantage is that the integrity of the data, and its consistency with the other fields of the database, must be guaranteed by external applications; the database itself has no corresponding constraint capability.
In addition, external storage supports only a single protocol, so it cannot support a variety of remote storage or adapt to the many distributed storage protocols that keep emerging.




Embodiment Construction

[0029] The present invention is further elaborated below in conjunction with an embodiment. GBase8a is a database that supports big data. The data is stored outside GBase8a, and it can be accessed as a local file, from an HTTP server or an FTP server, or through other proprietary protocols.
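The idea of reaching external data through different access protocols can be sketched in Python as follows. This is only an illustration under stated assumptions, not GBase8a's implementation: the names `KNOWN_SCHEMES`, `storage_kind`, and `read_external` are hypothetical, and the sketch relies on the standard library's `urllib.request.urlopen`, which handles `file`, `http`, and `ftp` URLs.

```python
import urllib.request
from urllib.parse import urlsplit

# Protocols named in the text; anything else is treated here as a
# proprietary protocol that would need its own handler (assumption).
KNOWN_SCHEMES = {"file": "local file", "http": "HTTP server", "ftp": "FTP server"}


def storage_kind(uri: str) -> str:
    """Classify an external data location by the scheme of its URI."""
    scheme = urlsplit(uri).scheme.lower()
    if not scheme:
        raise ValueError("absolute URI required: " + uri)
    return KNOWN_SCHEMES.get(scheme, "other protocol")


def read_external(uri: str) -> bytes:
    """Fetch the bytes behind a file, HTTP, or FTP URI."""
    if storage_kind(uri) == "other protocol":
        raise NotImplementedError("no handler registered for " + uri)
    with urllib.request.urlopen(uri) as response:
        return response.read()
```

Because the caller only passes a URI, supporting a further protocol would mean adding a handler rather than changing how the location is stored.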

[0030] A Uniform Resource Identifier (URI) can locate many types of data. The descriptive formatted text in GBase8a includes a URI string, so that simple formatted text is enough to record the URI of externally stored data.

[0031] The URI string is implemented by adding a URI identifier to the varchar type. Its data is multi-line text, with lines separated by a carriage return and line feed pair (CRLF), and includes:

[0032] The URI on the first line

[0033] URI = protocol-name ":" authentication-information directory file-name ["?" query-parameters] ["#" bookmark]

[0034] Only absolute URIs are supported, relative addresse...
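The first-line rules above can be illustrated with a short Python sketch. It assumes the URI follows the usual RFC 3986 layout (`scheme://authority/path?query#fragment`), which the source does not state explicitly, and it leaves every line after the first unparsed because their format is not given in the visible text. The names `ExternalDescriptor` and `parse_descriptor` are hypothetical.

```python
from dataclasses import dataclass
from typing import List
from urllib.parse import urlsplit


@dataclass
class ExternalDescriptor:
    protocol: str           # protocol name, e.g. "ftp"
    authority: str          # may carry authentication info such as user:password@host
    path: str               # directory and file name
    query: str              # optional query parameters after "?"
    bookmark: str           # optional bookmark after "#"
    extra_lines: List[str]  # remaining lines, kept as raw text


def parse_descriptor(text: str) -> ExternalDescriptor:
    """Split CRLF-separated descriptor text and parse the URI on line one."""
    lines = text.split("\r\n")
    parts = urlsplit(lines[0].strip())
    if not parts.scheme:
        # Mirrors the rule that only absolute URIs are supported.
        raise ValueError("only absolute URIs are supported")
    extra = [line for line in lines[1:] if line]
    return ExternalDescriptor(
        parts.scheme, parts.netloc, parts.path, parts.query, parts.fragment, extra
    )
```

For instance, calling `parse_descriptor` on a hypothetical value such as `"ftp://user:pw@host/dir/file.bin?mode=bin#part2"` yields the protocol `ftp`, the authority `user:pw@host`, and the path `/dir/file.bin`.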



Abstract

The invention provides a relational database for managing heterogeneous unstructured data. The database comprises formatted text that describes unstructured data stored outside the database. The formatted text comprises a Uniform Resource Identifier (URI) string, which gives the access protocol and storage location of the data, a data validation attribute field, and a data format field. The invention also provides a creation method and a query method for the relational database that manages the external heterogeneous data. The beneficial effects are that the database's external data management mechanism is highly extensible and can adapt to the various access protocols of external data, while the integrity of the database's external data management and the data independence with respect to external data location are improved.
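One way a data validation attribute could be used is sketched below in Python. The abstract does not say what the validation field contains, so this sketch assumes, purely for illustration, that it records a SHA-256 hex digest of the external data; the function name `matches_recorded_digest` is hypothetical.

```python
import hashlib


def matches_recorded_digest(data: bytes, recorded_sha256_hex: str) -> bool:
    """Return True if the external data still matches the recorded digest.

    Assumption: the validation field holds a SHA-256 hex digest. The
    source does not specify the actual field format.
    """
    actual = hashlib.sha256(data).hexdigest()
    return actual == recorded_sha256_hex.strip().lower()
```

A check of this kind would let the database itself detect that an external file was changed or replaced, instead of leaving that to external applications.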

Description

Technical field

[0001] The invention belongs to the field of data storage, and in particular relates to a relational database for managing external heterogeneous data and to methods for creating and querying it.

Background technique

[0002] "Big data", in short, refers to the ability to quickly obtain information from a wide variety of massive data. The term is usually used to describe the large volumes of unstructured and semi-structured data that a company creates, which would take too much time and money to load into a relational database for analysis. Big data analysis is often associated with cloud computing, because real-time analysis of large data sets requires a framework like MapReduce to distribute the work among tens, hundreds, or even thousands of computers.

[0003] Existing relational databases often use BLOB, TEXT, and similar field types, which cannot adequately support big data. Both relational databases ORACLE and MS SQL Server have B...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 武新, 范振勇, 张学, 崔维力, 赵伟
Owner: TIANJIN NANKAI UNIV GENERAL DATA TECH