Supercharge Your Innovation With Domain-Expert AI Agents!

Method and system for extracting data in document type database into relational database

A data extraction and database technology, applied in the database field, can solve the problems of inconsistent document domain sets, data extraction errors, and inability to batch extraction, and achieve the effect of ensuring data accuracy, accuracy, and extraction accuracy and efficiency.

Active Publication Date: 2020-01-24
WUHAN DAMENG DATABASE
View PDF8 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Aiming at the above defects or improvement needs of the prior art, the present invention solves the inability to extract data in batches caused by inconsistencies in the domain sets of documents when the document-type database extracts data from the relational database, and data extraction errors caused by factors such as repeated domain names in the document.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for extracting data in document type database into relational database
  • Method and system for extracting data in document type database into relational database
  • Method and system for extracting data in document type database into relational database

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0040] For the management of large amounts of data, databases are usually used for management. Currently, commonly used databases include document databases and relational databases. Document-type databases are established based on the idea of ​​shared documents, which can easily access data through the file system, but cannot conveniently use SQL language to operate data. Therefore, in some application scenarios, if the main database of the system is a relational database but the database of the auxiliary system is a document database, it is necessary to extract the data in the document database of the auxiliary system into the relational database as the main database for integration.

[0041] In a document-type database, the basic unit of storage is a document, and data of different attributes in a document are stored in different domains. In a relational database, the basic unit of storage is a table, and the data of different attributes in the table are stored in different...

Embodiment 2

[0062] In the process of extracting data from the document database, on the basis of the data extraction method in Embodiment 1, each step needs to be adjusted according to different situations.

[0063] In some specific application scenarios, there are multiple domains with the same name in the same document, and the data stored in these domains with the same name are different, but because the domain names are the same, they all need to be placed in the same field when extracting data, so they need to be connected. Ensure that all data is extracted into relational database tables, such as Figure 4 As shown, the extraction connection steps are as follows:

[0064] Step 201: Extract data in the domain with the same name.

[0065] Step 202: Perform data type conversion.

[0066] Step 203: Connect the converted data using a preset connector.

[0067] Step 204: Insert the connected data into corresponding fields of corresponding rows.

[0068] Specifically, the preset connec...

Embodiment 3

[0099] On the basis of the method for extracting data from a document-type database to a relational database provided in Embodiment 1 and Embodiment 2 above, the present invention also provides a system for extracting data from a document-type database that can be used to implement the above method, such as Figure 8 Shown is a schematic diagram of the system architecture of the embodiment of the present invention.

[0100] Such as Figure 8 As shown in A, the data extraction system in the document database of this embodiment includes at least one server 1 and at least one client 2, a relational database can be deployed in the server 1, a document database can be deployed in the client 2, the server 1 and the client 2 can exchange database data and files. The data in the document database deployed on client 2 is transmitted to the relational database deployed on server 1 for storage through data exchange; the attachments in the document database deployed on client 2 are trans...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the field of databases, in particular to a method and a system for extracting data in a document type database to a relational database, the method for extracting the data inthe document type database comprises the following steps: creating a view in the document type database, wherein the view comprises all documents needing to be subjected to data conversion; obtainingall documents in the view, and obtaining a document ID of each document; obtaining a union set of domain names appearing in all documents in the view; creating a table corresponding to the view in a relational database, wherein each domain name in the set is a field name in the table; and converting the first data of each domain in each document needing data conversion into second data of a presetdata type of the corresponding domain, and inserting the second data into a corresponding position of the table according to the globally unique ID and domain name of the document. According to the method and the system, the problem that different document domain sets are different during data extraction in a document type database is solved, and the method and the system for correctly and quickly extracting the data in batches are provided.

Description

【Technical field】 [0001] The invention relates to the field of databases, in particular to a method and system for extracting data from a document database to a relational database. 【Background technique】 [0002] Currently, there are two types of databases commonly used: document databases and relational databases. Document-type databases use documents as the basic storage unit to store data, and data of different natures are stored in different domains. Relational databases use tables as the basic storage unit to store data, and data of different natures are stored in different fields. The two databases have different organization methods for data and files, and the data access methods are also different. [0003] In order to solve the problem of data exchange between document databases and relational databases, to convert the data stored in document form in document databases to table form storage in relational databases, it is necessary to extract the data in document ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/25G06F16/28
CPCG06F16/254G06F16/284
Inventor 梅纲付铨胡高坤周淳
Owner WUHAN DAMENG DATABASE
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More