A method, device and system for storing and querying unstructured data

A technology of unstructured data and data packets, applied in the computer field, can solve problems such as low efficiency, reduced data information storage and query efficiency, and low storage efficiency of relational databases

Active Publication Date: 2018-04-27
BANK OF CHINA
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, when using the above method to store and query the electronic image receipt information of business operations, with the continuous increase of business operations, the correspondence between the business metadata information stored in the relational database and the position information of the electronic image receipt in the disk becomes more and more More, the storage efficiency in the relational database will be lower and lower; moreover, the more data stored in the relational database, the lower the efficiency of querying the electronic image bill information from the disk according to the location information
Using a relational database to realize the storage and query of electronic image bills for business operations, when a large amount of data information is stored in the relational database, the efficiency of data storage and query will be reduced

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method, device and system for storing and querying unstructured data
  • A method, device and system for storing and querying unstructured data
  • A method, device and system for storing and querying unstructured data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0061] figure 1 It is a flow chart of Embodiment 1 of a method for storing unstructured data in the present invention, and the method includes:

[0062] Step 101: The data integration module compresses the business metadata information of the business operation and the electronic image receipt provided by the business system to obtain a compressed data package.

[0063] The data integration module compresses the business metadata information and unstructured electronic image receipts of the same business operation through XML technology and compression tools to obtain a compressed data package. The compressed data package includes: business metadata information and its contents The corresponding unstructured electronic image ticket.

[0064] The business metadata information of the same business operation and the electronic image ticket are compressed in one compressed data package, without separating the two kinds of data information. After querying the compressed data pack...

Embodiment 2

[0074] figure 2 It is a flow chart of Embodiment 2 of a method for unstructured data query in the present invention, and the method includes:

[0075] Step 201: the inquiry module receives the inquiry information input by the user, and the inquiry information includes electronic image note ID or business transaction information.

[0076] The present invention builds a web query module based on the BFW framework, which is used to input query information into the query module. The query module will construct a query request based on the query information input by the user and send it to the Master in the distributed system. Wherein, the query information input by the user includes electronic image bill ID or business transaction information, which is the same as the keyword stored in the data node in the compressed data package.

[0077] Step 202: The query module sends a query request to the Master in the distributed system, and the query request carries the query informatio...

Embodiment 3

[0089] image 3 It is a schematic structural diagram of Embodiment 3 of an unstructured data storage device of the present invention, and the device includes:

[0090] The data integration module 301 is connected to the Master 302 in the distributed system, and the Master 302 in the distributed system is connected to multiple data nodes A1-An.

[0091] It should be noted here that the data integration module is configured to send the compressed data package to the Master of the distributed database according to the HDFS protocol of the distributed file system.

[0092] The data integration module 301 is configured to compress the business metadata information of the business operation provided by the business system and the electronic image receipt to obtain a compressed data package, and send the compressed data package to the Master of the distributed system.

[0093] The Master 302 of the distributed system is used to store the compressed data package in any data node dist...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides methods, devices and a system for storing and inquiring unstructured data. When the unstructured data are stored, the Master of a distributed system receives service metadata information and electronic note images and compresses the received information and images into a compressed data package; the Master stores the compressed data package in any one of data nodes distributed in the Master, and therefore, the storage of the unstructured data is realized by use of the distributed system, and as the storage space of each data node is limited, the decline of the storage efficiency is avoided; for inquiry of the unstructured data, the Master in the distributed system receives an inquiry request sent by an inquiry module and then obtains the compressed data package matched with inquiry information from all the data nodes of the Master according to the inquiry information carried in the inquiry request, and therefore, the inquiry of the unstructured data is realized by use of the distributed system, and as the data information stored in each data node in the Master is limited and parallel inquiry is performed on all the data nodes, the decline of the inquiry efficiency is avoided.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method, device and system for storing and querying unstructured data. Background technique [0002] The bank implements the working mode of separating front-end business and back-end business processing. For a business operation, the front-end generates the physical business bills and business metadata information of this business operation, and scans the physical business bills to obtain electronic image bills, and converts the electronic image The bills are stored in the disk, and the business metadata information and the location information of the electronic image bills in the disk are correspondingly stored in the relational database. [0003] During the verification process of the above-mentioned business operations in the background, first query the location information of the electronic image ticket in the disk from the relational database according to the business me...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/3331
Inventor 何方敏
Owner BANK OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products