Data query method and device, index generation method and device and electronic equipment

A data query and index technology, applied in the computer field, can solve the problems of index data expansion, inability to meet usage requirements, and occupying large storage space.

Pending Publication Date: 2021-07-06
ALIBABA GRP HLDG LTD
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, this solution has the problem of index data expansion, space-time query needs to build multiple indexes, and the index will occupy a large storage space as the data grows.
Moreover, due to the defects of the RTree spatial index itself, when the data reaches a certain level, the RTree spatial index needs to deal with leaf node splitting and rebalancing, resulting in a large bottleneck in data writing.
[0006] To sum up, the storage cost and write performance of the traditional spatial index in the massive data scenario have many shortcomings, which cannot meet the usage requirements, resulting in great restrictions on the storage and query of high-dimensional spatial data and / or spatio-temporal data, which cannot Meet the performance needs of computing and retrieval

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data query method and device, index generation method and device and electronic equipment
  • Data query method and device, index generation method and device and electronic equipment
  • Data query method and device, index generation method and device and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0032] refer to figure 1 , shows a flow chart of steps of a data query method according to Embodiment 1 of the present invention.

[0033] The data query method of the present embodiment includes the following steps:

[0034] Step S102: Obtain information to be queried for querying multidimensional data.

[0035] In this embodiment, the data query method is described by taking multidimensional data including spatial data and / or spatiotemporal data as an example. Of course, multidimensional data may also include document word frequency data, user rating data, and the like.

[0036] For example, the data query method can be applied to TSDB (spatio-temporal database). Multidimensional data can be used to store spatial coordinate data of spatial objects, and can also be used to store other attribute data of spatial objects. Spatial objects can be points, lines or surfaces in space, etc. For example, objects such as roads can be indicated by lines in space. Objects such as ve...

Embodiment 2

[0060] refer to image 3 , shows a flowchart of steps of a data query method according to Embodiment 2 of the present invention.

[0061] The data query method of this embodiment includes the aforementioned steps S102 to S108.

[0062] In this embodiment, the multidimensional data index is generated according to the node information of the multidimensional data index tree, and the node information of the multidimensional data index tree is generated according to the second one-dimensional code corresponding to the second multidimensional data; wherein, the node information includes Leaf node information and non-leaf node information, the leaf node information is generated according to the second one-dimensional code stored in the leaf nodes in the multidimensional data index tree, and the leaf nodes in the multidimensional data index tree are from the breadth level according to The adjacent sequence of the second one-dimensional code stores the second one-dimensional code in ...

Embodiment 3

[0075] refer to Figure 4 , shows a flowchart of steps of an index generation method according to Embodiment 3 of the present invention.

[0076] The index generation method of this embodiment includes the following steps:

[0077] Step S302: Obtain multiple second one-dimensional codes corresponding to multiple partitions of the second multidimensional data.

[0078] The second multidimensional data may be spatio-temporal data and / or spatial data. The second multidimensional data can be used to store spatial coordinate data of the spatial object. Spatial objects can be points, lines or surfaces in space, etc. For example, objects such as roads can be indicated by lines in space. Objects such as vehicles and people can be indicated by points in space, and the space coordinate data of vehicles at different times can be recorded through multi-dimensional data to determine their running trajectories and the relative positional relationship between space objects.

[0079] The...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a data query method and device, an index generation method and device and electronic equipment. The data query method comprises the steps that to-be-queried information used for querying multi-dimensional data is obtained, and the to-be-queried information comprises information of first multi-dimensional data to be queried; a first one-dimensional code corresponding to the first multi-dimensional data and a preset bitmap index storing a multi-dimensional data index are obtained, and the multi-dimensional data index is generated according to a second one-dimensional code corresponding to second multi-dimensional data; collecting a bitmap block file corresponding to the first one-dimensional code according to the bitmap index; According to the obtained bitmap block file, obtaining first multi-dimensional data queried by the to-be-queried information. Through the embodiment of the invention, the retrieval efficiency is higher.

Description

technical field [0001] The embodiments of the present invention relate to the field of computer technology, and in particular to a data query method, an index generation method, a device, electronic equipment, and a computer storage medium. Background technique [0002] Multidimensional data, also known as multidimensional data, usually refers to data of two dimensions or more, for example, spatial data and / or spatiotemporal data. In the prior art, the retrieval of multidimensional data is generally performed through a spatial index established based on the multidimensional data. [0003] For example, a traditional database in the prior art (such as ESRI, etc.), which constructs a spatial index (such as RTree or QuardTree, etc.) at the storage layer, and when constructing, indexes of time dimension data and space dimension data are constructed separately of. That is, create a one-dimensional index such as BTree for the time field, and create a spatial index such as RTree f...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/245G06F16/22G06F16/28
CPCG06F16/2228G06F16/245G06F16/283
Inventor 刘志鹏
Owner ALIBABA GRP HLDG LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products