Data index establishment method and device and index retrieval method and device

A technology for data indexing and establishment methods, applied in database indexing, structured data retrieval, database query, etc., can solve problems such as difficulty in achieving high efficiency and inapplicability of indexing methods, and achieve high-efficiency batch deletion, enrichment of data types, The effect of large amounts of data

Inactive Publication Date: 2019-06-28
CHINA MOBILE GROUP JILIN BRANCH +1
View PDF4 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The embodiment of the present application provides a data index establishment method, index retrieval method and device to solve the problem that the existing index method cannot be applied to a massive data environment and it is difficult to achieve efficient batch deletion

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data index establishment method and device and index retrieval method and device
  • Data index establishment method and device and index retrieval method and device
  • Data index establishment method and device and index retrieval method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0057] refer to figure 1 , which is a flow chart of the method for establishing a data index provided in Embodiment 1 of the present application, including the following steps:

[0058] Step 101: extract the field value corresponding to the specified field in the data file to be processed.

[0059] Here, the data files to be processed may be raw data files collected from various business systems, such as user data information, log files, and the like. Moreover, there may be at least one data file to be processed.

[0060] In the specific implementation, since the format of the data files collected from each business system is not necessarily the same, in the embodiment of the present application, in order to adapt to data files of various formats, it is based on not affecting the original format of the data files Specifically, by analyzing the collected files to be processed, the field values ​​corresponding to the specified fields in each data file are extracted. Among the...

Embodiment 2

[0081] refer to figure 2 , which is a flow chart of the index retrieval method provided in Embodiment 2 of the present application, including the following steps:

[0082] Step 201: Receive a retrieval request carrying retrieval conditions sent by a terminal.

[0083] Wherein, the retrieval request carries one of the following retrieval conditions: a field value corresponding to the specified field, a field value range corresponding to the specified field, and a prefix retrieval condition.

[0084] Step 202: Determine the index files satisfying the retrieval condition.

[0085] Wherein, the index file includes the index file of the association relationship between the field value corresponding to the specified field in the data file and the data file information, and the data file information is the data file name and / or the storage location of the data file.

[0086] Step 203: In the index files satisfying the retrieval condition, search for the field value satisfying the ...

Embodiment 3

[0102] like Figure 4 As shown, it is a structural diagram of the data index establishment device provided by Embodiment 3 of the present application, including:

[0103] Extraction module 41, for extracting the field value corresponding to the specified field in the data file to be processed;

[0104] The generation module 42 is used to generate an index file containing the association relationship between the field value corresponding to the specified field in the data file and the data file information, wherein the data file information is a data file name and / or a data file storage location .

[0105] Optionally, the generating module 42 is also used for:

[0106] After the extracting module extracts the field value corresponding to the specified field in the data file to be processed, for each data file, an ordered table containing the specified field in the data file and the field value corresponding to the specified field is generated;

[0107] The generating module ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of data storage and management, in particular to a data index establishment method and device and an index retrieval method and device, and is used for solving the problems that an existing index mode cannot be suitable for a mass data environment and high-efficiency batch deletion is very difficult to achieve. The data index establishing method provided by the embodiment of the invention comprises the following steps: extracting a field value corresponding to an appointed field in a to-be-processed data file; And generating an index file containingan association relationship between a field value corresponding to a specified field in the data file and data file information, the data file information being a data file name and/or a data file storage position.

Description

technical field [0001] The present application relates to the technical field of data storage and management, and in particular to a method for establishing a data index, an index retrieval method and a device. Background technique [0002] With the development of informatization and the advent of the era of big data, the amount of data is growing explosively. In order to support the rapid retrieval of data in a massive data environment, the design of data index has become a crucial link. [0003] The establishment of existing data indexes mostly reflects the relationship between keywords and specific records. Users can input a certain keyword and query a specific record containing the keyword in the index. For example, a piece of information stored in the database is: "Zhang San eats lunch", then when building an index, you can set "lunch" as a keyword, and then you can enter "lunch" to find the specific record as "Zhang San eats lunch" . However, in a massive data enviro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/22G06F16/24
Inventor 徐党生刘赫常剑飞辛术卞淑
Owner CHINA MOBILE GROUP JILIN BRANCH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products