Method for big data retrieval based on time characteristics and supporting complicated conditions

A time-featured and complex technology, applied in the field of big data retrieval, can solve the problems of reduced retrieval performance and increased cost, and achieve the effect of maintaining high efficiency, improving efficiency, and satisfying retrieval needs.

Inactive Publication Date: 2015-11-25
LINEWELL SOFTWARE
View PDF5 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, there are two main deficiencies in big data retrieval: first, in the big data environment, in order to provide the best query performance, a big data all-in-one machine integrating software and hardware is generally used, which significantly increases the cost of the project; second, in the big data environment, After the amount of data increases and the cluster expands, the performance of retrieval tends to decrease significantly

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for big data retrieval based on time characteristics and supporting complicated conditions
  • Method for big data retrieval based on time characteristics and supporting complicated conditions

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] At present, in big data applications, the data information is stored with time characteristics, that is, time stamps. The present invention divides the data information according to the time characteristics according to the timestamp generated when the data is saved or according to the user-defined time characteristics, and builds indexes with different cluster Collections on the full-text search engine Solr, for example, according to the year and month, in the When the user already knows the generation time of the queried information, he can quickly locate the index library of the current month's cluster Collection for retrieval, which can greatly reduce the scope of retrieval and improve efficiency; when the user cannot determine the generation time of the information Under certain circumstances, the results that meet the user's query are arranged in reverse order according to the time when the information was generated, as the user's default sorting method, and the so...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for big data retrieval based on time characteristics and supporting complicated conditions. According to timestamps generated during data storage or according to user defined time characteristics, data information is segmented according to the time characteristics, indexes are established on different collections on a full-text retrieval engine Solr; on the conditions that users already know the time of queried information generation, quick positioning can be performed on an index database of a collection of the same month to perform retrieval, the retrieval range can be greatly reduced, and the efficiency is improved. On the conditions that users cannot determine information generation time, results conforming to user query are provided for users for reference and selection. According to the method, not only is the shortcoming that a distributed database HBase has no secondary index overcome, but also the index establishment is more flexible, the retrieval on different demand conditions is met, and the retrieval efficiency can be guaranteed.

Description

technical background [0001] The invention relates to a large data retrieval method that supports complex conditions based on time features. Background technique [0002] In the environment of big data, it is the foundation and an important part of big data applications to quickly and accurately retrieve the information that users care about and are interested in according to the conditions provided by users. At present, there are two main deficiencies in big data retrieval: first, in the big data environment, in order to provide the best query performance, a big data all-in-one machine integrating software and hardware is generally used, which significantly increases the cost of the project; second, in the big data environment, After the amount of data increases and the cluster expands, the performance of retrieval tends to decrease significantly. Contents of the invention [0003] The purpose of the present invention is to provide a large data retrieval method based on...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2477G06F16/27
Inventor 陈光淙周华游建友
Owner LINEWELL SOFTWARE
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products