Indexing and retrieval system based on HBase-ORM (Object Relational Mapping)

An indexing and retrieval technique for HBase-ORM-based systems that improves efficiency and reliability and simplifies data insertion.

Active Publication Date: 2016-12-07
THE 28TH RES INST OF CHINA ELECTRONICS TECH GROUP CORP

AI Technical Summary

Problems solved by technology

[0006] Purpose of the invention: the technical problem addressed by the present invention is the fast querying of massive, complex data in HBase.

Examples

Embodiment Construction

[0034] At present there is no unified ORM framework for NoSQL databases, owing to the diversity of database types. Phoenix is an open-source HBase query engine. It retrieves data by converting SQL queries into one or more HBase Scans, uses the HBase API directly, relies on Coprocessors and Filters for querying, and avoids MapReduce where possible to reduce latency. However, because Coprocessors and Filters have limited capabilities, abandoning MapReduce entirely limits performance on complex queries and on queries over large amounts of data. SimpleHBase is a lightweight HBase middleware that supports data type mapping and operation conversion, and it supports SQL-like queries by encapsulating HBase Filters. Like Phoenix, it is limited by the processing capabilities of Filters. The present invention implements an HBase ORM framework using Java annotations and reflection, uses Elasticsearch to index the data in HBase, and supports near-real-time massive an...
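The annotation-and-reflection approach described above can be illustrated with a minimal sketch. It is not the patent's implementation: the annotation names (`HTable`, `HRowKey`, `HColumn`), the `Track` entity and the `OrmSketch` helper are all hypothetical, and cells are kept as plain strings in a map rather than written through the HBase client.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;
import java.lang.reflect.Field;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical annotations; the names are illustrative, not taken from the patent.
@Retention(RetentionPolicy.RUNTIME)
@Target(ElementType.TYPE)
@interface HTable { String name(); }

@Retention(RetentionPolicy.RUNTIME)
@Target(ElementType.FIELD)
@interface HRowKey { }

@Retention(RetentionPolicy.RUNTIME)
@Target(ElementType.FIELD)
@interface HColumn { String family(); String qualifier(); }

// Hypothetical entity: each annotated field maps to one HBase cell.
@HTable(name = "track")
class Track {
    @HRowKey String id;
    @HColumn(family = "info", qualifier = "name") String name;
    @HColumn(family = "info", qualifier = "speed") double speed;
    String scratch = "not persisted"; // no annotation, so the mapper skips it

    Track(String id, String name, double speed) {
        this.id = id;
        this.name = name;
        this.speed = speed;
    }
}

final class OrmSketch {
    // Reads the table name from the class-level annotation.
    static String tableOf(Class<?> type) {
        HTable table = type.getAnnotation(HTable.class);
        if (table == null) {
            throw new IllegalArgumentException(type.getName() + " has no @HTable");
        }
        return table.name();
    }

    // Finds the field marked @HRowKey and returns its value as a string.
    static String rowKeyOf(Object entity) {
        for (Field field : entity.getClass().getDeclaredFields()) {
            if (field.isAnnotationPresent(HRowKey.class)) {
                return read(field, entity);
            }
        }
        throw new IllegalArgumentException("no @HRowKey field");
    }

    // Collects "family:qualifier" -> value for every @HColumn field.
    static Map<String, String> toCells(Object entity) {
        Map<String, String> cells = new LinkedHashMap<>();
        for (Field field : entity.getClass().getDeclaredFields()) {
            HColumn column = field.getAnnotation(HColumn.class);
            if (column != null) {
                cells.put(column.family() + ":" + column.qualifier(), read(field, entity));
            }
        }
        return cells;
    }

    private static String read(Field field, Object entity) {
        try {
            field.setAccessible(true);
            return String.valueOf(field.get(entity));
        } catch (IllegalAccessException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        Track track = new Track("t-1", "cargo ship", 12.5);
        System.out.println(tableOf(Track.class) + " / " + rowKeyOf(track) + " / " + toCells(track));
    }
}
```

In a real system the row key and cells would be serialized to bytes and passed to the HBase client, and the same field metadata could drive the reverse mapping from a result row back to an object.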

Abstract

The invention provides an indexing and retrieval system based on HBase-ORM (Object Relational Mapping). Data is inserted, read and modified through an automatically established mapping between the underlying database tables and upper-layer objects. The database layer is separated from the data access layer, so that application developers can concentrate on business logic, which improves development efficiency and reduces the error rate. Elasticsearch is used to index each row of data in HBase according to its data type, enabling fuzzy queries on text, interval queries on numeric values, range queries on longitude and latitude, and similarity queries on images, so that the real-time query needs of Web users for different data types are met.
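As a rough illustration of how the per-type queries named in the abstract could be expressed, the sketch below builds request bodies using three query types from Elasticsearch's query DSL: `fuzzy` for text, `range` for numeric intervals and `geo_bounding_box` for latitude/longitude ranges. The class name `QuerySketch`, its method names and the field names in the usage example are assumptions made for illustration, and the abstract does not say which query types the invention actually uses. Image similarity is left out because the text available here does not describe how it is computed.

```java
final class QuerySketch {
    // Minimal JSON string quoting; enough for the simple values used here.
    static String quote(String s) {
        return "\"" + s.replace("\\", "\\\\").replace("\"", "\\\"") + "\"";
    }

    // Fuzzy text match on one field.
    static String fuzzy(String field, String text) {
        return "{\"fuzzy\":{" + quote(field) + ":{\"value\":" + quote(text) + "}}}";
    }

    // Closed numeric interval [min, max] on one field.
    static String range(String field, double min, double max) {
        return "{\"range\":{" + quote(field) + ":{\"gte\":" + min + ",\"lte\":" + max + "}}}";
    }

    // Latitude/longitude box on a geo_point field, given its top-left and bottom-right corners.
    static String geoBox(String field, double top, double left, double bottom, double right) {
        return "{\"geo_bounding_box\":{" + quote(field) + ":{"
            + "\"top_left\":{\"lat\":" + top + ",\"lon\":" + left + "},"
            + "\"bottom_right\":{\"lat\":" + bottom + ",\"lon\":" + right + "}}}}";
    }

    public static void main(String[] args) {
        System.out.println(fuzzy("name", "cargo"));
        System.out.println(range("speed", 10.0, 20.0));
        System.out.println(geoBox("position", 32.5, 118.0, 31.5, 119.5));
    }
}
```

Each string would be placed under the `query` key of a search request. A production system would more likely use a JSON library or the Elasticsearch client's query builders than string concatenation.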

Description

Technical field

[0001] The invention belongs to the field of big data storage and retrieval, and in particular relates to an indexing and retrieval system based on HBase-ORM.

Background art

[0002] Database technology has developed into two families: SQL databases and NoSQL databases (a collective term for all non-relational databases). Many NoSQL databases exist today, most of them open source; well-known examples include MemcacheDB, Redis, Tokyo Cabinet, Flare, MongoDB, CouchDB, Cassandra and Voldemort. Non-relational databases do not support the relational model or join operations, are easy to use and deploy, and have achieved some success in the distributed storage of massive data and in fast reads and writes.

[0003] When a relational database is accessed and operated on, an ORM framework is usually responsible for data persistence. For example, Hibernate is the most popular OR...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/901, G06F16/90335
Inventor: 郭乔进, 梁中岩, 周鹏飞, 刘旸, 祁骏, 刘耀强, 刘玉龙, 胡杰, 陈文明, 孟剑萍
Owner: THE 28TH RES INST OF CHINA ELECTRONICS TECH GROUP CORP