Tool and method applied to massive labeled entity data storage

A technology of entity data and tag data, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as failure to pass storage models, tag values ​​are not published, etc., achieve high-performance concurrent read and write operations, and realize Concurrent read and write operations, strong targeted effect

Active Publication Date: 2018-11-23
BEIJING SCISTOR TECH +1
View PDF9 Cites 21 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In actual business scenarios, entity tags do not simply store a tag value, and there are many storage issues related to whether they can support upper-level business, such as: how to flexibly expand the tag system according to the needs of business development? How can I save the attached property of the tag value at the same time? How to set the lifetime of tag value? How to make the tag value have the attribute of confirmation state, so that the unconfirmed tag value will not be published? How to support custom expansion of the tag value dimension? How to store historical versions to support version backtracking? And how to realize the rapid import of massive offline data? Obviously, these problems cannot be solved by traditional storage models, or by simple relational databases.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Tool and method applied to massive labeled entity data storage
  • Tool and method applied to massive labeled entity data storage
  • Tool and method applied to massive labeled entity data storage

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The present invention will be further described in detail below in conjunction with the accompanying drawings.

[0046] The present invention proposes a tool and method for storing massive tagged entity data. In order to ensure the efficiency and scalability of tag data storage and management, the following technical means are adopted: 1) using a distributed storage scheme to ensure Under massive data, the storage has a strong horizontal expansion; 2), using the distributed computing framework to realize the fast access to the existing offline massive data; 3), using the full-text search technology, indexing the tag data, and supporting the full-text search and rich search methods. Therefore, it has multiple versions of data, data with additional attributes, column-oriented, sparsity, scalability and high performance.

[0047] Such as figure 1 and figure 2 As shown, it is distributed in Internet services, specifically including tag metadata module, entity tag data m...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a tool and a method applied to massive labeled entity data storage, and belongs to the fields of massive data storage and label data storage. The tool includes a label metadatamodule, an entity label data module and a unified-access API module. A user inputs a username, a password and a request; the unified-access API module accesses the label metadata module according tothe username and the password, and reads metadata of a label; the metadata are packaged according to the user request, are converted into a data format of a data layer, and are transmitted to a corresponding interface of the data layer to execute an operation on label data; the entity label data module executes a corresponding operation according to a request issued by the unified-access API module, and carries out persistence processing on the data; and at the same time, the entity label data module returns a processing result to the unified-access API module, the unified-access API module packages the data, and returns the same to a tool interface according to a specified format. The tool has high pertinency, scalability and persistence, and supports higher-level service needs.

Description

technical field [0001] The invention relates to a tool and a method applied to massive tagged entity data storage, belonging to the fields of massive data storage and tagged data storage. Background technique [0002] In recent years, the domestic Internet business has continued to develop, and mobile Internet technology has continued to mature. However, with the development of business, a large amount of data has accumulated, and the problem of data dispersion has become more and more serious, resulting in a serious weakening of the value of data. Similar to target management, Systems or applications such as automatic intelligent recommendation require complete, highly integrated, accurate, and time-sensitive data as the basis, which makes the problem of how to extract and store high-value entity data more urgent. In this context, applications such as labeling systems and portrait systems have received more and more attention and research. [0003] In actual business scena...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 孙波姚珊姜栋张建松高昕董建武王梦禹胡晓旭刘云昊梁维谢铭王峰汪军强
Owner BEIJING SCISTOR TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products