Unstructured data management method and device based on AI (Artificial Intelligence)

A technology of unstructured data and management methods, which is applied in the field of AI-based unstructured data management methods and devices, can solve the problems of inability to dig out the value of data, cannot be effectively used, and is difficult to manage, so as to improve query accuracy and expansibility, realize the value extraction of data features, and facilitate identification

Active Publication Date: 2018-07-10
BEIJING UNIV OF POSTS & TELECOMM +1
View PDF9 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, most of the existing unstructured processing methods and devices preset corresponding templates or rules for parsing and matching keywords in unstructured data, or directly store the metadata information of known data in In the index table, the potential important data information of other unstructured data is buried in the unstructured data, and the potential value of the data cannot be excavated. It is necessary to store the metadata information in advance, which undoubtedly increases the pressure on storage and Cost, due to the explosive growth of unstructured data such as images, sounds, and videos in enterprises, and unstructured data is not as easy to retrieve and utilize as structured data, making it difficult to manage and cannot be used effectively. solve

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unstructured data management method and device based on AI (Artificial Intelligence)
  • Unstructured data management method and device based on AI (Artificial Intelligence)
  • Unstructured data management method and device based on AI (Artificial Intelligence)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0029] Before introducing the AI-based unstructured data management method and device according to the embodiments of the present invention, a method in the related art will be briefly introduced.

[0030] Unstructured data not only has a large amount of data, but also grows very rapidly. However, in such a huge amount of data, only 10% of the data is structured data stored in the database, and the rest is composed of emails, videos, Weibo, documents, etc. , Page clicks, etc. generate a large amount of semi-structured data and un...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an unstructured data management method and device based on AI (Artificial Intelligence). The method comprises the following steps that: through HDFS (Hadoop Distributed File System), HBase and Nosql, storing a large scale of unstructured data objects; through an AI intelligence algorithm, extracting data features from the unstructured data objects, in addition, storing theextracted data features in an external table, and constructing an unstructured data model; and using an SQL (Structured Query Language) to retrieve the feature table of the unstructured data object, and realizing the management of the unstructured data, wherein the model is used for carrying out similarity search on the unstructured data. By use of the method, the query accuracy and the expansibility of the unstructured data can be effectively improved, the core data feature value extraction of the unstructured data is realized, the unstructured data can be conveniently identified, retrieved and used, and the diversity and the flexibility of value added service are fully embodied.

Description

technical field [0001] The present invention relates to the technical field of unstructured data, in particular to an AI (Artificial Intelligence, artificial intelligence)-based unstructured data management method and device. Background technique [0002] Semantic information of unstructured data includes format information, content information, etc. However, unstructured data has a huge amount and various formats, and the content information is difficult to completely extract and store, and the storage cost is extremely high. The main methods commonly used in related technologies are: through Preset parsing rules or feature templates, and then extract keywords to obtain data information and directly store metadata information of unstructured data in the index table. [0003] However, most of the existing unstructured processing methods and devices preset corresponding templates or rules for parsing and matching keywords in unstructured data, or directly store the metadata i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/334G06F16/3343G06F16/3344
Inventor 鄂海红宋美娜段云峰江裕锋
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products