Unstructured-data description method and device

A technology concerning unstructured data and structured data, applied in the fields of electrical digital data processing, special data processing applications, instruments, etc. It addresses problems such as inadequate data models, retrieval results whose accuracy falls short of user expectations, and constrained data retrieval speed.

Inactive Publication Date: 2013-09-18
BEIJING UNIV OF POSTS & TELECOMM
2 Cites, 22 Cited by

AI Technical Summary

Problems solved by technology

Current retrieval of unstructured data is mostly keyword retrieval over the title or content of the data, and the accuracy of the results rarely meets user expectations. At the same time, as the volume of data grows, retrieval speed is greatly restricted.
In particular, existing data models cannot support complex retrieval, and the description of data is limited to the basic properties of the data files themselves.

Examples

Embodiment Construction

[0026] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0027] The present invention builds on research into existing unstructured data models. It analyzes the behavioral characteristics of the subjects that operate on the data, addresses users' demand for complex retrieval, takes into account external factors such as the background in which the data was generated and the field to which it belongs, and on that basis proposes a method for describing unstructured data.
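As a rough illustration of what a description covering these factors might look like, the following Python sketch groups attributes into basic file properties, generation background, domain, and a log of operations by data-operating subjects. The class names and every field name are hypothetical; the text above does not specify a schema.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class OperationRecord:
    """One action performed on the data by an operating subject (hypothetical fields)."""
    subject: str
    action: str
    timestamp: str


@dataclass
class UnstructuredDataDescriptor:
    """Hypothetical grouping of attributes; not the schema from the patent."""
    basic: dict                                      # basic properties of the file itself
    background: dict = field(default_factory=dict)   # circumstances in which the data was generated
    domain: str = ""                                 # field to which the data belongs
    operations: list = field(default_factory=list)   # behavior of data-operating subjects

    def to_json(self) -> str:
        # asdict() recurses into the nested OperationRecord instances.
        return json.dumps(asdict(self), ensure_ascii=False, indent=2)


descriptor = UnstructuredDataDescriptor(
    basic={"name": "talk.mp4", "size_bytes": 1024},
    background={"event": "project review"},
    domain="education",
)
descriptor.operations.append(OperationRecord("alice", "upload", "2013-01-01T00:00:00Z"))
decoded = json.loads(descriptor.to_json())
print(sorted(decoded))
```

Keeping the operation log and background separate from the basic file properties is one way to let a retrieval layer filter on who used the data and in what context, rather than only on file name or content.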

[0028] As shown in Figure 1, the method for describing unstructured data in the embodiment of the present i...

Abstract

The invention discloses an unstructured-data description method and device. The method comprises the following steps: collecting and importing the attribute information of unstructured-data files, either manually or automatically; generating JSON (JavaScript Object Notation) files that describe the attributes of the unstructured-data files from the collected attribute information, and building a data model; and saving the unstructured-data files together with their corresponding JSON files. The method and device have the advantages of comprehensiveness and high efficiency.

Description

Technical field

[0001] The invention belongs to the technical field of databases and retrieval, and in particular relates to a method and device for describing unstructured data.

Background art

[0002] With the advent of the big data era, unstructured data accounts for an increasing proportion of all data. The IDC (Internet Data Center) report shows that by 2012, unstructured data will account for more than 75% of the entire Internet data volume, of which 50% to 75% is generated around people. Unstructured data not only covers a variety of data types, including documents, pictures, HTML, images, and audio/video, but also has obvious user characteristics, diverse storage media, and diverse applications. Existing unstructured data is usually distributed across enterprise servers or personal computers, and data owners manage it through file systems or data management systems. However, in the big data environment, the unstructured characteristics of data put...

Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 鄂海红, 韩晶, 宋美娜, 郑聪, 许可, 毕建鹏, 宋俊德, 黎燕, 于艳华
Owner: BEIJING UNIV OF POSTS & TELECOMM