
A method and system for unstructured data annotation management

An unstructured data annotation management method and system, applied in fields such as database retrieval and database clustering/classification. It addresses the lack of formatting and standardization in crawled data, which leaves the data incomplete for analysis and degrades vertical business analysis results. The stated effects are a higher level of intelligent analysis, improved efficiency, and greater utilization value of the data.

Status: Inactive. Publication date: 2019-03-01
珠海市智图数研信息技术有限公司
6 Cites, 1 Cited by

AI Technical Summary

Problems solved by technology

[0002] Information collected by large numbers of crawlers consists mainly of unformatted data that follows no formatting or standardization requirements. Such data is often incomplete when used for vertical business analysis, which directly affects the analysis results.



Examples


Embodiment 2

[0046] As shown in Figures 1-3, an unstructured data labeling management method specifically includes the following steps:

[0047] Step 1: Operate the label management platform 1. When unstructured data is transmitted to the label management platform 1, the storage management module 3 performs storage modeling according to the original data, basic attributes, underlying features and semantic features of the unstructured data, so that the unstructured data is converted and stored in the label management platform 1 and can be used by the functional modules inside the platform for calculation and processing;

[0048] Step 2: The unstructured data that enters the label management platform 1 is also processed by the business abstraction module 8, which abstracts the business unstructured data and formulates data standards that meet business requirements;

[0049] Step 3, the feature extraction module 2 extracts various specific informat...
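The storage modeling in Step 1 names four layers for each piece of unstructured data: original data, basic attributes, underlying features and semantic features. A minimal sketch of such a record follows. The class name `UnstructuredRecord`, the helper `build_record`, and the attribute keys `source` and `size_bytes` are hypothetical and chosen only for illustration; the patent text does not specify a schema.

```python
from dataclasses import dataclass, field
from typing import Any, Dict


@dataclass
class UnstructuredRecord:
    """Hypothetical storage model with the four layers named in Step 1."""
    raw_data: bytes                                                   # original data
    basic_attributes: Dict[str, Any] = field(default_factory=dict)    # e.g. source, size
    underlying_features: Dict[str, Any] = field(default_factory=dict) # low-level features
    semantic_features: Dict[str, Any] = field(default_factory=dict)   # e.g. keywords


def build_record(raw: bytes, source: str) -> UnstructuredRecord:
    """Wrap raw crawled bytes and fill in the attributes that need no analysis."""
    return UnstructuredRecord(
        raw_data=raw,
        basic_attributes={"source": source, "size_bytes": len(raw)},
    )


record = build_record(b"example text", source="crawler")
```

Under these assumptions, the feature layers start empty and would be filled later by whatever feature extraction the platform runs on the stored record.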



Abstract

A method and system for unstructured data annotation management are disclosed, comprising a label management platform. The label management platform comprises a feature extraction module, a storage management module, a conversion loading module, a data label module, an access interface module and a query processing module. The connection end of the label management platform is provided with a business abstraction module and a manual processing module. The data label module comprises a label creation module, a label marking module and a label storage module. The feature extraction module comprises a text extraction module, an image extraction module, an audio extraction module and a video extraction module. The text extraction module is used to extract stop words, TF-IDF features and keywords from text. By constructing a label management platform and using the data label module, the invention realizes one-stop management of the creation, conversion and storage of data labels that describe business attributes, thereby improving the utilization value of big data and raising the level of intelligent analysis of vertical business data.
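The text extraction module is described as working with stop words, TF-IDF features and keywords. The sketch below shows one common way to compute these, using term frequency multiplied by log(N / document frequency). The stop-word list, the English-only tokenizer and the function names `tokenize`, `tf_idf` and `keywords` are assumptions made for illustration; the patent does not state which tokenizer or TF-IDF variant is used.

```python
import math
import re
from collections import Counter

# Illustrative stop-word list only; a real deployment would use a fuller list.
STOP_WORDS = {"the", "a", "an", "of", "and", "is", "to", "in"}


def tokenize(text):
    """Lowercase, keep runs of ASCII letters, and drop stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def tf_idf(docs):
    """Return one {term: weight} dict per document, weight = tf * log(N / df)."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # avoid division by zero for empty documents
        weights.append(
            {t: (c / total) * math.log(n_docs / df[t]) for t, c in counts.items()}
        )
    return weights


def keywords(doc_weights, k=3):
    """Top-k terms by weight; ties are broken alphabetically."""
    return sorted(doc_weights, key=lambda t: (-doc_weights[t], t))[:k]
```

With this weighting, a term that appears in every document gets weight zero, so the keywords of a document are the terms that distinguish it from the rest of the corpus.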

Description

Technical field

[0001] The invention relates to the field of data management, in particular to a method and system for labeling and managing unstructured data.

Background

[0002] Information collected by large numbers of crawlers consists mainly of unformatted data that follows no formatting or standardization requirements. Such data is often incomplete when used for vertical business analysis, which directly affects the analysis results.

[0003] Therefore, a method and system for unstructured data labeling management is needed to solve the above problems.

Summary of the invention

[0004] The purpose of the present invention is to provide a method and system for unstructured data labeling management. By building a label management platform and using the data label module to realize "one-stop" management of creating, converting and storing data labels that describe business attributes, and improving big data ...
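The data label module is described as covering label creation, label marking and label storage. A minimal in-memory sketch of that lifecycle follows. The class `LabelStore` and its methods are hypothetical names introduced here; the patent text does not describe a concrete interface or storage backend.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class LabelStore:
    """Hypothetical in-memory stand-in for label creation, marking and storage."""
    labels: Dict[str, str] = field(default_factory=dict)      # label name -> description
    marks: Dict[str, Set[str]] = field(default_factory=dict)  # record id -> label names

    def create_label(self, name: str, description: str) -> None:
        """Register a label that describes a business attribute."""
        self.labels[name] = description

    def mark(self, record_id: str, name: str) -> None:
        """Attach an existing label to a record; unknown labels are rejected."""
        if name not in self.labels:
            raise KeyError(f"unknown label: {name}")
        self.marks.setdefault(record_id, set()).add(name)

    def records_with(self, name: str) -> List[str]:
        """Return the ids of all records carrying the given label, sorted."""
        return sorted(r for r, names in self.marks.items() if name in names)


store = LabelStore()
store.create_label("finance", "records about financial business")
store.mark("doc-1", "finance")
```

Rejecting marks for labels that were never created is a design choice of this sketch; it keeps the set of labels in use aligned with whatever data standard has been defined for the business.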


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F16/906
Inventor: 邓炽成
Owner: 珠海市智图数研信息技术有限公司