Metadata generation system based on multi-source data

A technology of multi-source data and metadata, applied in the field of data processing, can solve problems such as large amount of data and inconsistent data structure, and achieve the effect of extensive utilization value and improvement of extraction efficiency and accuracy.

Inactive Publication Date: 2021-12-07
BEIJING YUCHEN SHIMEI SCI & TECH
View PDF2 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, massive natural language texts have the characteristics of large data volume, inconsistent data structures of different data sources, and fast update.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Metadata generation system based on multi-source data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0013] In order to further explain the technical means and effects of the present invention to achieve the intended purpose of the invention, the following is a specific implementation of a metadata generation system based on multi-source data proposed in the present invention in conjunction with the accompanying drawings and preferred embodiments The method and its effect are described in detail below.

[0014] An embodiment of the present invention provides a metadata generation system based on multi-source data, such as figure 1 As shown, it includes an original database, a metadata database, a mapping table database, a processor and a memory storing a computer program, and the original database is used to store data from N data sources {P 1 , P 2 ,…P N} Get the raw data, P n is the nth data source, and the value of n ranges from 1 to N. It can be understood that the value of N is determined according to specific application requirements, and can be increased or decrease...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a metadata generation system based on multi-source data. The system comprises an original database, a metadata database, a mapping table database, a processor and a memory in which a computer program is stored, the original database is used for storing original data acquired from N data sources {P1, P2,... PN}, Pn is the nth data source, and the value range of n is 1-N; the meta-database is used for storing a metadata record, the metadata record comprises M metadata fields {D1, D2,... DM}, Dm is the name of the mth metadata field of metadata, and the value range of m is 1-M; and the mapping table database is used for storing a mapping table Rn corresponding to each data source Pn, and the Rn is used for storing a mapping relation between an original data field of the Pn and {D1, D2,... DM}. The multi-source data can be quickly and accurately converted into the metadata with the same data structure, and the information extraction efficiency and accuracy of the multi-source data are improved.

Description

technical field [0001] The present invention relates to the technical field of data processing, in particular to a system for generating metadata based on multi-source data. Background technique [0002] With the rapid popularization and development of the Internet, a large amount of data information is generated and disseminated in the network, how to timely and accurately extract target information from a large number of natural language texts from different data sources has become increasingly urgent. However, massive natural language texts have the characteristics of large data volume, inconsistent data structures of different data sources, and fast update. Before extracting target information from massive text data, if data from different data sources can be processed to generate metadata with a unified data structure, the efficiency of target information extraction will be greatly improved. It can be seen that how to quickly and accurately construct metadata of the sa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/84
CPCG06F16/86
Inventor 刘羽林方张正义左为
Owner BEIJING YUCHEN SHIMEI SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products