Method and system for populating a database with bibliographic data from multiple sources

A database and data source technology, applied in the field of database management systems, can solve the problem of integrating multi-source data into a combined database or data stack, etc.

Inactive Publication Date: 2010-03-24
SEMICON INSIGHTS
View PDF8 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

While these approaches allow for a more comprehensive research strategy with multi-source data, these methods do not address the problem of integrating such multi-source data into combined databases or data stacks

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for populating a database with bibliographic data from multiple sources
  • Method and system for populating a database with bibliographic data from multiple sources
  • Method and system for populating a database with bibliographic data from multiple sources

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.

[0022] figure 1 A schematic diagram of a known system 100 for storing data from various data sources into a database is provided. In this example there are four different data sources 102, which typically provide data in different source-specific formats. The access data 104 from each data source is interpreted using a source specific interpreter 114 to reference the database's existing data storage database. Stored database elements (eg, existing data) may be stored in data store 112 . It is very inefficient to attempt to directly normalize or interpret data in different formats accessed from different data sources by, for example, finding matches or associations of elements within a database (e.g., existing data) or within the same source file . Furthermore, such a system would ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

There is disclosed a method of populating a relational database of bibliographic data associated with one or more document-based collections, wherein the bibliographic data is sourced from two or moresources having distinct source-specific formats. The method generally comprises the steps of accessing source data from the two or more sources; independently standardizing the accessed data from each of the two or more sources in accordance with a common intermediate source-independent format dictated by an intermediate data structure, such that similar data elements from distinct source-specific formats are commonly identified within the intermediate format; and further interpreting the standardized data in relation to stored database elements comprising at least some database elements derived from each of the two or more sources, for populating the database in accordance with the relation with at least some repetitive elements replaced with reference thereto, consistent with a refineddatabase data structure distinct from the intermediate data structure. A system and computer-readable medium for implementing the above method are also disclosed.

Description

technical field [0001] This invention relates to database management systems and, more particularly, to methods and systems for storing bibliographic data from multiple sources in a database. Background technique [0002] Depending on the context of use, there are several ways to store relational data in the database. The data can be entered one piece at a time through the user interface, or it can be collected in an automated fashion from some other data source. In many systems, several data sources are stored in the database, each data source is interpreted in its own way, and the data is then correlated and added to other data already in the database. For example, a source data file in a particular source format can be obtained and directly converted to a format suitable for a database, based on, for example, a predetermined source-to-database conversion. That is, if a particular source format or schema (i.e., source data structure) is known, appropriate transformations...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F17/30595G06F16/284
Inventor 杰森·怀特阿萨德·阿巴斯
Owner SEMICON INSIGHTS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products