Parallel metadata acquisition system

A technology for collecting system and metadata, applied in the field of network communication, it can solve the problems of single point failure, long collection period, heavy load, etc., achieve efficient information resource management and sharing, improve retrieval speed and accuracy, and improve accuracy and reliability. The effect of efficiency

Inactive Publication Date: 2013-07-17
BEIHANG UNIV
View PDF3 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the OAI framework, there are shortcomings such as performance bottlenecks, long acquisition periods, high performance requirements, single point failures, and heavy loads. In order to solve these shortcomings and improve the performance of the acquisition system, many research institutions and scholars have explored the use of emerging grid technology, trying to use multiple collection nodes to collect DPs in parallel to speed up the collection of metadata. For example, the Digital Library Research Group of OldDominion University uses grid technology to improve the performance of metadata collection and Dr. Zheng Zhiyun in China studies digital library under grid. interoperability framework, etc.
[0004] The existing metadata parallel collection framework has the following disadvantages: (1) The real-time monitoring problem of the collection scheduling server: in the process of resource scheduling for the collection nodes, although the existing cluster technology is used, the RSS algorithm can be used between the collection nodes. Achieve balanced distribution of collection tasks, but the dynamic changes of the collection nodes and DP nodes are not considered during the collection process, the dynamics of the grid are not reflected, the collection nodes are not monitored in real time, and the integrity of the collected data cannot be guaranteed
(2) Synchronous update problem: SP cannot actively and timely reflect the data update operation in DPs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel metadata acquisition system
  • Parallel metadata acquisition system
  • Parallel metadata acquisition system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] Such as figure 2 As shown, the parallel collection system of the present invention is made up of 6 big modules: application module, metadata storage module, metadata processing module, collection module, collection scheduling module, digital resource processing module and registration server; provide digital books with shared metadata After the digital resource processing module, the library is converted into a metadata warehouse conforming to the OAI architecture, becoming a DP, and registering with the registration server relevant information that can be used by the collection and scheduling module; the collection and scheduling module assigns collection tasks to groups according to the static and dynamic information of the collection nodes , the acquisition module obtains the allocated DPs base address through the acquisition scheduling module to collect metadata, and transmits the metadata to the metadata processing module after the collection is completed; the meta...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a parallel metadata acquisition system, which is improved on the basis of an existing parallel metadata acquisition frame by combining network technology, mobile Agent technology and an OAI (Open Archives Initiative) frame model to improve the metadata acquisition rate and realize high-efficiency federated search service. By adopting a grouping strategy, the parallel metadata acquisition system realizes parallel acquisition of metadata outside and in groups, thereby improving the acquisition rate on the whole. Additionally, the metadata are stored in balanced classification, so that the system can perform parallel search when responding to a search request, and the searching speed and accuracy are improved.

Description

technical field [0001] The invention relates to a metadata parallel collection system, which belongs to the field of network communication and is used for peer-to-peer network search optimization and resource collection optimization. Background technique [0002] OAI provides an application-independent interoperability framework based on metadata collection, and has two main roles: data provider (DataProvider, DP) and service provider (ServiceProvider, SP). DP is the owner of metadata, and metadata is expressed by public metadata DC, which complies with OAI to publish metadata, and responds to collection requests in XML format. The SP is the data collector and the main body that provides value-added services to users by using the collected metadata. The working principle and process of OAI-based metadata collection are as follows: figure 1 Shown: [0003] In the OAI framework, SP collects and extracts metadata from each DP, processes and merges them and stores them centra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 王丽华尹科王宝会陈浩王海泉于雷
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products