WEB data bank selection method based on WDB (World Data Bank) characteristics and user query requests

A query request, database technology, applied in the research field, can solve problems such as information redundancy

Inactive Publication Date: 2010-08-25
林培光
View PDF0 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, to obtain the characteristics of a Web database, it is necessary to extract certain data samples based on the real data of the database. There are a large number of data sources on

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • WEB data bank selection method based on WDB (World Data Bank) characteristics and user query requests
  • WEB data bank selection method based on WDB (World Data Bank) characteristics and user query requests
  • WEB data bank selection method based on WDB (World Data Bank) characteristics and user query requests

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

In response to the above described process, actual testing and verification of the existing network are carried out. The specific steps are as follows:

1. Data preparation

Using Watir tools to crawl national recruitment websites Zhilian Recruitment (www.zhilian.com), 51job.com, and local recruitment websites Dazhong Talent Network (www.51job.com) from the Internet according to industry attributes (classification attributes) using Watir tools. www.dazhonghr.com), Qilu Talent Network (www.qlrc.com) and other 4 websites contain more than 5,000 sample data (collected in December 2009) of job information (position name, number of recruits, work area), As the test data for method verification. For ease of presentation, the following four groups of symbols ZL, QC, DZ and QL represent the four websites respectively.

2. Extract Web database features

First, extract the characteristics of the text data (job title), numerical data (company size) and classification data (work area) of ea...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a WEB data bank selection method based on WDB (World Data Bank) characteristics and user query requests, which comprises steps of: (1) a feature extraction method of a WDB query interface, (2) the relevancy computation of a WDB and user queries based on the WDB characteristics, (3) the estimation of data volume meeting the user queries, (4) redundancy estimation based on the WDB characteristics, and (5) a selection method of data sources based on WDB characteristics and user queries; and through the methods, first questions of data integration and the provision of a high-efficiency data retrieval strategy in a Deep Web field are solved. Aiming at realizing more data returning at lower cost by selecting a most appropriate data bank for querying when facing mass Web data banks, the invention provides a Web data bank characteristic expression and extraction method based on a Web data bank independent sample and the data source selection method combining with the comprehensive consideration of three elements of query relevancy, returned data volume and data redundancy so as to realize the selection of a WEB data bank based on the WDB characteristics and the user queries and better meet the requirements of an integrated system.

Description

technical field The invention relates to a research field of computer application technology or Web data management and Deep Web, in particular to a method for selecting a WEB database based on WDB features and user query requests. Background technique With the widespread application of Web databases, the Web is "deepening" at an accelerated rate. The Deep Web contains richer and "professional" (focused on a certain field) information, and its data volume is also growing exponentially. Therefore, realizing the retrieval and utilization of information in the Deep Web has become one of the hotspots in the field of database research. In order to enable users to effectively utilize the massive information in the Deep Web, researchers have carried out research on Deep Web data integration, that is, to establish a Deep Web data integration system. The system can provide users with an integrated query interface, and combine the results returned by various Web databases into a uni...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
Inventor 林培光
Owner 林培光
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products