Water resource information vertical search method based on cloud platform

A vertical search technology built on a cloud platform, applied in the field of data search, that improves retrieval efficiency and quality.

Inactive Publication Date: 2012-11-28
HOHAI UNIV
0 Cites, 7 Cited by

AI Technical Summary

Problems solved by technology

The water conservancy field does not yet have a relatively mature, widely used, high-quality professional search tool.


Embodiment Construction

[0019] The present invention is further illustrated below with reference to the accompanying drawings and specific embodiments. These embodiments only illustrate the invention and are not intended to limit its scope. After reading the present invention, modifications in equivalent forms made by those skilled in the art all fall within the scope defined by the appended claims of this application.

[0020] As shown in Figure 1, the system comprises four layers: the infrastructure layer, the virtualization layer, the service layer, and the client layer. In the virtualization layer, a Hadoop cluster is deployed on the cloud platform's virtual machines, the Map/Reduce programming model is applied to distribute processing tasks, and data is stored in HDFS (the Hadoop Distributed File System). The service layer describes the working mechanism of the system and consists of three parts: grabber, indexer, ...
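The Map/Reduce model mentioned in [0020] can be illustrated for the indexing task with a small in-process simulation. This is only a sketch under stated assumptions: it does not use the Hadoop API or HDFS, the function names (`map_phase`, `shuffle`, `reduce_phase`, `build_index`) and the sample pages are hypothetical, and the patent text shown here does not specify how its indexer is actually split into map and reduce steps.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(doc_id: str, text: str) -> Iterator[Tuple[str, str]]:
    """Map: emit one (term, doc_id) pair per whitespace-separated token."""
    for term in text.lower().split():
        yield term, doc_id


def shuffle(pairs: Iterable[Tuple[str, str]]) -> Dict[str, List[str]]:
    """Shuffle: group emitted doc_ids by term, as a Map/Reduce framework would."""
    grouped: Dict[str, List[str]] = defaultdict(list)
    for term, doc_id in pairs:
        grouped[term].append(doc_id)
    return grouped


def reduce_phase(term: str, doc_ids: List[str]) -> Tuple[str, List[str]]:
    """Reduce: collapse duplicate doc_ids into a sorted posting list."""
    return term, sorted(set(doc_ids))


def build_index(docs: Dict[str, str]) -> Dict[str, List[str]]:
    """Run map, shuffle, and reduce in sequence inside one process."""
    pairs = (pair for doc_id, text in docs.items() for pair in map_phase(doc_id, text))
    return dict(reduce_phase(term, ids) for term, ids in shuffle(pairs).items())


# Hypothetical sample pages, not taken from the patent.
SAMPLE_DOCS = {
    "page1": "reservoir water level report",
    "page2": "flood control reservoir schedule",
}
INDEX = build_index(SAMPLE_DOCS)
```

On a real cluster the map and reduce functions would run on separate nodes and the shuffle would be performed by the framework; here all three run locally so the data flow is easy to follow.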


Abstract

The invention discloses a water resource information vertical search method based on a cloud platform. The method comprises the following steps: constructing a seed site list; capturing the water resource webpages of the seed sites with a web crawler and saving them to a local webpage database; constructing a water resource terminology standard set and organizing all terms in the standard set into a dictionary for the water resource field; parsing the webpages in the local webpage database and extracting their text; converting the semi-structured data of each webpage into structured data that can be conveniently stored and indexed; building an inverted index over the structured data and storing the result in the index database; and searching the index database according to the search request submitted by a user and returning the search result. The method improves the quality of the collected water resource webpages and of the search results, realizes distributed search, and improves search efficiency.
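The later steps of the abstract (text extraction, dictionary use, inverted index, search) can be sketched end to end as follows. This is a minimal illustration, not the patented implementation: the dictionary contents, the example URLs, and all function names are hypothetical, and it assumes the domain dictionary is used to recognize multi-word terms during tokenization, which the abstract does not spell out. Crawling is omitted; the pages are given as in-memory HTML strings.

```python
import re
from collections import defaultdict
from html.parser import HTMLParser
from typing import Dict, List, Set

# Hypothetical domain dictionary; the real terminology standard set is not given.
TERM_DICTIONARY: Set[str] = {"flood control", "water level", "reservoir"}
MAX_TERM_WORDS = max(len(term.split()) for term in TERM_DICTIONARY)


class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> content."""

    def __init__(self) -> None:
        super().__init__()
        self.parts: List[str] = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0:
            self.parts.append(data)


def extract_text(html: str) -> str:
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def tokenize(text: str) -> List[str]:
    """Greedy longest match: prefer multi-word dictionary terms over single words."""
    words = re.findall(r"\w+", text.lower())
    tokens: List[str] = []
    i = 0
    while i < len(words):
        for n in range(min(MAX_TERM_WORDS, len(words) - i), 0, -1):
            candidate = " ".join(words[i:i + n])
            if n == 1 or candidate in TERM_DICTIONARY:
                tokens.append(candidate)
                i += n
                break
    return tokens


def build_inverted_index(pages: Dict[str, str]) -> Dict[str, Set[str]]:
    """Map each token to the set of page URLs that contain it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for url, html in pages.items():
        for token in tokenize(extract_text(html)):
            index[token].add(url)
    return index


def search(index: Dict[str, Set[str]], query: str) -> List[str]:
    """Return the pages containing every query token (boolean AND)."""
    tokens = tokenize(query)
    if not tokens:
        return []
    return sorted(set.intersection(*(index.get(t, set()) for t in tokens)))


# Hypothetical pages standing in for the local webpage database.
PAGES = {
    "http://example.org/a": "<html><body><h1>Flood control plan</h1>"
                            "<p>Reservoir water level rising.</p></body></html>",
    "http://example.org/b": "<html><head><style>p{color:red}</style></head>"
                            "<body><p>Water quality report for the reservoir.</p></body></html>",
}
INDEX = build_inverted_index(PAGES)
```

Calling `search(INDEX, "flood control reservoir")` tokenizes the query with the same dictionary as the pages, so "flood control" is matched as a single term. Using one tokenizer for both indexing and querying is the design choice that keeps the two sides consistent.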

Description

Technical field

[0001] The invention relates to a data search method, in particular to a cloud platform-based vertical search method for water conservancy information.

Background art

[0002] With the rapid development of information technology, the "information avalanche" phenomenon is becoming increasingly serious. How to enable users, especially professional users in specific fields, to quickly retrieve the most accurate and useful information from massive information resources has become one of the hot research topics.

[0003] The vertical search engine is a search engine service model proposed in response to the shortcomings of general search engines: excessive information volume, inaccurate queries, and insufficient search depth. It provides information and related services of a certain value. At present, vertical search engines have been applied in IT, recruitment, shopping, tourism and many other fields.

[0004] The water conservancy industr...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F17/30
Inventor: 叶枫, 高依旻, 彭顺风, 周远超
Owner: HOHAI UNIV