Information acquisition method and device

An information collection technology based on user access information, applied in the Internet field, which addresses the inability of web crawler programs to collect information from dynamic web pages.

Inactive Publication Date: 2009-07-08
NEW H3C TECH CO LTD
0 Cites, 8 Cited by

AI Technical Summary

Problems solved by technology

[0005] When searching dynamic web pages, the fundamental problem lies in "input" and "selection". The web crawler program cannot perform the operations of "input" and "selection", and thus cannot perform information collection operations.

Embodiment Construction

[0018] The present invention provides an information collection method, specifically: obtaining the access information generated when a user browses a Web page, wherein the access information includes the HyperText Mark-up Language (HTML) file corresponding to the Web page; and then sending the obtained access information to the search engine database. The HTML file reflects the Web page the user actually browsed, whether static or dynamic, so the database can collect information about dynamic Web pages on the Web server.
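The flow in [0018] can be sketched roughly as follows. This is only an illustration, not the patented implementation: an in-memory SQLite table stands in for the search engine database, and the names `open_search_db`, `send_access_info`, and `collected_pages` are hypothetical.

```python
import sqlite3


def open_search_db() -> sqlite3.Connection:
    """Create an in-memory table standing in for the search engine database (assumption)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE collected_pages (id INTEGER PRIMARY KEY, html TEXT NOT NULL)"
    )
    return conn


def send_access_info(conn: sqlite3.Connection, html: str) -> int:
    """Store the HTML a user actually received and return the new row id."""
    cur = conn.execute("INSERT INTO collected_pages (html) VALUES (?)", (html,))
    conn.commit()
    return cur.lastrowid


db = open_search_db()

# HTML of a dynamic page, as it would look after the user typed a query.
# A crawler could not have produced this page, but the user's browsing did.
dynamic_html = "<html><body><h1>Results for 'router'</h1></body></html>"

row_id = send_access_info(db, dynamic_html)
stored = db.execute(
    "SELECT html FROM collected_pages WHERE id = ?", (row_id,)
).fetchone()[0]
```

Because the stored HTML is whatever the user's session produced, static and dynamic pages are handled the same way in this sketch.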

[0019] In addition, to give the search engine database a fuller picture of how users browse Web pages, the access information may further include the client IP address, server IP address, URL, and access time. Correspondingly, obtaining the access information of the user browsing the Web page includes: obtaining the IP address of the client where the user is located, the IP address of the...
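One possible shape for the access information listed in [0019] is shown below. The source does not specify a record layout, so the `AccessRecord` class, the `make_record` helper, and the sample addresses and URL are all assumptions made for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from ipaddress import ip_address
from typing import Optional


@dataclass(frozen=True)
class AccessRecord:
    """Hypothetical container for the fields named in [0019] plus the HTML file."""
    client_ip: str
    server_ip: str
    url: str
    access_time: datetime
    html: str


def make_record(client_ip: str, server_ip: str, url: str, html: str,
                access_time: Optional[datetime] = None) -> AccessRecord:
    """Build a record, rejecting malformed IP addresses with ValueError."""
    ip_address(client_ip)  # raises ValueError if not a valid IPv4/IPv6 address
    ip_address(server_ip)
    if access_time is None:
        access_time = datetime.now(timezone.utc)
    return AccessRecord(client_ip, server_ip, url, access_time, html)


# Sample values use documentation-reserved address ranges and a placeholder URL.
record = make_record(
    client_ip="192.0.2.10",
    server_ip="198.51.100.5",
    url="http://portal.example/search?q=router",
    html="<html><body>Router results</body></html>",
    access_time=datetime(2009, 1, 1, 12, 0, tzinfo=timezone.utc),
)
```

Keeping the record immutable (`frozen=True`) is a design choice of this sketch, so that a captured visit cannot be altered before it reaches the database.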

Abstract

The invention discloses an information collection method and device. In the technical scheme of the invention, the access information generated when users browse Web pages is transmitted to the search engine database, so that the search engine database can collect dynamic Web pages. In addition, because the information on the Web pages browsed by users is obtained, the actual usage of the Web pages by users can be accurately tracked. The method and device therefore also provide a significant reference for search engines when ranking Web pages.

Description

Technical field

[0001] The invention relates to Internet technology, in particular to an information collection method and device.

Background technique

[0002] Internet information is expanding rapidly, and search engines make it convenient for people to retrieve the information they need on the Internet.

[0003] Existing search engines, such as Google and Baidu, use an application program called a web crawler (Crawler, Spider, etc.) to obtain original information from the Internet. The crawler starts fetching web page content from a list of specific Uniform Resource Locators (URLs), generally a list of portal websites, and extracts keywords and other information from that content to build the database required by the search engine. It also extracts URLs pointing to other resources from the web page information and uses these new URLs as new starting points for a new round of...
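The crawling loop described in [0003] can be sketched as a breadth-first traversal. To keep the example runnable without network access, a small dictionary named `FAKE_WEB` stands in for the Internet; it, the `crawl` function, and the `portal.example` URLs are hypothetical. The search results page is deliberately not linked from anywhere, since it only exists after a user types a query.

```python
from collections import deque
from html.parser import HTMLParser


class LinkAndTextParser(HTMLParser):
    """Collect href targets of <a> tags and lowercase words from page text."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.words = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_data(self, data):
        self.words.extend(word.lower() for word in data.split())


# Stand-in for the Web (assumption): URL -> HTML.
FAKE_WEB = {
    "http://portal.example/": (
        '<a href="http://portal.example/news">News</a> '
        '<form action="/search"><input name="q"></form>'
    ),
    "http://portal.example/news": "<p>Network switch released</p>",
    "http://portal.example/search?q=router": "<p>Router results</p>",
}


def crawl(seeds, fetch):
    """Visit pages reachable by links from the seed list; return URL -> keywords."""
    index = {}
    queue = deque(seeds)
    seen = set(seeds)
    while queue:
        url = queue.popleft()
        html = fetch(url)
        if html is None:  # page could not be fetched
            continue
        parser = LinkAndTextParser()
        parser.feed(html)
        index[url] = parser.words
        for link in parser.links:  # newly found URLs start the next round
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return index


index = crawl(["http://portal.example/"], FAKE_WEB.get)
```

In this sketch the crawler indexes the portal and news pages but never reaches the search results page, because following links alone cannot fill in the form.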

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F17/30864, G06F17/3089, G06F16/958, G06F16/951
Inventor: 葛长忠
Owner: NEW H3C TECH CO LTD