Field information collection and association method based on website homepage content

This technology concerns domain information and website homepages. It is applied in network data indexing, network data retrieval, and unstructured text data retrieval. It addresses poor information timeliness, difficulty in locating information clusters, and invalid web pages in search results, and achieves accurate positioning of domain information.

Inactive Publication Date: 2019-08-06
AGRI INFORMATION INST OF CAS

AI Technical Summary

Problems solved by technology

Keyword-based search usually returns invalid web pages in the results. Especially when users are looking for professional information, it is very difficult to locate information clusters, and the timeliness of the information is poor.

Examples

Embodiment Construction

[0030] The present invention is described in detail below in conjunction with the implementations shown in the accompanying drawings. These implementations do not limit the present invention; equivalent transformations or substitutions of function, method, or structure made by those of ordinary skill in the art on the basis of these implementations all fall within the protection scope of the present invention.

[0031] Refer to Figure 1, which is a flow chart of the method of the present invention for collecting and associating field information based on the content of the website homepage.

[0032] This embodiment provides a method for collecting and associating field information based on the content of the homepage of the website, including:

[0033] Step S1: based on the concept space of the domain, perform domain identification of the website information on the content of the website homepage, ...
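The text of Step S1 is truncated here, so the following is only a minimal sketch of what homepage domain identification against a concept space could look like. The concept space contents, the term-matching rule, the `min_coverage` threshold, and the function names are all assumptions for illustration and are not taken from the patent.

```python
import re

# Hypothetical concept space: each domain concept maps to a set of surface terms.
# The concepts and terms are illustrative only.
CONCEPT_SPACE = {
    "crop": {"wheat", "rice", "maize"},
    "pest": {"aphid", "locust"},
    "soil": {"soil", "fertilizer"},
}


def matched_concepts(text, concept_space):
    """Return the concepts that have at least one term present in the text."""
    tokens = set(re.findall(r"[a-z]+", text.lower()))
    return {concept for concept, terms in concept_space.items() if tokens & terms}


def in_domain(text, concept_space, min_coverage=0.5):
    """Treat a homepage as in-domain when enough concepts are matched.

    min_coverage is an assumed threshold on the fraction of concepts matched.
    """
    coverage = len(matched_concepts(text, concept_space)) / len(concept_space)
    return coverage >= min_coverage


if __name__ == "__main__":
    homepage = "Rice and wheat prices; new fertilizer guide"
    print(matched_concepts(homepage, CONCEPT_SPACE))
    print(in_domain(homepage, CONCEPT_SPACE))
```

Under this assumed rule, only homepages judged in-domain would be passed on to page collection; a production system would more likely use weighted concepts and proper tokenization for the target language.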

Abstract

The invention belongs to the technical field of the Internet, and particularly relates to a field information collection and association method based on website homepage content. The method includes: performing website information field determination on website homepage content based on a field concept space, and then completing field information collection; and performing field information classification on the collected page content based on the field concept space, and then completing field information association. The method forms a field concept description based on the concept space and performs field information collection based on website homepage determination. Different website nodes form an associated network based on the concept space, so that the user can rapidly and accurately locate a required field information cluster.
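The association step described in the abstract can be pictured as linking website nodes that share concepts. The sketch below assumes each site has already been classified into a set of concepts; the site names, the shared-concept linking rule, and the function names are hypothetical and are not the patent's stated procedure.

```python
from itertools import combinations


def build_association_network(site_concepts):
    """Link every pair of sites that share at least one concept.

    site_concepts: dict mapping a site name to its set of concepts.
    Returns a dict mapping (site_a, site_b) to the set of shared concepts.
    """
    edges = {}
    for a, b in combinations(sorted(site_concepts), 2):
        shared = site_concepts[a] & site_concepts[b]
        if shared:
            edges[(a, b)] = shared
    return edges


def sites_for_concept(site_concepts, concept):
    """Return the cluster of sites associated with one concept, sorted by name."""
    return sorted(site for site, concepts in site_concepts.items() if concept in concepts)


if __name__ == "__main__":
    # Hypothetical classification results for three sites.
    classified = {
        "site-a.example": {"crop", "soil"},
        "site-b.example": {"crop", "pest"},
        "site-c.example": {"pest"},
    }
    print(build_association_network(classified))
    print(sites_for_concept(classified, "crop"))
```

Looking up a concept then yields the group of sites that cover it, which is one simple reading of locating a field information cluster.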

Description

Technical field

[0001] The invention belongs to the technical field of the Internet, and in particular relates to a method for collecting and associating field information based on the content of a website homepage.

Background art

[0002] Today's Internet contains more and more information. In particular, there are more and more websites in professional fields, holding a large amount of content-related information. However, since the information on these professional websites is composed of a large number of hypertext links, and the items may not be related to each other, it is very difficult for users to quickly locate the required field information clusters. At present, an important way to address this problem is to let search engines search based on keywords. However, this method usually leads to invalid web pages appearing in search results; especially when users are looking for professional information, it is very difficult to locate information clusters, and the time...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/951, G06F16/36, G06F16/35
CPC: G06F16/35, G06F16/367, G06F16/951
Inventor: 谢能付, 郝心宁, 孙巍, 张学福, 姜丽华
Owner: AGRI INFORMATION INST OF CAS