A Scheduling Method for Information Units in a Vertical Search Engine

Applied in network data indexing, web data retrieval using information identifiers, and network data retrieval, this vertical search engine and information unit technology addresses the difficulty of coordinating and predicting the collection cycle of an entry, achieving a good application effect and reducing download consumption.

Active Publication Date: 2017-10-31
BEIJING ZHONGSOU NETWORK TECH
7 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

An entry usually contains many information units whose update times are inconsistent, so the collection cycle of the entry is difficult to coordinate, predict, and adjust.


Examples

Embodiment Construction

[0044] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0045] The invention implements a scheduling method that uses the update unit as the unit of scheduling. The method rests on the observation that different update units of the same website have different update states and update time points, and so cannot all be updated at one unified time. If each update unit of such a site is treated as a separate information update unit, keeps its own statistics, and applies its own update strategy, each scheduling unit can be updated in a timely and effective manner while download consumption is reduced.
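The idea of scheduling each update unit on its own timetable can be illustrated with a minimal Python sketch. It is not the patented implementation: the `UpdateUnit` class, the `reschedule` policy (halve the interval after a detected change, lengthen it by 50% otherwise), the interval bounds, and the `fetch_changed` callback are all assumptions introduced here for illustration.

```python
import heapq
from dataclasses import dataclass, field

# Hypothetical bounds on how often a single unit may be revisited (seconds).
MIN_INTERVAL = 60.0
MAX_INTERVAL = 86400.0


@dataclass(order=True)
class UpdateUnit:
    """One independently scheduled unit; ordered by its next due time only."""
    next_due: float
    url: str = field(compare=False)
    interval: float = field(compare=False, default=3600.0)


def reschedule(unit, now, changed):
    """Assumed policy: visit sooner after a change, later after no change."""
    if changed:
        unit.interval = max(MIN_INTERVAL, unit.interval / 2)
    else:
        unit.interval = min(MAX_INTERVAL, unit.interval * 1.5)
    unit.next_due = now + unit.interval
    return unit


def run_due(queue, now, fetch_changed):
    """Process every unit due at `now` and return the URLs that were visited.

    `fetch_changed(url)` is a caller-supplied function that downloads the
    unit and reports whether its content changed since the last visit.
    """
    visited = []
    while queue and queue[0].next_due <= now:
        unit = heapq.heappop(queue)
        changed = fetch_changed(unit.url)
        visited.append(unit.url)
        heapq.heappush(queue, reschedule(unit, now, changed))
    return visited
```

Because the heap is keyed on each unit's own `next_due`, a rarely changing unit is not downloaded just because a sibling unit on the same website is due, which is where the saving in download consumption would come from under these assumptions.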

[0046] Several concepts used in the present invention:

[0047] Entrance: the starting page of a website, from which the website's information can be traversed. The scheduling module starts periodic scheduling from the entrance, and the extraction module will return ...

Abstract

The invention provides a scheduling method for information units in a vertical search engine. The method is based on a collection and scheduling system and comprises the following steps: a scheduling module initiates scheduling of an entrance domain name; an extraction module identifies the type of each extracted second-level domain and marks it; the scheduling module receives the extracted second-level domain and identifies the mark; it is judged whether the identified information units have been updated; and the domain information and the history update records of the information units are added or updated in an update unit page. From the history update records, the time point of the next update is predicted, and information unit scheduling is performed at that time point. The invention exploits the characteristics of information units in vertical search and works better on websites whose information units differ greatly in update cycle and update time point.
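The abstract does not say how the next update time is predicted from the history update records. As one plausible reading, the Python sketch below adds the mean gap between past observed updates to the most recent update time. The function name `predict_next_update`, the `default_interval` fallback, and the choice of a plain mean are assumptions made here, not details taken from the patent.

```python
from statistics import mean


def predict_next_update(history, default_interval=3600.0):
    """Estimate when an information unit will next be updated.

    `history` is a list of timestamps (seconds, ascending) at which updates
    of this unit were observed. The estimate is the last observed update
    plus the mean gap between consecutive updates (an assumed heuristic).
    """
    if not history:
        raise ValueError("history must contain at least one timestamp")
    if len(history) < 2:
        # Too little data to measure a gap: fall back to a fixed interval.
        return history[-1] + default_interval
    gaps = [later - earlier for earlier, later in zip(history, history[1:])]
    return history[-1] + mean(gaps)


# Updates seen at t=0, 100 and 300: the gaps are 100 and 200, the mean is 150.
print(predict_next_update([0, 100, 300]))  # 450
```

A scheduler would then queue the unit for collection at the returned time and append a new timestamp to `history` whenever that visit finds the unit changed.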

Description

Technical field

[0001] The invention relates to a method for scheduling network information, in particular to a method for scheduling information units in a vertical search engine.

Background art

[0002] Users now have many personalized search requirements. Such requirements are generally specific in scope and demand high data quality. Search providers have therefore launched vertical searches aimed at specific directions, such as news search, video search, music search, Weibo search, and novel search. These vertical channels share some obvious features: 1. The data type is consistent and the sources are very narrow, with almost all data obtained by directional crawling; 2. The data are highly time-sensitive and should be included in the system as soon as possible; 3. The data need to be continuously updated; 4. The activity of data updates varies greatly.

[0003] With these specific requirements, for the collection syste...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/30
CPC: G06F16/951, G06F16/955
Inventor: 齐彦杰
Owner: BEIJING ZHONGSOU NETWORK TECH