Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Information capture method and device

A technology of target information and statistical information, applied in the field of information capture methods and devices, can solve the problems of non-real-time information capture and large resource consumption, and achieve the effects of real-time, efficient management, and high monitoring

Active Publication Date: 2019-06-11
北京百分点科技集团股份有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The embodiment of the present invention provides an information capture method and device to solve the defects of non-real-time information capture and large resource consumption in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Information capture method and device
  • Information capture method and device
  • Information capture method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044]In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0045] The core of the present invention is to realize the procedural monitoring of the crawling system by modifying and tracking the marked fields of the database; realize real-time monitoring through the deep crawling of the list page and frequency update; Valuable data for processing and analysis. The various embodiments of the present inventio...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention provides an information capturing method and device. The information capturing method comprises the following steps: counting an information website list, and saving a list page corresponding to an information website in a list page database in a first database, wherein contrasting relation between the information website and a corresponding URL address is saved in the list page; reading contents of the list page from the first database, capturing detail page link addresses conform to a default capturing strategy, and saving the captured detail page link addresses in a detail page database in the first database; allocating the detail page link addresses to different capturing machines for capturing, and saving captured webpage detail data in a second database; and capturing corresponding webpage detail data from the second database according to a database status code in the first database, extracting a target field, and saving the target field in a target format. According to the invention, information can be captured in a real-time, efficient and intelligent manner.

Description

technical field [0001] Embodiments of the present invention relate to the field of information technology, and in particular, to an information capture method and device. Background technique [0002] Information scraping is a process of grabbing unstructured information from a website and storing it in a structured database. Information capture is the foundation and the first step of enterprise informatization. Only by using advanced technology to complete the information capture work can it bring the greatest value to informatization. [0003] Information capture is mainly used in the following aspects: key information capture: access to various professional information databases on the Internet; competitive intelligence system: monitor the market information of itself and competitors on network media through keywords; Content management: Accurately obtain external content in batches and automate processing; Database marketing: Extract message information and contact info...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/951G06F16/953G06F16/955
CPCG06F16/951
Inventor 杜晓梦刘钰骆永健党拓张扬吴昊谭树国张建枝李红梅谢靖鹏
Owner 北京百分点科技集团股份有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products