Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method for searching by utilizing automatic structured crawler in e-commerce platform

An automatic structure and e-commerce platform technology, applied in the Internet field, can solve the problems of inconvenient search and data collection, and achieve the effect of fast collection and arrangement, convenient collection of information, and small memory usage

Pending Publication Date: 2020-12-15
广东赛博威信息科技有限公司
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In view of this, one of the purposes of the present invention is to provide a method for using automatic structured crawler search in an e-commerce platform, which solves the technical problem of inconvenient search and collection of data on the e-commerce platform in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for searching by utilizing automatic structured crawler in e-commerce platform
  • Method for searching by utilizing automatic structured crawler in e-commerce platform

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0036] Embodiment 1: A method for utilizing automatic structured crawler search in an e-commerce platform, such as Figure 1-2 shown, including the following steps:

[0037] S1. Determine the search topic A, determine the link set B, B={b1, b2...bn}, b1, b2...bn represent different links, and b1, b2...bn are all related to the search topic A, n> 0, n is a natural number, store the link set B, and put the link set B into the cache queue. Putting the link set B into the cache queue can make the system operation process faster. According to the analysis algorithm of the corresponding webpage designed in advance, analyze and filter out some Links that are not related to the search topic A, store valid links and put them in the cache queue to be crawled;

[0038] S2. Determine the type C of the webpage opened by each link in the link set B respectively. The type C of the webpage is divided into a static webpage and a dynamic webpage. If it is a static webpage, the link is marked C...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for searching by using an automatic structured crawler in an e-commerce platform, which comprises the following steps of: S1, determining a search theme A, determininga link set B, storing the link set B, and putting the link set B into a cache queue; S2, respectively determining a type C of each link opening webpage in the link set B, if the webpage is a static webpage, marking the link as C = 0, and if the webpage is a dynamic webpage, marking the link as C = 1; S3, capturing the link bk by adopting a specific strategy, and obtaining webpage information contained in the link bk; S4, storing the captured link bk and webpage information contained in the link bk; Big data of the e-commerce platform is sorted and collected based on the Internet, rapid searchof the data is achieved, and the technical problem that in the prior art, search and data collection of the e-commerce platform are inconvenient is solved.

Description

technical field [0001] The invention relates to the field of Internet technology, in particular to a method for using automatic structured crawler search in an e-commerce platform. Background technique [0002] Now, there is a lot of information on the Internet. The entire Internet is like a huge and directed spider web, each web page is like a node in the spider web, and each web page has addresses pointing to other web pages. So when a crawler crawls a web page, it will use a directed traversal algorithm for traversal. The current e-commerce platforms, such as JD.com, Taobao.com, Pinduoduo, Suning.com, etc., are cumbersome to search and collect when various valuable data are needed. Especially when using a certain topic to search and collect information, the steps are relatively cumbersome, manual operations are frequently required, and it is impossible to automatically search and collect information. [0003] Therefore, it is necessary to improve the prior art to solve...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/951G06F16/953G06F16/955
CPCG06F16/951G06F16/953G06F16/955
Inventor 刘勇勤吴肖峻蓝文广邓铭武
Owner 广东赛博威信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products