Method for crawling BitTorrent torrent files

A torrent file crawling technique, applied in the field of computer networks, that addresses the ineffective tracking of private BT servers, yielding more comprehensive retrieval results, richer torrent resources, and enhanced crawler functionality.

Inactive Publication Date: 2011-12-21
PEKING UNIV

AI Technical Summary

Problems solved by technology

[0005] Existing BitTorrent torrent crawlers cannot effectively track private BT servers, which are highly dynamic. The purpose of the present invention is to provide a method for crawling BitTorrent torrent files that discovers private BT servers promptly and directs crawlers to download their torrents.



Embodiment Construction

[0040] A specific example is now used to illustrate how the scheme is implemented.

[0041] The hardware environment of the system is shown in Figure 3. The crawler system's working environment comprises two LANs: LAN 1, built on network switch 1, can access the Internet; LAN 2, built on network switch 2, is an internal network with no Internet access. A detection host, connected to LAN 1, is dedicated to running the detection module software. A crawler host runs the crawler module software and the torrent file parser; it has two network cards, one connected to LAN 1 and the other to LAN 2. The file storage server is an NFS server that serves as the torrent file library and is connected to LAN 2; the crawler host mounts the NFS server's shared directory in order to write torrent files. Note that, as mentioned above, in addition to using NFS file storage services, the torrent file library can ...
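As a rough illustration of the crawler host writing torrent files into the mounted shared directory, the following Python sketch stores raw torrent bytes under a library directory. The function name `store_torrent`, the content-hash file naming, and the write-then-rename step are assumptions made for this example; the patent text does not specify them. Any directory path can stand in for the NFS mount point.

```python
import hashlib
from pathlib import Path


def store_torrent(data: bytes, library_dir: Path) -> Path:
    """Write raw torrent bytes into the library directory and return the path.

    Assumption (not from the patent): the file is named after the SHA-1 of
    its content, so the same torrent fetched twice is stored only once.
    """
    library_dir.mkdir(parents=True, exist_ok=True)
    target = library_dir / (hashlib.sha1(data).hexdigest() + ".torrent")
    if not target.exists():
        # Write under a temporary name first, then rename, so that a reader
        # of the library does not pick up a half-written file.
        tmp = target.with_suffix(".part")
        tmp.write_bytes(data)
        tmp.replace(target)
    return target
```

Naming files by content hash is one simple way to avoid duplicates when several release pages link to the same torrent; a database-backed library would need a different approach.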


Abstract

The invention relates to a method for crawling BitTorrent torrent files and belongs to the field of computer networks. The method comprises the following steps: 1, according to preset characteristic keywords of BT servers, a detection module calls a search engine interface to search for BT release websites and sends the addresses of the release web pages to a crawler module; 2, the crawler module downloads the web pages at the received addresses; 3, the crawler module parses the downloaded web pages to obtain the addresses of torrent files and downloads those torrent files into a torrent file library; and 4, a torrent file parser parses the torrent files to obtain the address of the index server, converts that address into release web page addresses, and sends them to the crawler module, after which steps 2 to 4 are repeated. Compared with the prior art, the method crawls a more complete and abundant set of torrent resources and greatly increases the torrent resources in the torrent file library.
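The loop formed by steps 2 to 4 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: `seed_pages` stands in for the addresses produced by the detection module in step 1, the callables `fetch_page`, `fetch_torrent`, and `tracker_of` are hypothetical hooks supplied by the caller, and `release_page_for` assumes that the release site lives at the root of the tracker's host, which the abstract does not state.

```python
from collections import deque
from html.parser import HTMLParser
from typing import Callable, Iterable, List, Optional, Set
from urllib.parse import urljoin, urlsplit


class _LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag fed to the parser."""

    def __init__(self) -> None:
        super().__init__()
        self.hrefs: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def torrent_links(page_url: str, html: str) -> List[str]:
    """Step 3: absolute URLs of links on the page whose path ends in .torrent."""
    collector = _LinkCollector()
    collector.feed(html)
    urls = (urljoin(page_url, href) for href in collector.hrefs)
    return [u for u in urls if urlsplit(u).path.lower().endswith(".torrent")]


def release_page_for(tracker_url: str) -> Optional[str]:
    """Step 4: map a tracker address to a candidate release page.

    Assumption: the release site is served from the root of the tracker's host.
    """
    host = urlsplit(tracker_url).hostname
    return f"http://{host}/" if host else None


def crawl(
    seed_pages: Iterable[str],
    fetch_page: Callable[[str], str],
    fetch_torrent: Callable[[str], bytes],
    tracker_of: Callable[[bytes], Optional[str]],
    max_pages: int = 100,
) -> Set[str]:
    """Repeat steps 2 to 4 until no new pages remain or max_pages is reached.

    Returns the set of torrent file URLs that were discovered.
    """
    queue = deque(seed_pages)
    seen_pages: Set[str] = set()
    torrents: Set[str] = set()
    while queue and len(seen_pages) < max_pages:
        page = queue.popleft()
        if page in seen_pages:
            continue
        seen_pages.add(page)
        html = fetch_page(page)                    # step 2: download the page
        for link in torrent_links(page, html):     # step 3: find torrent files
            if link in torrents:
                continue
            torrents.add(link)
            tracker = tracker_of(fetch_torrent(link))   # step 4: read tracker
            nxt = release_page_for(tracker) if tracker else None
            if nxt and nxt not in seen_pages:
                queue.append(nxt)                  # feed back into step 2
    return torrents
```

Passing the network and parsing functions in as arguments keeps the loop itself independent of any particular HTTP client or torrent parser, and the `max_pages` bound is an added safeguard against unbounded crawling.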

Description

Technical field

[0001] The invention relates to a method for crawling BitTorrent torrent files that can quickly and effectively discover and download BitTorrent torrents, and belongs to the field of computer networks.

Background technique

[0002] Since the emergence of Napster in 1999, P2P file sharing systems have continuously innovated and developed different transfer protocols, including BitTorrent, eDonkey, and Gnutella. Their core idea is to make full use of the upload bandwidth of downloaders (generally called peers), so that each peer uploads the parts it has already downloaded to other peers while it is still downloading. Taking the BitTorrent protocol as an example, a BitTorrent torrent file records the name and size of the content file and the address of the index server. Users can find the index server (or tracker server) through the torrent file, then find the peers holding the content file, establish connections with them and download the data...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30, H04L29/08
Inventor: 宋维佳, 马皓, 张建宇, 张缘, 杨加, 张蓓, 周渊
Owner: PEKING UNIV