Unlock instant, AI-driven research and patent intelligence for your innovation.

A method, training method and device for preventing web crawlers from stealing private data

A web crawler, privacy data technology, applied in the field of data security, can solve problems such as data leakage, achieve high accuracy and recall rate, reduce loss, and prevent theft of private data.

Active Publication Date: 2022-04-08
ALIPAY (HANGZHOU) INFORMATION TECH CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The sensitive personal information of these users is stored in the crawling company, which can easily lead to large-scale data leakage

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method, training method and device for preventing web crawlers from stealing private data
  • A method, training method and device for preventing web crawlers from stealing private data
  • A method, training method and device for preventing web crawlers from stealing private data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] In order to enable those skilled in the art to better understand the technical solutions in this specification, the technical solutions in the embodiments of this specification will be clearly and completely described below in conjunction with the drawings in the embodiments of this specification. Obviously, the described The embodiments are only some of the embodiments in this specification, not all of them. Based on the embodiments in this specification, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the protection scope of this specification.

[0050] At present, many data companies use web crawlers to steal users' private data. Even if this process is authorized by the user (in many cases, the user is unconsciously authorized), there is still the problem of excessive collection. And these data are crawled to companies to steal users' sensitive information for use, and large-scale data leakage is pro...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of this specification provides a method, training method and device for preventing web crawlers from stealing private data. The method for preventing web crawlers from stealing private data includes: extracting the application program interface API access records of the target client within a preset time period from the network traffic data of the target client. Data to be identified is generated based on the API access records of the target client, and the data to be identified includes a two-dimensional map of the API access of the target client within a preset time period with time and API access volume as dimensions. Input the data to be identified into the web crawler recognition model to obtain the network recognition result of the target client. The web crawler recognition model is trained based on the sample data and the web crawler classification labels of the sample data. The sample data includes time and API A two-dimensional graph of API visits by sample users within a preset time period with the number of visits as the dimension. Implement privacy data protection measures that match the web crawler identification results on the target client.

Description

technical field [0001] This document relates to the technical field of data security, in particular to a method, training method and device for preventing web crawlers from stealing private data. Background technique [0002] While Internet companies provide services to users, they also provide opportunities for information crawling. Web crawlers only need to write automated scripts, and under the conscious or unconscious authorization of users, they can excessively collect users' private data in various Internet companies. The sensitive personal information of these users is stored in the crawling company, which can easily lead to large-scale data leakage. [0003] Therefore, there is an urgent need for a technical solution that can automatically identify web crawlers and prevent the web crawlers from stealing private data. Contents of the invention [0004] The purpose of the embodiments of this specification is to provide a method for preventing web crawlers from stea...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04L9/40G06F21/62G06F16/951
CPCH04L63/1425H04L63/1441G06F16/951G06F21/6263
Inventor 宗志远
Owner ALIPAY (HANGZHOU) INFORMATION TECH CO LTD