Association rule mining-based method for determining public webpage involving personal information, electronic equipment and storage medium
A determination method and information network technology, applied in digital data information retrieval, network data retrieval, network data indexing and other directions, can solve problems such as low work efficiency and personal information leakage, and achieve the effect of reducing the risk of violations of laws and regulations
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0062] Such as figure 1 As shown, this embodiment 1 provides a method for judging webpages involving personal information disclosure based on association rule mining, including the following steps:
[0063] Step S1, crawling the announcement webpages to form a webpage collection W, and manually marking to form a personal information webpage collection WP and a non-personal information webpage collection WN.
[0064] Step S11, crawling all the announcement webpages on the website, analyzing the TITLE of the webpage, the content of the webpage, the name of the attachment and the content of the attachment, and forming a set W of webpages containing webpage elements that are helpful to personal information in the announcement content;
[0065] W={Webpage 1 ,Webpage 2 ,Webpage 3 ...} (1)
[0066] In formula (1), Webpage is a connection string of a certain webpage's TITLE, webpage content, attachment name and attachment content webpage elements.
[0067] Step S12: Carry out man...
Embodiment 2
[0102] Embodiment 2 of the present application provides an electronic device in the form of a general-purpose computing device. Components of an electronic device may include, but are not limited to: one or more processors or processing units, memory for storing computer programs that can run on the processors, connections to different system components (including memory, one or more processors or processing unit) bus.
[0103] Wherein, when the one or more processors or processing units are used to run the computer program, execute the steps of the method described in Embodiment 1. The types of processors used include central processing units, general purpose processors, digital signal processors, application specific integrated circuits, field programmable gate arrays or other programmable logic devices, transistor logic devices, hardware components or any combination thereof.
[0104] Wherein, the bus refers to one or more of several types of bus structures, including a me...
Embodiment 3
[0106] Embodiment 3 of the present application provides a storage medium on which a computer program is stored, and when the computer program is executed by a processor, the steps of the method described in Embodiment 1 are implemented.
[0107] It should be noted that the storage medium shown in this application may be a computer-readable signal medium or a storage medium or any combination of the above two. The storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or any combination thereof. More specific examples of storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), fiber optics, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage dev...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com