A method and device for determining junk text information in a page

A text information and garbage technology, applied in the Internet field, can solve problems such as the inability to detect junk text information, affect the security and efficiency of information acquisition by users, and reduce the user's search and browsing experience, so as to improve the search and browsing experience and improve Safety and efficiency of obtaining information, and the effect of accurate cheating information

Active Publication Date: 2017-11-03
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When the site corresponding to the search result may have a security risk, the search engine / browser will remind the user of the security of the site, such as prompting the user of the possible security risk of the site. However, usually not all pages in the site All have security risks, but some information in some pages has security risks. For example, when the site has no security risks but some pages contain spam information, the security risk prompt with the site as the coarse grain cannot detect the page Spam text information in the website, which affects the security and efficiency of information acquisition by users, and reduces the user's search and browsing experience

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and device for determining junk text information in a page
  • A method and device for determining junk text information in a page
  • A method and device for determining junk text information in a page

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0024] figure 1 Shows a spam text determination device 1 for determining spam text information in a page according to one aspect of the present invention, wherein the spam text determination device 1 includes an acquisition device 11, a candidate determination device 12, a cheating degree determination device 13 and a garbage determination device 14. Specifically, the acquiring means 11 acquires the initial page to be processed; the candidate determining means 12 determines one or more candidate junk text information corresponding to the initial page; the cheating degree determining means 13 determines the cheating corresponding to the candidate junk text information; degree information; the spam determining means 14 determines one or more spam text information corresponding to the initial page from the one or more candidate spam text information according t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention aims to provide equipment and a method for determining junk text messages in a page. The method particularly includes: acquiring a to-be-processed initial page; determining one or more candidate junk text messages corresponding to the initial page; determining cheating degree messages corresponding to the candidate junk text messages; determining one or more junk text messages corresponding to the initial page from the one or more candidate junk text messages according to the cheating degree messages. Compared with the prior art, the equipment and the method for determining the junk text messages in the page have the advantages that by determining the cheating degree messages of the candidate junk text messages corresponding to the initial page, the junk text messages corresponding to the initial page are determined from the candidate junk text messages according to the cheating degree messages, and accordingly screening of the candidate junk text messages according to the cheating degree messages is realized, the junk text messages in the initial page can be effectively recognized, safety and efficiency in message acquisition are improved for users, and searching and browsing experiences are promoted for the users.

Description

technical field [0001] The invention relates to the technical field of the Internet, in particular to a technology for determining junk text information in a page. Background technique [0002] At present, with the development of Internet technology and the penetration of Internet applications into users' study, work and life, people increasingly obtain information through the Internet, such as expressing their needs by entering keywords in the search bar of search engines, and then obtaining corresponding search results. When the site corresponding to the search result may have a security risk, the search engine / browser will remind the user of the security of the site, such as prompting the user of the possible security risk of the site. However, usually not all pages in the site All have security risks, but some information in some pages has security risks. For example, when the site has no security risks but some pages contain spam information, the security risk prompt w...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30G06F21/60
CPCG06F16/335
Inventor 施鹏牛章鹏
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products