Junk image filtering method based on semi-supervision
A junk image (image spam) filtering method, applied in the fields of instruments, character and pattern recognition, computer components, etc. It is intended to address problems such as unresponsive text and fonts while keeping the amount of calculation small, so as to improve accuracy and efficiency and save program running time and space.
Embodiment Construction
[0038] Step 1) Initial sample selection:
[0039] Form a sample set from three sources: image spam downloaded from an image spam database shared on the Internet, image spam collected from private mailboxes, and images collected from normal mail.
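A minimal sketch of how such a sample set might be assembled, assuming the three sources have already been saved to local directories. The directory layout, the `spam`/`ham` label names, the list of image extensions, and the `build_sample_set` helper are all hypothetical and are not specified by the described method.

```python
from pathlib import Path

# Hypothetical set of file extensions treated as images.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".gif", ".bmp"}


def build_sample_set(spam_dirs, ham_dirs):
    """Return a sorted list of (image_path, label) pairs.

    spam_dirs: directories holding image spam (shared database, private mailboxes).
    ham_dirs:  directories holding images taken from normal mail.
    """
    samples = []
    for label, dirs in (("spam", spam_dirs), ("ham", ham_dirs)):
        for directory in dirs:
            for path in Path(directory).rglob("*"):
                # Keep only regular files that look like images.
                if path.is_file() and path.suffix.lower() in IMAGE_SUFFIXES:
                    samples.append((path, label))
    return sorted(samples)
```

Keeping the label next to each path from the start makes it straightforward to route later outputs into a junk folder and a normal folder.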
[0040] Step 2) Text feature extraction:
[0041] Step 2.1) Use optical character recognition (OCR) to batch-process the sample images and obtain the text of each image as its text feature.
[0042] Step 2.2) Save the text extraction results of step 2.1): write the text of each image to its own .txt file, and place each file in the junk image folder or the normal image folder according to the class of its image.
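Steps 2.1) and 2.2) can be sketched roughly as follows. The source does not name an OCR engine, so this sketch takes the OCR step as a caller-supplied function `ocr` that maps an image path to a string; `save_ocr_text` and the output file naming are likewise assumptions made for illustration.

```python
from pathlib import Path


def save_ocr_text(image_paths, out_dir, ocr):
    """Run `ocr` on each image and write the text to <out_dir>/<image name>.txt.

    `ocr` is any callable taking an image path and returning a string
    (an assumption of this sketch, not something the source specifies).
    Returns the list of .txt paths that were written.
    """
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for image_path in image_paths:
        image_path = Path(image_path)
        text = ocr(image_path)
        # Keep the full image name so a.jpg and a.png do not collide.
        txt_path = out_dir / (image_path.name + ".txt")
        txt_path.write_text(text, encoding="utf-8")
        written.append(txt_path)
    return written
```

The function would be called once with the junk images and the junk image folder, and once with the normal images and the normal image folder. If Tesseract and the `pytesseract` package happen to be installed, `ocr` could be something like `lambda p: pytesseract.image_to_string(Image.open(p))`, though any OCR backend fits the same shape.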
[0043] Step 2.3) Use the Waikato Environment for Knowledge Analysis (WEKA) to convert the results of step 2.2) into an .arff file. In each line of the file, the first column holds the text of one image and the second column holds the label of that image; together they serve as the text feature vector of the image.
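For illustration, the layout described in step 2.3) (text in the first column, label in the second) could be produced outside WEKA with a short script like the one below. This is a sketch under assumptions: the relation name, the attribute names `text` and `class`, the label names, and the helpers `arff_quote` and `write_arff` are hypothetical, and the source performs this conversion with WEKA itself.

```python
from pathlib import Path


def arff_quote(text):
    """Collapse whitespace and quote a string for use in an ARFF data row."""
    flat = " ".join(text.split())
    return "'" + flat.replace("\\", "\\\\").replace("'", "\\'") + "'"


def write_arff(class_dirs, arff_path, relation="image_text"):
    """Write one data row per .txt file: quoted text first, class label second.

    class_dirs maps a label (e.g. "spam") to the folder of .txt files of that class.
    """
    labels = list(class_dirs)
    lines = [
        "@relation " + relation,
        "",
        "@attribute text string",
        "@attribute class {" + ",".join(labels) + "}",
        "",
        "@data",
    ]
    for label in labels:
        for txt in sorted(Path(class_dirs[label]).glob("*.txt")):
            content = txt.read_text(encoding="utf-8", errors="replace")
            lines.append(arff_quote(content) + "," + label)
    Path(arff_path).write_text("\n".join(lines) + "\n", encoding="utf-8")
```

Collapsing whitespace keeps each image's text on a single row, which matches the one-image-per-line layout described above.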
[0044] Step 3) Use the R-value feature selection method to rank the f...