Method for filtering Chinese junk mail based on Logistic regression
A logistic regression and spam filtering technology, applied in electrical components, transmission systems, office automation, etc., can solve problems not involved in Chinese spam filtering methods, improve operating efficiency and classification effect, reduce size, and avoid limitations sexual effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0026] Main principle of the present invention is as follows:
[0027] 1) In the preprocessing stage of emails, including email parsing and word segmentation process. Use JavaMail to extract the title, text content of the body text, and the attachments, pictures, audio, video and other information contained in the email; segment the non-Chinese text according to natural segmentation marks such as punctuation and spaces, and use the maximum matching method to analyze the Chinese text Text is segmented.
[0028] 2) At the feature level, all the words in the email sample set form a feature space, and each email can be mapped into a vector of the feature space; an improved feature value calculation method is adopted, and a weight factor is introduced to reflect the text features of the email ; Using word frequency as the feature selection basis to implement dimensionality reduction, reducing the size of the feature space.
[0029] 3) At the model level, Logistic is used for trai...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com