Spam comment detection method based on similarity calculation
A spam comment detection technique based on similarity calculation, applicable to computing, unstructured text data retrieval, natural language data processing, and related fields. It addresses problems such as the uneven quality of user comment text, increased information mining costs, and comments that are confusing or even misleading.
Embodiment Construction
[0046] The present invention will be further described below in conjunction with specific drawings.
[0047] The invention takes user comments on online platforms such as forums and e-commerce sites as its research object. It aims to detect spam comments among online comments, improve the quality of comment text, and reduce the cost of automatic mining tools.
[0048] The spam review detection method based on similarity calculation comprises five steps: data acquisition, false review detection, duplicate review detection, product feature dictionary construction, and irrelevant review detection, as shown in Figure 1. The five steps are described in detail below.
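As a rough illustration of what a similarity calculation could look like in the duplicate review detection step, the sketch below compares reviews pairwise. It is only a sketch under stated assumptions: this excerpt does not say which similarity measure the invention uses, so Jaccard similarity over character bigrams is swapped in here, and the names `char_ngrams`, `jaccard`, `find_duplicates` and the 0.8 threshold are all hypothetical.

```python
def char_ngrams(text, n=2):
    """Return the set of character n-grams of a lowercased string with whitespace removed."""
    cleaned = "".join(text.lower().split())
    if len(cleaned) < n:
        return {cleaned} if cleaned else set()
    return {cleaned[i:i + n] for i in range(len(cleaned) - n + 1)}


def jaccard(a, b):
    """Jaccard similarity of two strings over character bigrams (assumed measure)."""
    grams_a, grams_b = char_ngrams(a), char_ngrams(b)
    if not grams_a and not grams_b:
        return 1.0  # two empty strings are treated as identical
    return len(grams_a & grams_b) / len(grams_a | grams_b)


def find_duplicates(reviews, threshold=0.8):
    """Return index pairs (i, j) with i < j whose similarity reaches the threshold.

    The threshold is a hypothetical value chosen for illustration.
    """
    pairs = []
    for i in range(len(reviews)):
        for j in range(i + 1, len(reviews)):
            if jaccard(reviews[i], reviews[j]) >= threshold:
                pairs.append((i, j))
    return pairs


if __name__ == "__main__":
    sample = [
        "Great phone, battery lasts long",
        "great phone,  battery lasts long",
        "Terrible screen",
    ]
    print(find_duplicates(sample))
```

The pairwise loop is quadratic in the number of reviews, which is fine for a small sample; a larger crawl would likely need some form of indexing or hashing before comparison.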
[0049] 1. Data acquisition: Use web crawlers to crawl forum and e-commerce webpages related to the specified product, then extract the comment data from those webpages and save it to the database.
[0050] The data acquisition process is shown in Figure 2. First, call the Baidu search interface to search fo...
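The extract-and-save part of this step can be sketched with the Python standard library alone. This is a minimal sketch, not the invention's implementation: the crawling itself is left out, and the assumption that each comment sits in an element whose class list contains `comment`, the table layout, and the names `CommentExtractor`, `extract_comments` and `save_comments` are all hypothetical.

```python
import sqlite3
from html.parser import HTMLParser

# Void elements have no closing tag, so they must not affect nesting depth.
VOID_TAGS = {"br", "img", "hr", "input", "meta", "link"}


class CommentExtractor(HTMLParser):
    """Collect text inside elements whose class list contains 'comment' (assumed markup)."""

    def __init__(self):
        super().__init__()
        self.comments = []
        self._depth = 0  # greater than 0 while inside a comment element
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            if self._depth > 0:
                self._buffer.append(" ")  # treat a line break as a space
            return
        if self._depth > 0:
            self._depth += 1
        elif "comment" in (dict(attrs).get("class") or "").split():
            self._depth = 1
            self._buffer = []

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self._depth == 0:
            return
        self._depth -= 1
        if self._depth == 0:
            text = " ".join("".join(self._buffer).split())
            if text:
                self.comments.append(text)

    def handle_data(self, data):
        if self._depth > 0:
            self._buffer.append(data)


def extract_comments(html):
    """Return the comment strings found in one already-downloaded page."""
    parser = CommentExtractor()
    parser.feed(html)
    parser.close()
    return parser.comments


def save_comments(conn, product, comments):
    """Store comments in a hypothetical 'comments' table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS comments ("
        "id INTEGER PRIMARY KEY, product TEXT NOT NULL, body TEXT NOT NULL)"
    )
    conn.executemany(
        "INSERT INTO comments (product, body) VALUES (?, ?)",
        [(product, body) for body in comments],
    )
    conn.commit()
```

A caller would pass each downloaded page to `extract_comments` and hand the result to `save_comments` together with a connection such as `sqlite3.connect(":memory:")`. Real forum and e-commerce pages differ widely in markup, so the class-based selector would need to be adapted per site.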