Web forum information extraction system
An information extraction and forum technology, applied in the field of Web information processing, can solve problems such as low accuracy, inability to meet practical applications, and a large number of manual participation, and achieve the effects of high accuracy, cost reduction, and strong versatility
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0021] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.
[0022] Such as figure 1 Shown, system structure of the present invention comprises following module:
[0023] Web forum webpage collection module 101, for automatically downloading forum webpage according to the forum site and corresponding section specified by the user, this collection module needs to utilize the content extracted in the extraction module; webpage analysis module 102, for cleaning the webpage content , making it meet the HTML specification and parsing the webpage to form the Document Object Model (DOM) of the webpage; the online extraction module 103 is used to extract the specified information in the webpage according to the structural characteristics of the forum webpage and the characteristics and statistical laws of the information to be extracted ; The database storage module 104 is used to store the extracted content in the data...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com