Distributed acquisition method and system oriented to user generated content
A user-oriented, collection method technology, applied in the transmission system, special data processing applications, instruments, etc., can solve the problems of not paying attention to efficiency, high real-time requirements, and unable to meet the diverse page collection requirements of news certification and early warning, and achieve improvement Real-time, fast collection effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0032] figure 1 A frame diagram of a UGC news distributed collection system according to an embodiment of the present invention is shown, including: a thread preprocessing module, a collection entity selection module, a collection cluster, a storage management module, a login management module and an anti-blocking management module. These modules are introduced separately below.
[0033] 1. Clue preprocessing module
[0034] The clue preprocessing module is used for preprocessing the collected clues. The collection clues include a short description or phrase of the news, the possible start time and end time of the news, etc. It contains various news elements, but is often not suitable as an input for subsequent data processing directly. Therefore, the clue preprocessing module performs word segmentation, keyword extraction, invalid word filtering, semantic entity recognition and other preprocessing on the collected clues to extract the news elements. These news elements wi...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com