Microblog event abstract extracting method based on multiple storylines
An extraction method and story line technology, applied in file management systems, special data processing applications, instruments, etc., can solve problems such as inability to describe the development and evolution of events, and achieve the effect of reducing complexity
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0066] In order to illustrate the working process of the method in detail, the specific process of the present invention will be introduced below in combination with specific examples.
[0067] Step 1. Microblog corpus preprocessing
[0068] There are 43,152 microblog event corpus about the Qingdao explosion, and each microblog contains the sending time of the microblog. Use the public tokenizer to segment the corpus and remove punctuation marks. Microblogs with less than 5 words after word segmentation are removed. For the remaining microblogs in the corpus, obtain their time information and number the microblogs. Information such as Weibo number, Weibo content, and Weibo release time are stored in the dictionary database. Afterwards, the content of the microblog and the publishing time of the microblog can be quickly obtained through the microblog number.
[0069] Step 2. Weibo vectorization
[0070]Use word embedding technology to vectorize the words after word segment...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com