A method for extracting minority theme data in new media environment
A minority and new media technology, applied in unstructured text data retrieval, text database browsing/visualization, semantic tool creation, etc., can solve problems such as efficiency bottlenecks, rare etymology, strong professionalism, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0050] Embodiment: An example of extracting Tibetan data from "Sina Weibo".
[0051] Step 1: Preprocessing
[0052] Firstly, the microblog data is obtained from the "Sina Weibo" platform, and the single microblog data is shown in Table 1.
[0053] Table 1 Weibo data example
[0054]
[0055] For the convenience of description, additional information items will be included in the description of the following data extraction A i Hidden, so the obtained Sina Weibo data contains 5 Weibo data a 1~ a 5, as shown in Table 2.
[0056] Table 2 Sina Weibo Data
[0057]
[0058] Then, for the text part of Weibo data T i Perform word segmentation processing, select word segmentation tools, support custom dictionaries and stop words, and introduce Tibetan domain knowledge Z ={, , , , , }, add vocabulary in the Tibetan field to the word segmentation tool dictionary, and record the word segmentation results as Seg_T i ,as shown in Table 3.
[0059] Table 3 Segmentation res...
PUM

Abstract
Description
Claims
Application Information

- R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com