Method and device for classifying topics in online communities
A technique for classifying topics in network communities, applied in the field of data processing. It addresses problems such as data imbalance and inaccurate data classification, and thereby improves on low classification accuracy.
Examples
Embodiment 1
[0056] According to an embodiment of the present invention, a method for classifying topics in an online community is provided. As shown in Figure 1, the method includes:
[0057] Step 101: Collect topic corpora from the online community and determine the corresponding category labels; preprocess the collected topic corpora to form a sample set;
[0058] According to an embodiment of the present invention, collecting the topic corpus of the online community and determining the corresponding category labels includes: crawling the content of each topic in each section of the online community with a web crawler and using the crawled content as the topic corpus; establishing a correspondence between each section's serial number and a category in the classification system; and determining the category label of each topic corpus according to that correspondence. The topic content includes: topic title, topic text, topic release time, top...
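The labeling step in [0058] can be illustrated with a minimal Python sketch. The section numbers, category names, and the `section_id` field are hypothetical and chosen only for illustration; the crawler itself is omitted and the topics are assumed to be already fetched.

```python
from collections import Counter

# Hypothetical correspondence between section serial numbers and categories.
# Several sections may map to the same category in the classification system.
SECTION_TO_CATEGORY = {
    1: "technology",
    2: "sports",
    3: "technology",
}


def label_topics(topics, section_to_category):
    """Attach a category label to each crawled topic using its section number.

    Topics whose section has no entry in the mapping are skipped.
    """
    labeled = []
    for topic in topics:
        category = section_to_category.get(topic["section_id"])
        if category is None:
            continue
        labeled.append({**topic, "category": category})
    return labeled


# Made-up topics standing in for crawler output.
topics = [
    {"section_id": 1, "title": "New GPU released", "text": "..."},
    {"section_id": 2, "title": "Cup final tonight", "text": "..."},
    {"section_id": 3, "title": "Compiler tips", "text": "..."},
    {"section_id": 9, "title": "Unmapped section", "text": "..."},
]

labeled = label_topics(topics, SECTION_TO_CATEGORY)

# Per-category counts make any imbalance in the sample set visible.
class_counts = Counter(t["category"] for t in labeled)
```

Counting labels per category at this stage is one simple way to see the data imbalance that the later cost-sensitive step is meant to address.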
Embodiment 2
[0107] According to an embodiment of the present invention, a device for classifying topics in an online community is provided. As shown in Figure 2, the device includes:
[0108] A collection module 201, configured to collect topic corpora from the online community and determine the corresponding category labels;
[0109] A preprocessing module 202, configured to preprocess the topic corpora collected by the collection module 201 to form a sample set;
[0110] A construction module 203, configured to construct a cost-sensitive matrix for misclassification of the sample set obtained by the preprocessing module 202, according to the category labels determined by the collection module 201 and the Naive Bayes algorithm;
[0111] A training module 204, configured to train on the sample set obtained by the preprocessing module 202, based on the cost-sensitive matrix constructed by the construction module 203, to obtain a classifier;
[0112] A classification module 205, configured to use the classifier ob...
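How modules 203 to 205 might fit together can be sketched in Python as follows. The excerpt does not say how the cost-sensitive matrix is computed, so this sketch makes two assumptions: misclassification cost is the inverse of the true class's prior (rarer classes cost more to miss), and classification picks the label with minimum expected cost. All function names are hypothetical.

```python
import math
from collections import Counter, defaultdict


def train_nb(samples):
    """Fit a multinomial Naive Bayes model from (tokens, label) pairs."""
    class_docs = Counter(label for _, label in samples)
    token_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in samples:
        token_counts[label].update(tokens)
        vocab.update(tokens)
    total = sum(class_docs.values())
    priors = {c: n / total for c, n in class_docs.items()}
    return priors, token_counts, vocab


def posteriors(tokens, model):
    """Return normalized class posteriors with add-one (Laplace) smoothing."""
    priors, token_counts, vocab = model
    log_scores = {}
    for c, prior in priors.items():
        denom = sum(token_counts[c].values()) + len(vocab)
        score = math.log(prior)
        for t in tokens:
            if t in vocab:  # ignore tokens never seen in training
                score += math.log((token_counts[c][t] + 1) / denom)
        log_scores[c] = score
    top = max(log_scores.values())
    unnorm = {c: math.exp(s - top) for c, s in log_scores.items()}
    z = sum(unnorm.values())
    return {c: v / z for c, v in unnorm.items()}


def build_cost_matrix(priors):
    """cost[true][pred]: 0 on the diagonal, else 1 / prior(true). Assumed scheme."""
    return {
        t: {p: 0.0 if t == p else 1.0 / priors[t] for p in priors}
        for t in priors
    }


def classify(tokens, model, cost):
    """Pick the label with the lowest expected misclassification cost."""
    post = posteriors(tokens, model)
    risk = {p: sum(post[t] * cost[t][p] for t in post) for p in post}
    return min(risk, key=risk.get)


# Made-up, deliberately imbalanced sample set: 4 "tech" topics, 1 "sports".
samples = [
    (["gpu", "driver"], "tech"),
    (["cpu", "driver"], "tech"),
    (["gpu", "cpu"], "tech"),
    (["compiler", "cpu"], "tech"),
    (["match", "goal"], "sports"),
]
model = train_nb(samples)
cost = build_cost_matrix(model[0])
```

With this assumed scheme, mislabeling a rare "sports" topic costs four times as much as mislabeling a common "tech" topic, which shifts decisions toward the minority class compared with picking the most probable label.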
Embodiment 3
[0145] According to an embodiment of the present invention, a device for classifying topics in an online community is also provided, including one or more processors and a storage device storing one or more programs; when the one or more programs are executed by the one or more processors, the one or more processors implement the steps of the method for classifying topics in the online community as described above.


