Document Classification Method and System Based on LDA Topic Model
A document classification and topic model technology, applied to text database clustering/classification and unstructured text data retrieval, that addresses the problem that standard LDA cannot meet classification requirements and cannot perform classification on its own.
Examples
Embodiment 1
[0056] The method of the present invention is illustrated below, taking sentiment classification of the documents to be classified, based on their titles, as an example:
[0057] The initial supervision dictionary contains 10 preset words: "like", "first", "desperate", "divorce", "upgrade", "disband", "depressed", "transfer", "also", and "for". Based on the sentiment of these words, four categories are preset, each corresponding to one topic: positive, negative, neutral, and other. The negative category includes "divorce", "disband", and "depressed"; the neutral category includes "transfer"; and the other category includes "also" and "for", that is, words that have nothing to do with sentiment.
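The supervision dictionary described in [0057] can be pictured as a mapping from seed words to topic indices. The following is only a minimal sketch under stated assumptions: the names `CATEGORIES`, `STATED_ASSIGNMENTS`, `build_seed_index`, and `seed_topic` are hypothetical, and this excerpt does not show how the dictionary is actually used inside the LDA model. Only the category memberships stated above are encoded; the remaining preset words are deliberately left unassigned rather than guessed.

```python
from typing import Dict, List, Optional

# The four preset categories, one topic per category (order is an assumption).
CATEGORIES: List[str] = ["positive", "negative", "neutral", "other"]

# The 10 preset words of the initial supervision dictionary.
SUPERVISION_WORDS: List[str] = [
    "like", "first", "desperate", "divorce", "upgrade",
    "disband", "depressed", "transfer", "also", "for",
]

# Only the memberships explicitly stated in the text are listed here.
STATED_ASSIGNMENTS: Dict[str, List[str]] = {
    "negative": ["divorce", "disband", "depressed"],
    "neutral": ["transfer"],
    "other": ["also", "for"],
}


def build_seed_index(assignments: Dict[str, List[str]]) -> Dict[str, int]:
    """Map each seed word to the index of its category in CATEGORIES."""
    index: Dict[str, int] = {}
    for category, words in assignments.items():
        topic_id = CATEGORIES.index(category)
        for word in words:
            index[word] = topic_id
    return index


SEED_INDEX = build_seed_index(STATED_ASSIGNMENTS)


def seed_topic(word: str) -> Optional[int]:
    """Return the preset topic index for a word, or None if it has none."""
    return SEED_INDEX.get(word)


if __name__ == "__main__":
    for w in SUPERVISION_WORDS:
        t = seed_topic(w)
        label = CATEGORIES[t] if t is not None else "(not stated in this excerpt)"
        print(f"{w}: {label}")
```

A lookup of this kind could, for example, be used to fix or bias the topic of a seed word during sampling, while words that return `None` would be assigned by the model as usual; that usage is an assumption, not something this excerpt confirms.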
[0058] A txt file contains 100 documents to be classified. After the 100 documents are read, Chinese word segmentation is first performed on the title of each document to be classified, and then the words that appear in the title of the document to be ...
