Parallel data processing method based on latent Dirichlet allocation model
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- INST OF SOFTWARE - CHINESE ACAD OF SCI
- Publication Date
- 2009-02-04
- Estimated Expiration
- Not applicable · inactive patent
Abstract
Description
Technical field
[0001] The invention relates to a text data mining method, in particular to an efficient data processing method based on latent-topic text representation, and belongs to the field of computer data mining.
Background art
[0002] Computer data mining
[0003] Computer data mining refers to intelligent information processing that uses computers to extract valid, useful, and understandable information or knowledge from large amounts of data. Early computer data mining focused mainly on regular numerical data in database systems. As the Internet has continued to grow and its applications have multiplied, computer data mining has gradually turned to Internet information processing. The data carried on the Internet differs greatly from the data in database systems: first, Internet data is mainly text written in natural language, whereas data in database systems is mainly numerical; second...