Unsupervised news automatic classification method
An automatic classification and unsupervised technology, applied in the field of information classification, can solve problems such as low efficiency and manpower loss, and achieve the effect of reducing manpower burden, speeding up speed and strengthening professionalism
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0063] like figure 1 Shown is an embodiment of an unsupervised automatic news classification method, comprising the following steps:
[0064] Step 1: Use the simhash method to check the obtained news for plausibility; simhash is an algorithm for comparing the similarity of articles with the main idea of dimensionality reduction, and the output result is the simhash value.
[0065] Step 2: Generate news vocabulary vector table (wordvec) through word2vec; word2vec is a neural network used to map vocabulary into word vector wordvec.
[0066] Step 3: Calculate the term frequency-inverse text frequency index value (TF-IDF, term frequency–inverse document frequency) of the vocabulary in the news, and obtain the weighted average sum of the first k key words according to the vocabulary vector table to obtain the news document Vector table; document vector table (docvec) is a commonly used weighting technology for information retrieval and data mining, and is used to evaluate the im...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com