News theme classification method

A news topic classification technology, applied in special data processing applications, instruments, electrical digital data processing, etc. It addresses the high time cost of existing classification methods and the way their accuracy is limited by the applicability of the chosen machine learning algorithm and the quality of the historical data, achieving reduced labor costs, shorter classification time, and reliable classification results.

Inactive Publication Date: 2014-02-12
南京绿色科技研究院有限公司

AI Technical Summary

Problems solved by technology

Machine-learning-based classification methods can effectively reduce time and labor costs, but their accuracy is often limited by the applicability of the selected machine learning algorithm and the quality of the historical data used.
In addition, such methods require the computer to learn from historical data. If the historical data set is large, learning and training take a long time; if it is small, the accuracy of the resulting classification model drops sharply.

Examples

Embodiment Construction

[0028] The present invention will be specifically introduced below in conjunction with the accompanying drawings and specific embodiments.

[0029] Referring to Figure 1, the news topic classification method of the present invention comprises the following steps:

[0030] Step 1: Establish a seed dictionary according to the topic categories of the news. The seed dictionary contains topic categories and seed keywords. Each seed keyword corresponds to one topic category, and each topic category corresponds to multiple seed keywords;

[0031] Step 2: Perform word segmentation on the headline of the news and extract the title keywords;

[0032] Step 3: Perform a meta search for the title keywords through multiple Internet-based search engine servers;

[0033] Step 4: Perform frequency statistics on the seed keywords in the meta search results;

[0034] Step 5: Determine the final topic category of the news according to the frequency of the seed keywords in ...
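
The five steps above can be sketched in Python as follows. This is only an illustrative sketch, not the patent's implementation: the seed dictionary entries, the whitespace tokenizer standing in for word segmentation, the two stub search engines, and all function names are hypothetical. Because the text of Step 5 is truncated in this record, the decision rule used here (pick the category whose seed keywords occur most often in total) is an assumption.

```python
from collections import Counter
from typing import Callable, Dict, List, Optional

# Step 1 (hypothetical data): each seed keyword maps to exactly one topic
# category, and each category has several seed keywords.
SEED_DICTIONARY: Dict[str, str] = {
    "match": "sports", "league": "sports", "coach": "sports",
    "stock": "finance", "bank": "finance", "market": "finance",
}

STOPWORDS = {"a", "an", "the", "of", "in", "on", "to", "for", "and"}
PUNCTUATION = ".,:;!?\"'()"


def extract_keywords(text: str) -> List[str]:
    """Step 2 stand-in: whitespace tokenization plus stopword removal.
    A real system would use a proper word segmenter here."""
    tokens = [t.strip(PUNCTUATION).lower() for t in text.split()]
    return [t for t in tokens if t and t not in STOPWORDS]


# A search engine is modeled as a function from a query to result snippets.
SearchEngine = Callable[[str], List[str]]


def meta_search(keywords: List[str], engines: List[SearchEngine]) -> List[str]:
    """Step 3: send the same query to every engine and pool the results."""
    query = " ".join(keywords)
    results: List[str] = []
    for engine in engines:
        results.extend(engine(query))
    return results


def count_seed_keywords(snippets: List[str], seeds: Dict[str, str]) -> Counter:
    """Step 4: count how often each seed keyword occurs in the pooled results."""
    counts: Counter = Counter()
    for snippet in snippets:
        for token in extract_keywords(snippet):
            if token in seeds:
                counts[token] += 1
    return counts


def classify(title: str, engines: List[SearchEngine],
             seeds: Dict[str, str] = SEED_DICTIONARY) -> Optional[str]:
    """Step 5 (assumed rule): return the category with the highest total
    seed-keyword frequency, or None if no seed keyword was found."""
    snippets = meta_search(extract_keywords(title), engines)
    category_counts: Counter = Counter()
    for keyword, n in count_seed_keywords(snippets, seeds).items():
        category_counts[seeds[keyword]] += n
    if not category_counts:
        return None
    return category_counts.most_common(1)[0][0]


# Stub engines returning canned snippets; real ones would query the Internet.
def fake_engine_a(query: str) -> List[str]:
    return ["The coach praised the team after the league match",
            "Stock market update"]


def fake_engine_b(query: str) -> List[str]:
    return ["League match report: coach resigns"]


if __name__ == "__main__":
    print(classify("Coach resigns after league match",
                   [fake_engine_a, fake_engine_b]))
```

Modeling each search engine as a plain function keeps the sketch free of network access; no historical training data is involved, which matches the motivation stated for the method.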

Abstract

The invention discloses a news theme classification method comprising the following steps: building a seed dictionary according to the news theme categories; performing word segmentation on the news title to extract title keywords; performing a meta search on the title keywords through a plurality of Internet-based search engine servers; performing frequency statistics on the seed keywords in the meta search results; and determining the final news theme category according to the frequency with which the seed keywords occur in the meta search results. The method greatly shortens classification time, effectively reduces labor costs, and avoids dependence on historical data. The whole classification process is fast, the classification result is reliable, several kinds of classification can be applied to news, and the method is broadly applicable in practice.

Description

Technical field

[0001] The invention relates to a method for classifying news topics, in particular to a method for classifying news on the Internet by using computer technology, and belongs to the field of computer technology.

Background art

[0002] With the advancement of modern science and technology and the rapid development of Internet technology, the information resources on the Internet are growing explosively. How to quickly and accurately obtain the required information from these massive resources has become an urgent problem for Internet users, and a major challenge in the field of information processing. In order to effectively organize and manage massive electronic information and enable users to quickly and conveniently obtain the required resources, researchers have proposed a variety of information organization and processing technologies such as text retrieval, text class...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/353
Inventor: 欧吉顺, 周楚新, 张伟
Owner: 南京绿色科技研究院有限公司