Unlock instant, AI-driven research and patent intelligence for your innovation.

A method and system for categorizing the same news information

A news and headline technology, applied in the field of information processing, can solve the problem of low accuracy of news classification

Active Publication Date: 2021-01-26
深圳市比量科技传媒有限公司
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The invention provides a method and system for categorizing the same news information to solve the problem of low accuracy of news categorization in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and system for categorizing the same news information
  • A method and system for categorizing the same news information
  • A method and system for categorizing the same news information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The technical solutions of the present invention will be described in detail below through the accompanying drawings and specific embodiments. It should be understood that the embodiments of the present invention and the specific technical features in the embodiments are only descriptions of the technical solutions of the present invention, rather than limitations. , the embodiments of the present invention and specific technical features in the embodiments may be combined with each other.

[0045] like figure 1 Shown is a flow chart of a method for categorizing the same news information in an embodiment of the present invention, and the method includes:

[0046] S101, performing Chinese word segmentation on the obtained news title, and obtaining a word list;

[0047] After getting the headline of the news, the Chinese word segmentation is first performed on the headline of the news. The method of the Chinese word segmentation is to divide each word. Mode "Mobile Tenc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and system for categorizing the same news information. The method includes: performing Chinese word segmentation on the obtained news titles, and obtaining a word list; performing data filtering on the word list to obtain data-filtered news titles; The titles filtered by the data are completed to obtain the completed titles; through the title fingerprint algorithm, the title fingerprints of each completed title are calculated to obtain the title fingerprints corresponding to each completed title; the news titles with the same title fingerprints are classified into for a category. By this method, similar news titles can be well identified, and then the information fingerprint of each title can be calculated, and the news with the same information fingerprint can be classified to better identify the same news.

Description

technical field [0001] The present application relates to the technical field of information processing, in particular to a method and system for categorizing the same news information. Background technique [0002] With the development of information technology, especially the development and popularization of Internet technology, the network has become the main way for people to publish, communicate and obtain information. However, information on the web is growing explosively. [0003] Taking online news as an example, it has gradually replaced newspapers, radio or television as the main source of news for many people due to its fast update speed, rich content and various forms. However, the advantages of fast update and rich content of online news also become the disadvantages that are not conducive to people's reading. People often have to work hard to find the news they care about. [0004] Plus, the web is flooded with much of the same news content. This is because...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/35G06F16/335G06F40/279
CPCG06F16/335G06F16/35G06F40/279
Inventor 万里黄娜周宇顺
Owner 深圳市比量科技传媒有限公司