Specific-group discovery method based on news data and related comment information

A technology of news data and comment information, which is applied in digital data processing, special data processing applications, unstructured text data retrieval, etc., to achieve high stability, reduce complexity, and facilitate group discovery

Inactive Publication Date: 2018-02-09
NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
View PDF3 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] At present, there are still many deficiencies in the discovery of specific groups in news data and related comment information. For example, how to discover robot accoun

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Specific-group discovery method based on news data and related comment information
  • Specific-group discovery method based on news data and related comment information
  • Specific-group discovery method based on news data and related comment information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The present invention will be further described in detail below in conjunction with the embodiments and the accompanying drawings.

[0031] refer to figure 1 , in one embodiment, a specific group discovery method based on news data and related comment information is provided, comprising the following steps:

[0032] Step 1: Collect news data information and relevant comment information of the targeted media network within a certain period of time.

[0033] When implemented, data from major mainstream media and forums can be collected for all mainstream media on the Internet, and special groups can also be found within a specified range, such as major mainstream media: NetEase News, Toutiao, Sina, Sohu Wait. The certain period may be, for example, approximately three months. The collected information mainly includes: news data information, comment information, user information, keywords, number of comments, and time of posting comments. Generally, it can be obtained ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a specific-group discovery method based on news data and related comment information. The method comprises the following steps: collecting the news data information and the related comment information in targeted media; classifying the news data information according to text contents thereof to obtain different class clusters; using the class cluster, which contains the newsdata information with a highest comment number, as a sample to acquire all comments of news data messages in the class cluster and users, who publish the comments, according to the related comment information; obtaining keywords through carrying out word segmentation on contents of all the comments, and using the keywords, of which occurrence frequency is higher than a threshold value, as high-frequency words; adopting a vector space model to represent the contents of the comments, clustering text of the comments through an agglomerative hierarchy, and obtaining comment user reference features of different class clusters according to a clustering result; and identifying a specific group according to the high-frequency words and the comment user reference features. A robot account can be quickly and intelligently discovered through analyzing the comment information contents, and thus processing is carried out in time.

Description

technical field [0001] The invention relates to the field of social network analysis and data mining, in particular to a specific group discovery method based on news data and related comment information. Background technique [0002] With the rapid development of information technology and Internet technology, the Internet has gradually become one of the main gathering places of social public opinion. In recent years, major events at home and abroad have been communicated, disseminated and discussed through the Internet. [0003] Among them, some special users have played an important role in the formation of social network public opinion and the dissemination of news information, engaging in topic manipulation, network marketing and other harmful behaviors, interfering with the development direction of social public opinion, endangering social security, and causing harm to society. certain influence. Most of them are ordinary users, and it is difficult to distinguish the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/3346G06F16/35G06F16/9535
Inventor 张露晨吴震马秀娟李传海刘丙双涂波戴帅夫张建宇
Owner NAT COMP NETWORK & INFORMATION SECURITY MANAGEMENT CENT
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products