Information discrimination method and system

An information discrimination and information database technology, applied in the field of data processing, that addresses the one-sided analysis and prediction of public opinion development trends.

Active Publication Date: 2016-07-27
SUZHOU UNIV
Cites: 5; Cited by: 8

AI Technical Summary

Problems solved by technology

[0005] Based on the above problems, the present invention proposes an information discrimination method and system to solve the problem in the prior art that only a single medium is monitored, which results in a relatively one-sided analysis and prediction of public opinion development trends.


Examples

Embodiment 1

[0071] Figure 1 is a flow chart of an information discrimination method disclosed in an embodiment of the present invention. The method mainly includes:

[0072] S101, based on web crawler technology, retrieving and collecting webpage information corresponding to hotspot information of traditional media and social media on the Internet, and generating a corresponding traditional media information base and social media information base;

[0073] The web crawler technology used in S101 can be customized or preset by supervisors as required. When S101 is executed, with a URL (Uniform Resource Locator) as the entry point, hot-topic keywords can be obtained from the keyword lists in the hotspot rankings of portal websites, such as the Baidu real-time hotspot ranking. Each hot-topic keyword is then combined with web crawler technology to retrieve and collect webpage information corresponding to hot information of traditional media o...
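The keyword-driven collection in S101 can be sketched as follows. This is a minimal illustration and not the patent's crawler: the search URL templates, the `example.com` hosts, and the function names `build_search_urls` and `collect_pages` are hypothetical, and the page fetcher is passed in as a parameter so that the sketch runs without network access.

```python
from urllib.parse import quote_plus

# Hypothetical search endpoints, one per media type (assumption, not from the patent).
SEARCH_TEMPLATES = {
    "traditional": "https://news.example.com/search?q={query}",
    "social": "https://social.example.com/search?q={query}",
}


def build_search_urls(keywords, search_templates):
    """Pair every hot-topic keyword with one search URL per media type."""
    urls = []
    for keyword in keywords:
        for media_type, template in search_templates.items():
            url = template.format(query=quote_plus(keyword))
            urls.append((keyword, media_type, url))
    return urls


def collect_pages(keywords, search_templates, fetch):
    """Fetch each search URL; `fetch` is any callable mapping a URL to page text."""
    pages = []
    for keyword, media_type, url in build_search_urls(keywords, search_templates):
        pages.append(
            {"keyword": keyword, "media_type": media_type, "url": url, "html": fetch(url)}
        )
    return pages


def fake_fetch(url):
    """Offline stand-in for a real HTTP client."""
    return "<html>stub page for " + url + "</html>"


# In practice the keywords would come from a hotspot ranking list.
pages = collect_pages(["flood relief"], SEARCH_TEMPLATES, fake_fetch)
```

Injecting `fetch` keeps the keyword-to-URL logic separate from the HTTP layer, so a real client with rate limiting and politeness rules could be substituted without changing the rest of the sketch.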

Embodiment 2

[0088] Based on the information discrimination method disclosed in the first embodiment of the present invention, the specific execution process of S101 shown in Figure 1 is shown in Figure 2 and mainly includes:

[0089] S201, based on web crawler technology, retrieve hot topics in traditional media and social media on the Internet;

[0090] S202, collecting traditional media web page information and social media web page information corresponding to the hot topic;

[0091] S203, storing each piece of traditional media webpage information and social media webpage information in chronological order, and generating a corresponding traditional media information base and social media information base.

[0092] The webpage information includes valid information such as time information, source information, original URL, author information and text information. The traditional media information base here can be in the form of a document, that is, a tradit...
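Steps S202 and S203 can be illustrated with a small data structure. The sketch below is an assumption-laden example: the record fields mirror the ones listed in paragraph [0092], while the class name `WebPageRecord`, the `media_type` label, and the in-memory dictionary of lists are hypothetical choices (the patent describes the information bases as documents).

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class WebPageRecord:
    """One collected page, with the fields listed in paragraph [0092]."""
    time: datetime
    source: str
    url: str
    author: str
    text: str
    media_type: str  # "traditional" or "social" (label assumed for this sketch)


def build_information_bases(records):
    """Split records by media type, each base kept in chronological order."""
    bases = {"traditional": [], "social": []}
    for record in sorted(records, key=lambda r: r.time):
        bases[record.media_type].append(record)
    return bases


records = [
    WebPageRecord(datetime(2016, 3, 2, 9, 0), "Daily News", "https://news.example.com/a",
                  "reporter", "Follow-up report", "traditional"),
    WebPageRecord(datetime(2016, 3, 1, 8, 0), "Microblog", "https://social.example.com/1",
                  "user_a", "First post about the event", "social"),
    WebPageRecord(datetime(2016, 3, 1, 12, 0), "Daily News", "https://news.example.com/b",
                  "reporter", "Initial report", "traditional"),
]

bases = build_information_bases(records)
```

Sorting once before splitting keeps both bases ordered by time, which is what later comparison of the two information flows would rely on.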

Embodiment 3

[0114] Based on the information discrimination method disclosed in the first and second embodiments of the present invention, the third embodiment of the present invention discloses a corresponding information discrimination system.

[0115] Figure 5 is a schematic block diagram of the information discrimination system 100 disclosed in Embodiment 1 of the present invention. The system mainly includes:

[0116] The information collection module 101 is used to retrieve and collect webpage information corresponding to hot topics of traditional media and social media on the Internet based on web crawler technology, and generate corresponding traditional media information bases and social media information bases;

[0117] The information pre-processing module 102 is used to perform data analysis and processing on the traditional media webpage information stored in the traditional media information database and the social media webpage information stor...

Abstract

The invention provides an information discrimination method and system. The method comprises the following steps. Based on web crawler technology, webpage information corresponding to hotspot information of traditional media and social media is retrieved and collected, and the collected webpage information is processed to obtain traditional media and social media data sets that are marked with category labels and divided into training set data and test set data. Based on the training set data, topic modeling is performed to obtain topic and keyword documents, and a topic feature set corresponding to the traditional media data and a keyword feature set corresponding to the social media data are established. Classifiers are trained using the topic feature set and the keyword feature set, and the resulting traditional media classifier and social media classifier classify the test set data to identify traditional media data capable of triggering social media reports and/or social media data capable of triggering traditional media reports. By monitoring multiple media, the development trend of public opinion can be analyzed and predicted more comprehensively and more quickly.
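The train-then-classify step in the abstract can be sketched on toy data. The example below makes several assumptions that are not in the source: the keyword feature set is built by simple term frequency rather than topic modeling, the classifier is a multinomial Naive Bayes with add-one smoothing (the visible text does not name a classifier), and the labels `trigger` and `no_trigger` and all sample texts are invented for illustration.

```python
import math
from collections import Counter


def keyword_feature_set(train_docs, top_k):
    """Pick the top_k most frequent training words as the keyword features."""
    counts = Counter(w for text, _ in train_docs for w in text.lower().split())
    return {word for word, _ in counts.most_common(top_k)}


def train_naive_bayes(train_docs, features):
    """Count documents per label and feature-word occurrences per label."""
    label_counts = Counter(label for _, label in train_docs)
    word_counts = {label: Counter() for label in label_counts}
    for text, label in train_docs:
        word_counts[label].update(w for w in text.lower().split() if w in features)
    return label_counts, word_counts


def classify(text, model, features):
    """Return the label with the highest smoothed log-probability."""
    label_counts, word_counts = model
    total_docs = sum(label_counts.values())
    words = [w for w in text.lower().split() if w in features]
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        total_words = sum(word_counts[label].values())
        score = math.log(label_counts[label] / total_docs)
        for w in words:
            # Add-one smoothing over the feature vocabulary.
            score += math.log((word_counts[label][w] + 1) / (total_words + len(features)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy social media posts; "trigger" means the post led to traditional media reports.
train_docs = [
    ("earthquake rescue update", "trigger"),
    ("earthquake rescue donation", "trigger"),
    ("lunch photo cat", "no_trigger"),
    ("cat photo weekend", "no_trigger"),
]
test_docs = [("earthquake rescue team", "trigger"), ("cat photo", "no_trigger")]

features = keyword_feature_set(train_docs, top_k=4)
model = train_naive_bayes(train_docs, features)
predictions = [classify(text, model, features) for text, _ in test_docs]
accuracy = sum(p == label for p, (_, label) in zip(predictions, test_docs)) / len(test_docs)
```

A second classifier for traditional media data would follow the same pattern with a topic feature set in place of the keyword feature set.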

Description

Technical field

[0001] The present invention relates to the technical field of data processing, in particular to an information discrimination method and system.

Background

[0002] At present, network public opinion is mainly monitored by automatically identifying new topics in the information flow and continuously tracking known topics. Examples include the TDT (Topic Detection and Tracking) system in the United States and the TRS public opinion monitoring system of Beijing Tops Corporation. However, these systems monitor only a single medium. They cannot draw on the interaction between the information flows of traditional media and social media to better analyze and predict the development trend of public opinion.

[0003] Given the large and scattered national conditions in China, people's sources of information do not only depend...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06K9/62
CPC: G06F16/951, G06F18/24
Inventor: 龚慧敏, 段湘煜, 张民
Owner: SUZHOU UNIV