The invention relates to a network-based
system for analyzing opinion information in a discrete text, belonging to the field of network information safety. The
system comprises the following modules: a discrete text
information acquisition module which acquires network information in a preset analysis cycle, a discrete text information tracking and restoring module which restores ellipsis and remote anaphora in the original content to obtain a text which contains a relatively complete text structure and
semantic information, a
semantic information mining and characteristic extracting module which realizes
semantic information mining and characteristic extracting on text information by utilizing a
latent semantic indexing technology, an opinion information clustering module which realizes information clustering by combining a
niche genetic algorithm with a K-Means method, a hot opinion event discovery module which mines the hot opinion in the obtained topic and event, and a
background information processing and data supporting center which analyzes data and provides a
repertoire specially for a network, new words in the network, the existing
class information and the existing
hot topics. By applying the invention, the problem that
information analysis is influenced as the text structure of the existing network opinion information is incomplete, ellipsis and remote anaphora are more and the new works in the network are more is solved, and the accuracy for discovery of the opinion and hot event is improved by adopting a high-efficiency clustering method.