Method and system for analyzing events on basis of non-negative matrix factorization

A technology of non-negative matrix decomposition and basic matrix, which is applied in semantic analysis, special data processing applications, instruments, etc., can solve problems such as high computational complexity and poor scalability, and achieve good scalability, ensure accuracy, Calculate the effect of ease
CN104281663AInactive Publication Date: 2015-01-14BEIHANG UNIV

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
BEIHANG UNIV
Publication Date
2015-01-14
Estimated Expiration
Not applicable · inactive patent

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention provides a method and a system for analyzing events on the basis of non-negative matrix factorization. The method includes acquiring to-be-processed data of at least one data text; respectively segmenting words of each data text to obtain text space matrixes corresponding to the to-be-processed data; performing non-negative matrix factorization ion the text space matrixes, determining various events and keywords in the to-be-processed data according to basis matrixes, and determining data texts according to coefficient matrixes. The basis matrixes are obtained by means of factorization. The keywords are respectively used for describing the various events. The coefficient matrixes are obtained by means of factorization. The determined data texts respectively correspond to the various events. The method and the system have the advantages that the text space matrixes of the to-be-processed data are constructed and are subjected to non-negative matrix factorization, so that each matrix on a huge scale can be factorized to obtain two corresponding matrixes on small scales, the non-negativity of matrix elements can be guaranteed before and after the matrixes are factorized, the accuracy of event mining results can be guaranteed, the events in the to-be-processed data can be found by means of reducing dimensions, computation is simple and convenient, and the method and the system are good in expansibility.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention belongs to the technical field of data mining, and in particular relates to an event analysis method and system based on non-negative matrix decomposition. Background technique

[0002] With the vigorous development of Internet technology, more and more users publish various news or express personal opinions on some social phenomena through social network platforms such as forums and Weibo, which leads to the emergence of various data information on the Internet. With explosive growth, how to effectively mine massive amounts of data information is a major research issue for various search engines.

[0003] An existing method of data mining is to use hierarchical clustering to decompose a given set of data objects hierarchically until a certain cut-off condition is met. Specifically, it can be divided into: agglomerative hierarchical clustering: a bottom-up strategy that first treats each data object as an atomic cluster, and then merges ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More