
Method and system for analyzing events on basis of non-negative matrix factorization

An event analysis technique based on non-negative matrix factorization and its basis matrix, applicable to semantic analysis, special data processing applications, instruments, and similar fields. It addresses the high computational complexity and poor scalability of existing approaches, and achieves good scalability, assured accuracy, and simple computation.

Status: Inactive. Publication date: 2015-01-14
BEIHANG UNIV
3 cites, 0 cited by

AI Technical Summary

Problems solved by technology

[0005] In view of the above problems, the present invention provides an event analysis method and system based on non-negative matrix factorization, to overcome the high computational complexity and poor scalability of the hierarchical clustering methods used in the prior art.



Detailed Description of Embodiments

[0019] Figure 1 is a flowchart of Embodiment 1 of the event analysis method based on non-negative matrix factorization of the present invention. As shown in Figure 1, the method includes:

[0020] Step 101: Obtain the data to be processed, which includes at least one data text;

[0021] Step 102: Perform word segmentation on each data text in the at least one data text to obtain a text space matrix corresponding to the data to be processed, where the text space matrix describes the word information contained in the at least one data text;

[0022] Step 103: Perform non-negative matrix factorization on the text space matrix; determine, according to the basis matrix obtained through the factorization, each event included in the data to be processed and the keywords used to describe each event; and determine, according to the coefficient matrix obtained through the factorization, the data texts corresponding to each event.
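
Steps 101 to 103 can be sketched in a few lines of plain Python. This is a minimal illustration, not the patent's implementation: it assumes the texts are already segmented into word lists, uses raw word counts as matrix entries, and uses the Lee-Seung multiplicative updates for the factorization. All function names (`build_text_space_matrix`, `nmf`, `top_keywords`, `assign_texts`) and the toy texts are hypothetical.

```python
import random


def build_text_space_matrix(segmented_texts):
    """Rows are vocabulary words, columns are data texts, entries are word counts."""
    vocab = sorted({w for text in segmented_texts for w in text})
    index = {w: i for i, w in enumerate(vocab)}
    V = [[0.0] * len(segmented_texts) for _ in vocab]
    for j, text in enumerate(segmented_texts):
        for w in text:
            V[index[w]][j] += 1.0
    return vocab, V


def transpose(A):
    return [list(row) for row in zip(*A)]


def matmul(A, B):
    Bt = transpose(B)
    return [[sum(a * b for a, b in zip(row, col)) for col in Bt] for row in A]


def nmf(V, k, iters=200, seed=0, eps=1e-9):
    """Factor V (m x n) into W (m x k) and H (k x n) with multiplicative updates.

    Assumption: squared Frobenius error as the objective (Lee-Seung updates).
    """
    rng = random.Random(seed)
    m, n = len(V), len(V[0])
    W = [[rng.random() + 0.1 for _ in range(k)] for _ in range(m)]
    H = [[rng.random() + 0.1 for _ in range(n)] for _ in range(k)]
    for _ in range(iters):
        # Update the coefficient matrix H with W held fixed.
        Wt = transpose(W)
        num, den = matmul(Wt, V), matmul(matmul(Wt, W), H)
        H = [[H[a][j] * num[a][j] / (den[a][j] + eps) for j in range(n)]
             for a in range(k)]
        # Update the basis matrix W with H held fixed.
        Ht = transpose(H)
        num, den = matmul(V, Ht), matmul(W, matmul(H, Ht))
        W = [[W[i][a] * num[i][a] / (den[i][a] + eps) for a in range(k)]
             for i in range(m)]
    return W, H


def top_keywords(vocab, W, event, top_n=3):
    """Words with the largest weights in one column of the basis matrix."""
    order = sorted(range(len(vocab)), key=lambda i: W[i][event], reverse=True)
    return [vocab[i] for i in order[:top_n]]


def assign_texts(H):
    """For each data text (column of H), the event with the largest coefficient."""
    return [max(range(len(H)), key=lambda a: H[a][j]) for j in range(len(H[0]))]


# Hypothetical, already-segmented data texts (Step 101 and part of Step 102).
texts = [
    ["earthquake", "rescue", "team"],
    ["earthquake", "rescue", "aid"],
    ["football", "match", "goal"],
    ["football", "goal", "final"],
]
vocab, V = build_text_space_matrix(texts)   # Step 102
W, H = nmf(V, k=2)                          # Step 103
labels = assign_texts(H)
keywords = [top_keywords(vocab, W, e) for e in range(2)]
```

Each column of `W` scores the vocabulary for one event, so its largest entries serve as that event's keywords. Each column of `H` scores one data text against the events; here a text is simply assigned to its highest-scoring event, which is one possible reading of "determine the data texts corresponding to each event".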

[0023] The method provided ...


Abstract

The invention provides a method and a system for analyzing events on the basis of non-negative matrix factorization. The method includes: acquiring to-be-processed data comprising at least one data text; segmenting each data text into words to obtain a text space matrix corresponding to the to-be-processed data; performing non-negative matrix factorization on the text space matrix; determining the events in the to-be-processed data, and the keywords that describe each event, according to the basis matrix obtained by the factorization; and determining the data texts corresponding to each event according to the coefficient matrix obtained by the factorization. Because a text space matrix is constructed for the to-be-processed data and then subjected to non-negative matrix factorization, a very large matrix can be factorized into two much smaller matrices while the matrix elements remain non-negative before and after factorization. This preserves the accuracy of the event mining results, finds the events in the to-be-processed data through dimensionality reduction, keeps the computation simple, and scales well.
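
The factorization the abstract refers to can be written down in a common formulation. The symbols V, W, H, m, n and k, the Frobenius objective, and the multiplicative update rule are assumptions made for illustration; this excerpt does not state which objective or update rule the patent uses.

```latex
V \in \mathbb{R}_{\ge 0}^{m \times n}, \qquad
V \approx W H, \qquad
W \in \mathbb{R}_{\ge 0}^{m \times k}, \quad
H \in \mathbb{R}_{\ge 0}^{k \times n}, \quad
k \ll \min(m, n)

\min_{W \ge 0,\; H \ge 0} \; \lVert V - W H \rVert_F^2

H_{aj} \leftarrow H_{aj} \, \frac{(W^{\top} V)_{aj}}{(W^{\top} W H)_{aj}}, \qquad
W_{ia} \leftarrow W_{ia} \, \frac{(V H^{\top})_{ia}}{(W H H^{\top})_{ia}}
```

Here V plays the role of the text space matrix (m words by n data texts), W the basis matrix, and H the coefficient matrix. The two factors hold k(m + n) entries instead of mn, which is the "two smaller matrices" of the abstract. Each update multiplies an entry by a ratio of non-negative quantities, so factors that start non-negative stay non-negative.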

Description

Technical field

[0001] The invention belongs to the technical field of data mining, and in particular relates to an event analysis method and system based on non-negative matrix factorization.

Background

[0002] With the rapid development of Internet technology, more and more users publish news or express personal opinions on social phenomena through social network platforms such as forums and Weibo. As a result, the volume of data on the Internet is growing explosively, and how to mine this massive amount of data effectively is a major research issue for search engines.

[0003] An existing data mining method uses hierarchical clustering to decompose a given set of data objects hierarchically until a certain cut-off condition is met. Specifically, it can be divided into: agglomerative hierarchical clustering: a bottom-up strategy that first treats each data object as an atomic cluster, and then merges ...
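
For context, the bottom-up strategy named in the background can be sketched as a naive agglomerative clustering. This is a generic textbook version under stated assumptions, not the prior-art method the patent analyzes: it works on one-dimensional numbers, uses single-linkage distance, and stops at a target cluster count as its cut-off condition. The function name `agglomerative` and the sample points are hypothetical.

```python
def agglomerative(points, target_clusters):
    """Naive single-linkage agglomerative clustering on 1-D points."""
    # Every data object starts as its own atomic cluster.
    clusters = [[p] for p in points]
    # Cut-off condition (assumed): stop at a requested number of clusters.
    while len(clusters) > target_clusters:
        best = None
        # Scan every pair of clusters to find the closest two.
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = min(abs(a - b) for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        # Merge the closest pair into one cluster.
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


result = agglomerative([1, 2, 10, 11, 30], 3)
```

Because every merge rescans all remaining cluster pairs, the work grows quickly with the number of data objects, which is consistent with the complexity and scalability concerns the patent raises about hierarchical clustering.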


Application Information

Patent type & authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/3344, G06F40/30
Inventors: 张日崇, 邰振赢, 于伟仁, 刘俊伟, 李建欣
Owner: BEIHANG UNIV