Clustering method and system of parallelized self-organizing mapping neural network based on graphic processing unit

A graphics processing unit and self-organizing mapping technology, applied in the field of parallelized self-organizing mapping neural network clustering, can solve the problems of large amount of data and slow calculation speed, and achieve the effect of fast clustering

Inactive Publication Date: 2014-01-01
HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
View PDF2 Cites 38 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The technical problem to be solved by the present invention is to construct a parallelized self-organizing mapping neural network clustering method and system based on a Graphic Processing Unit (Graphic Processing Unit, referred to as "GPU"), which overcomes the limitations of existing technologies in The technical problem of slow calculation speed due to the large amount of data in the process of text clustering

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Clustering method and system of parallelized self-organizing mapping neural network based on graphic processing unit
  • Clustering method and system of parallelized self-organizing mapping neural network based on graphic processing unit
  • Clustering method and system of parallelized self-organizing mapping neural network based on graphic processing unit

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] The technical solutions of the present invention will be further described below in conjunction with specific embodiments.

[0043] Such as figure 1 As shown, the specific embodiment of the present invention is: provide a kind of parallel self-organizing map neural network clustering method based on graphics processing unit, comprise the steps:

[0044] Step 1: Parallel keyword frequency statistics, namely: segment the text content into words and obtain a collection of keywords; for large-scale text data, the large-scale computing unit of the graphics processing device can provide each text document with a thread parallel statistical document The frequency of keywords is obtained as a word frequency matrix.

[0045] The specific implementation process is as follows: Computers do not have human intelligence. After reading an article, people can have a vague understanding of the content of the article according to their own understanding ability, but computers cannot eas...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a clustering method and system of a parallelized self-organizing mapping neural network based on a graphic processing unit. Compared with the traditional serialized clustering method, the invention can realize large-scale data clustering in a faster manner by parallelization of an algorithm and a parallel processing system of the graphic processing unit. The invention mainly relates to two aspects of contents: (1) firstly, designing the clustering method of the parallelized self-organizing mapping neural network according to the characteristic of high parallelized calculating capability of the graphic processing unit, wherein the method comprises the following steps of obtaining a word-frequency matrix by carrying out parallelized statistics on the word frequency of keywords in a document, calculating feature vectors of a text by parallelization to generate a feature matrix of data sets, and obtaining a cluster structure of massive data objects by the parallelized self-organizing mapping neural network; and (2) secondly, designing a parallelized text clustering system based on a CPU / GPU cooperation framework by utilizing the complementarity of the calculating capability between the graphic processing unit (GPU) and the central processing unit (CPU).

Description

technical field [0001] The invention relates to a parallel self-organizing map neural network clustering method and system, in particular to a graphic processing unit-based parallel self-organizing map neural network clustering method and system. Background technique [0002] At present, with the popularization of computers, the number of Internet users continues to increase, and Internet users generate a large amount of information on the Internet every day. At the same time, in some social media systems with a large number of users, a large amount of new data is added every day. Data mining and machine learning algorithms provide feasible methods for us to extract valuable information from these data, but most of the algorithms have complex learning processes, require iterative learning, and take a long time to process massive data. Although useful information is extracted, the information may no longer be time-sensitive, which requires the development of faster algorithm...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/951G06N3/08
Inventor 叶允明张金超黄晓辉
Owner HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products