Short text clustering method based on deep learning

A short text clustering method based on deep learning, applied in the fields of deep learning and text mining. It addresses the problems that existing methods do not consider the semantic connection between short texts and produce unsatisfactory clustering results, and it aims at faster and more accurate cluster analysis.

Status: Inactive; Publication Date: 2017-05-10
RUN TECH CO LTD
3 Cites, 30 Cited by

AI Technical Summary

Problems solved by technology

Existing short text clustering methods do not take into account the semantic connection between short texts, which makes the clustering results unsatisfactory.

Embodiment Construction

[0047] The present invention is further described below with reference to the drawings and embodiments. The specific embodiments described here serve only to explain the present invention, not to limit it. For convenience of description, the accompanying drawings show only the parts related to the present invention rather than the whole content. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by those skilled in the technical field of the present invention. The terms used herein are for describing specific embodiments only and are not intended to limit the present invention.

[0048] Please refer to Figure 1, which is a flow chart of the short text clustering method based on deep learning provided by the embodiment of the present invention.

[0049] In the present embodiment, ...

Abstract

The invention discloses a short text clustering method based on deep learning. The method includes the following steps: S101, the semantic similarity between short texts is calculated through a convolutional neural network; S102, the semantic similarity is applied to a clustering algorithm, and the short texts are clustered. The method improves on the accuracy of existing short text clustering, so that massive short text data can be clustered more rapidly and accurately, and it can be widely applied to fields such as short text clustering tasks, sentiment analysis and recommendation systems. The short text similarity part and the short text clustering part are calculated through the convolutional neural network without preprocessing the input short text data, and the length of the input short texts can be increased.
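
The two steps S101 and S102 can be illustrated with a minimal sketch. It is not the patented implementation: the network architecture, the training procedure and the clustering algorithm are not given in the text available here. Everything below is an assumption made for illustration, including the names `encode`, `similarity_matrix` and `cluster`, the character-level input, the random untrained filters, the cosine similarity measure and the threshold-based single-link grouping. Because the filters are untrained, the similarities carry no real semantic meaning; the sketch only shows how data flows from a convolutional encoder to a clustering step.

```python
import math
import random

EMB_DIM = 8      # hypothetical embedding size
KERNEL = 3       # hypothetical convolution window, in characters
N_FILTERS = 16   # hypothetical number of convolution filters


def char_vector(ch):
    """Deterministic pseudo-embedding for one character (stand-in for learned embeddings)."""
    rng = random.Random(ord(ch))
    return [rng.uniform(-1.0, 1.0) for _ in range(EMB_DIM)]


def make_filters(seed=0):
    """Random, untrained filters; a real system would learn these from data."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(KERNEL * EMB_DIM)]
            for _ in range(N_FILTERS)]


def encode(text, filters):
    """1-D convolution over character embeddings, ReLU, then max-over-time pooling."""
    vecs = [char_vector(ch) for ch in text]
    # Zero-pad so that texts shorter than the kernel still yield one window.
    while len(vecs) < KERNEL:
        vecs.append([0.0] * EMB_DIM)
    features = []
    for f in filters:
        best = 0.0  # ReLU floor
        for i in range(len(vecs) - KERNEL + 1):
            window = [x for v in vecs[i:i + KERNEL] for x in v]
            best = max(best, sum(w * x for w, x in zip(f, window)))
        features.append(best)
    return features


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    if na == 0.0 or nb == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (na * nb)


def similarity_matrix(texts, filters):
    """S101 (sketch): pairwise similarity computed from the convolutional encoder."""
    enc = [encode(t, filters) for t in texts]
    return [[cosine(a, b) for b in enc] for a in enc]


def cluster(sim, threshold=0.9):
    """S102 (sketch): group texts whose similarity reaches the threshold (union-find)."""
    n = len(sim)
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            if sim[i][j] >= threshold:
                parent[find(i)] = find(j)
    roots = {}
    # Relabel union-find roots as consecutive cluster ids starting at 0.
    return [roots.setdefault(find(i), len(roots)) for i in range(n)]


if __name__ == "__main__":
    texts = ["great phone", "great phone", "terrible service at the store"]
    sim = similarity_matrix(texts, make_filters())
    print(cluster(sim))
```

The raw strings go straight into the encoder without tokenization or stop-word removal, which mirrors the abstract's statement that the input is not preprocessed. Any other similarity-based clustering algorithm could take the place of the threshold grouping.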

Description

Technical field

[0001] The invention relates to the technical fields of deep learning and text mining, in particular to a short text clustering method based on deep learning.

Background art

[0002] Text clustering is a major topic of cluster analysis in the fields of data mining and natural language processing. With the rapid popularization of the Internet and the rapid development of information technology, the total amount of data has grown larger and larger, and the relationships between data have become more and more complex. At the same time, with the development of social media, text data has grown rapidly, usually in the form of short texts such as Weibo posts, product reviews, and geographic location information. How to accurately and quickly extract valuable information from a large-scale short text data set has become a new challenge.

[0003] The usual practice is to use text clustering and other methods to effectively organize short text inform...

Application Information

IPC(8): G06F17/30, G06F17/27, G06N3/08
CPC: G06F16/35, G06F40/284, G06F40/289, G06N3/08
Inventor: 杨华兴, 苗欣, 董美亚
Owner: RUN TECH CO LTD