Clustering method for topic views based on sentence similarity

A clustering method based on sentence similarity, applied in the field of computer networks. It addresses the coarse classification granularity of existing methods, which leaves users unable to understand each category's claims, supporting arguments, and reasoning process, and which makes user needs difficult to meet. Its effects are more refined and more diverse clustering results that avoid ambiguity and one-sidedness.

Active Publication Date: 2017-02-01
SOUTHEAST UNIV

AI Technical Summary

Problems solved by technology

With traditional methods, classification granularity is coarse: users can generally learn only the polarity of each category, not its claims, supporting arguments, or reasoning process.
Moreover, for topics with a large number of opinions, or topics whose opinions are hard to describe simply as positive or negative, traditional opinion clustering methods are severely limited and struggle to meet user needs.

Examples

Detailed Description of the Embodiments

[0017] The present invention is further illustrated below with reference to specific embodiments. These embodiments serve only to illustrate the invention and are not intended to limit its scope. After reading this disclosure, modifications by those skilled in the art to various equivalent forms of the invention all fall within the scope defined by the appended claims of the present application.

[0018] In a specific implementation of the present invention, text related to the topic to be clustered is first collected from the Internet with tools such as web crawlers. Viewpoint clustering is then carried out in three steps: constructing a viewpoint lexicon, clustering the topic's viewpoints, and extracting a representative sentence for each viewpoint. The steps are implemented as follows:

[0019] Step 1: Construct the opinion lexicon. Firstly, according to th...

Abstract

The invention discloses a topic viewpoint clustering method based on sentence similarity. The method can be used to cluster the main viewpoints on a given topic on the Internet. It comprises the following steps: first, constructing a viewpoint lexicon for the topic through human-computer cooperation; second, extracting all viewpoint sentences on the topic and clustering them by sentence similarity; finally, selecting a representative viewpoint sentence for each viewpoint class according to the average similarity of its sentences. The method yields more diverse and more refined clustering results, lets users learn the viewpoints of the various parties and their details more clearly, and effectively avoids ambiguity and one-sidedness in viewpoint clustering and description.
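
The three steps in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions, not the patented procedure: the text available here does not specify the similarity measure or the clustering algorithm, so the example assumes Jaccard similarity over whitespace tokens and a greedy single-pass clustering with a hypothetical `threshold` parameter. The function names and the hand-written lexicon are likewise hypothetical.

```python
from typing import List, Set


def tokens(sentence: str) -> Set[str]:
    """Lowercased whitespace tokens; a stand-in for real word segmentation."""
    return set(sentence.lower().split())


def similarity(a: str, b: str) -> float:
    """Jaccard similarity of token sets (an assumed measure, not the patent's)."""
    ta, tb = tokens(a), tokens(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def viewpoint_sentences(sentences: List[str], lexicon: Set[str]) -> List[str]:
    """Keep sentences that contain at least one word from the viewpoint lexicon."""
    return [s for s in sentences if tokens(s) & lexicon]


def cluster(sentences: List[str], threshold: float) -> List[List[str]]:
    """Greedy single-pass clustering (assumed): a sentence joins the first
    cluster whose average similarity to it reaches the threshold, otherwise
    it starts a new cluster."""
    clusters: List[List[str]] = []
    for s in sentences:
        for members in clusters:
            avg = sum(similarity(s, m) for m in members) / len(members)
            if avg >= threshold:
                members.append(s)
                break
        else:
            clusters.append([s])
    return clusters


def representative(members: List[str]) -> str:
    """Return the sentence with the highest average similarity to the
    other sentences in the same cluster."""
    if len(members) == 1:
        return members[0]

    def avg_sim(i: int) -> float:
        others = [m for j, m in enumerate(members) if j != i]
        return sum(similarity(members[i], o) for o in others) / len(others)

    return members[max(range(len(members)), key=avg_sim)]


if __name__ == "__main__":
    # Toy data; a real lexicon would come from the human-computer step.
    lexicon = {"great", "dim"}
    corpus = [
        "the phone battery is great",
        "the battery is great",
        "the phone battery is really great",
        "the screen is too dim",
        "the screen is dim",
        "I bought it yesterday",
    ]
    for group in cluster(viewpoint_sentences(corpus, lexicon), threshold=0.5):
        print(representative(group), "<-", group)
```

The choice of threshold controls granularity: a higher value splits viewpoints into finer classes, while a lower value merges them. A production system would replace the whitespace tokenizer with proper word segmentation for the target language.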

Description

Technical field

[0001] The invention relates to a topic viewpoint clustering method based on sentence similarity, which can be used for viewpoint clustering and viewpoint mining on Internet hot topics, and belongs to the technical field of computer networks.

Background

[0002] With the rapid development of the mobile Internet, the content and information on the Internet have become complex, and the viewpoints expressed there are markedly diverse. To understand Internet topics in depth and avoid being misled by one-sided information, people increasingly need to obtain other people's opinions on a topic from a large amount of Internet information and to make more rational decisions by comparing different opinions. For example, when shopping online, people often judge whether a product is worth buying from the tendency of its reviews. Opinion clustering is the main me...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30, G06K9/62
CPC: G06F16/35, G06F18/231, G06F18/22
Inventor: 杨鹏, 袁志伟, 顾梁, 赵丹丹
Owner: SOUTHEAST UNIV