Social media large data subject discovering method based on knowledge map

A knowledge map and social media technology, applied in the field of topic discovery, can solve problems affecting the quality of topics and high degree of fragmentation, and achieve the effects of high accuracy, high scalability, and high availability

Active Publication Date: 2017-06-30
TONGJI UNIV
View PDF5 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the short content and high degree of fragmentation of social media big data, these methods have obvious defects in capturing data semantic infor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Social media large data subject discovering method based on knowledge map
  • Social media large data subject discovering method based on knowledge map
  • Social media large data subject discovering method based on knowledge map

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] workflow such as figure 1 shown.

[0021] In step 1, the present invention for each triple fact i ,r,e j >, identify the m data sources DS of the triplet 1 ,DS 2 ,...,DS mAfter that, each data source DS needs to x (1≤x≤m) to evaluate the reliability. due to DS x Contains a large amount of data, and the distribution of the data is unknown, so it is difficult for us to accurately evaluate its credibility. The present invention adopts an approximate evaluation strategy, and the implementation method is as follows: For the data source DS x , first extract the triplet facts with the quantity w to form the set TF x ={}, and the value of w is determined by the following method, record DS x The number of triplet facts in is ψ:

[0022]

[0023] where min is the minimum value function. Then, the present invention uses TF x Collect data samples, train and construct data source DS x A regression predictive model for the confidence value of triplet facts in . On thi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a social media large data subject discovering method based on a knowledge map. The social media large data subject discovering is realized by four steps of knowledge map probability processing, social media large data semantic similarity calculating, subject fuzzy density clustering and weak information subject filtering. The knowledge map probability processing step is to complete confidence assessment and true probability value generation of a triad fact in the knowledge map. The social media large data semantic similarity calculating step is to achieve semantic mapping structure of each pair of documents and semantic similarity assessment among the documents based on an approximate map matching strategy. The subject fuzzy density clustering step is to obtain different subjects of the social media large data and automatically determine the number of the subjects. The weak information subject filtering step is to delete the subjects with less semantic strength and return an optimal subject list to a user. Compared with the prior art, the method has the advantages of being high in extendibility degree, strong in self-adapting capability, high in accuracy and the like, and being effectively used in the fields of social public safety, public health care, Internet depth information service, e-commerce and the like.

Description

technical field [0001] The present invention relates to a topic discovery method, in particular to a knowledge map-based social media big data topic discovery method. Background technique [0002] In recent years, with the rapid development of technologies such as cloud computing, mobile communications, and social networks, the big data contained in social media platforms, that is, social media big data, has become more and more manifested in the 4 "V" (Volume, Velocity, Variety, Veracity) characteristic. With the accumulation of time, social media big data contains a wealth of social information, including network mapping of a large number of important social event clue information, and these network mapping information usually seem to be disorganized. In-depth analysis and mining of social media big data, quickly and accurately discover the hidden deep-seated themes, and then effectively predict the future development trend of social events on the basis of the existing di...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06Q50/00
CPCG06F16/367G06Q50/01
Inventor 黄震华倪娟程久军
Owner TONGJI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products