Dynamic user attribute extraction method based on social media

A technology for social media, attribute extraction

Active Publication Date: 2017-01-25
UNIV OF ELECTRONIC SCI & TECH OF CHINA
View PDF3 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the invention of the present invention is: in order to solve the sparsity problem of short text, overcome the disadvantages such as inaccurate user attribute mining of the prior art and can not update in time, the present invention is based on the new dynamic user attribute model constructed (can be automatically obtained from text Min

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dynamic user attribute extraction method based on social media
  • Dynamic user attribute extraction method based on social media
  • Dynamic user attribute extraction method based on social media

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] In order to make the purpose, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the implementation methods and accompanying drawings.

[0026] see figure 1 , the social media-based dynamic user attribute extraction method of the present invention mainly involves three parts: text data preprocessing (text preprocessing for short), topic extraction and user dynamic attribute mining.

[0027] The short texts of Sina Weibo users are obtained through crawlers. Since there is a lot of noise information, text information with low noise can be obtained through preprocessing methods such as word segmentation and removal of meaningless characters. Use the BTM topic model to extract 10 topics (respectively fitness, food, digital, sports, beauty, tourism, military, music, cute pets, and games) and their corresponding top 20 weighted high-frequency keywords, and From the extracted ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a dynamic user attribute extraction method based on social media. The dynamic user attribute extraction method includes the steps: firstly, performing text preprocessing for an acquired training sample set and extracting subject terms to obtain K subjects and m subject terms of each subject; secondly, extracting short texts of a user to be processed, dividing time sub-segments, filling data through a time sliding window to obtain text data of the time sub-segments, counting the occurrence frequency of the subject terms after text preprocessing to obtain attribute weight information of the subjects, introducing time attenuation coefficient, sequentially acquiring user attribute features associated with time attributes in time sequence, extracting the user attribute features of the latest time sub-segment as current user attribute features and outputting the current user attribute features. Without external knowledge, the short texts of the social media are semantically expanded by disordered words in the texts, and dynamic user attributes can be extracted from micro-blog texts released or forwarded by users.

Description

technical field [0001] The invention belongs to the field of computers, and in particular relates to a method for extracting dynamic user attributes based on social media. Background technique [0002] Social media services define a whole new way for users to communicate with each other, express themselves and share on the web. With the continuous development of social media, more and more people publish and share instant messages on social media platforms, common social media such as Sina Weibo, Twitter, Facebook and LinkedIn. For example: on the Sina Weibo platform, users can publish microblog information within 140 characters, and these microblogs can be composed of Chinese and English, custom characters, external links, etc. Therefore, effectively analyzing microblog short text streams to detect users' dynamic attributes is of great significance to the research and application of related fields, such as social recommendation, personalized retrieval, online promotion, et...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06Q50/00
CPCG06F16/9535G06Q50/01
Inventor 黄秀杨阳胡玥沈复民邵杰
Owner UNIV OF ELECTRONIC SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products