Method for user portrait extraction based on multilayer latent variable model

A latent variable, user technology, used in data processing applications, special data processing applications, instruments, etc.

Inactive Publication Date: 2016-08-17
BEIJING UNIV OF TECH
View PDF6 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Multimodal information in social curation networks poses user modeling challenges

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for user portrait extraction based on multilayer latent variable model
  • Method for user portrait extraction based on multilayer latent variable model
  • Method for user portrait extraction based on multilayer latent variable model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The technical solution of the present invention will be described in more detail below with reference to the drawings and embodiments.

[0044] This embodiment is carried out for the real data of a certain social curation network. The 100 target users in the example are real users in the network, which come from three categories respectively, wherein No.1-No.35 belongs to category one, and No. 36-No.75 belongs to Category 2, and No.76-No.100 belongs to Category 3, including a total of 633,337 favorite items and the forwarding chains corresponding to the favorite items.

[0045] A. Establish a new word thesaurus, including about 300,000 new words, which are commonly used and popular keywords. A stop word lexicon is built, which contains 1433 stop words, which have no specific meaning in the sentence.

[0046] The descriptive information of all collection items collected by 100 target users is segmented using the word segmentation tool ICTCLAS. After word segmentation, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a method for user portrait extraction based on a multilayer latent variable model and relates to the field of data mining and recommendation systems. A user portrait is extracted according to a social curation network, and the method for user portrait extraction based on the multilayer latent variable model is provided according to data of two modes including text description information of collected entries and user behaviors on a forward chain. A latent Dirichlet allocation (LDA) model is introduced to the text description information to obtain user's latent subject distribution, and subject interest distribution is obtained based on the user's latent subject distribution; and users' interest distribution is obtained in combination with the user's latent subject distribution and the subject interest distribution. A users' social community is found based on the multilayer latent variable model, and user recommendation results are obtained in combination with Jensen-Shannon divergence ascending sort. According to the method, the users' social community is found by utilization of information of the two different modes including the user text description information and the user behaviors on the forward chain, and user recommendation is achieved.

Description

technical field [0001] The invention relates to the field of data mining and recommendation systems, in particular to the research and realization of a method for extracting user portraits in a multi-layer latent variable model. Background technique [0002] Social media refers to a series of network applications that are based on the technology and ideology of Web 2.0 and allow users to create and communicate content produced by themselves. Since 2009, some professional social curation networks (such as Pinterest, Snip.it, Scoopit, Huapetal.com, etc.) have officially appeared. The so-called "social curation" is synonymous with the behavior of people collecting, organizing and sharing information on the Internet. Traditional social networks are user-centric, while social curation networks are content-centric. The social curation network is guided by user interests. Users can create content by themselves, and can also link or copy the content they care about on other websit...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06Q50/00G06F17/30G06F17/21
CPCG06F16/9535G06F40/10G06Q50/01
Inventor 毋立芳王丹刘爽张磊刘海英张岱
Owner BEIJING UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products