Unlock instant, AI-driven research and patent intelligence for your innovation.

Identity Linking Method Based on Multilevel Attribute Embedding and Constrained Canonical Correlation Analysis

A typical correlation analysis and attribute technology, applied in neural learning methods, biological neural network models, instruments, etc., can solve problems such as the difficulty of capturing the implicit connection of different user attributes, and the difficulty of uniformly dealing with various types of attribute texts. The effect of data acquisition cost and method training cost, reducing the amount of prior information, and strong robustness

Active Publication Date: 2022-06-28
XIHUA UNIV
View PDF18 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The purpose of the present invention is to provide an identity based on multi-level attribute embedding and constrained canonical correlation analysis, aiming at the problem that it is difficult to uniformly deal with various types of attribute texts and capture the implicit connection between different user attributes in current user identity links. The chaining method, which solves the above problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Identity Linking Method Based on Multilevel Attribute Embedding and Constrained Canonical Correlation Analysis
  • Identity Linking Method Based on Multilevel Attribute Embedding and Constrained Canonical Correlation Analysis
  • Identity Linking Method Based on Multilevel Attribute Embedding and Constrained Canonical Correlation Analysis

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0035] see figure 1 , an identity linking method based on multi-level attribute embedding and constrained canonical correlation analysis, including the following steps:

[0036] (a) Preprocess social network user data; represent social network users as nodes, and relationships between users (such as friends, followers / fans, etc.) as edges, and construct an undirected and unweighted graph G=(V, E , A), where V represents the set of users in the network, E represents the set of relationships between users (such as friend relationship, follower / fan relationship, etc.), and A represents the set of user attributes, such as user name, occupation and educational experience, etc.

[0037] (b) Embedding multi-level text attributes; first divide the text attributes of each network into three parts A = (A c ,A w ,A t ), where A c Represents a character-level attribute, A w Represents word-level attributes, A t represents topic-level attributes; then three corresponding user feature...

Embodiment 2

[0075] The present invention will be further described below in conjunction with specific examples. This example is two real social networks collected from the Internet, Sina Weibo and Douban. The specific information is shown in Table 1.

[0076] Table 1 Weibo-Douban Network Data Statistics

[0077]

[0078] Step (a): preprocessing social network user data. ;

[0079] Consider the users in the two social networks Weibo and Douban to be matched as network G X / G Y = node V in (V, E, A), and use different numbers to distinguish different users. For example, users in Weibo network correspond to numbers 0 to 9713, and users of Douban network correspond to numbers 9714 to 19239.

[0080] The following / fan relationship between users is regarded as an edge E in the network, that is, if two users have a following or fan relationship, an edge (u) is constructed between them. i ,u j ) ∈ E.

[0081] Use the respective screen names (ie nicknames) of users in the two networks as ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an identity linking method based on multi-level attribute embedding and constrained typical correlation analysis. The method first performs data preprocessing on social network user data, constructs an undirected and unweighted graph, and then embeds multi-level text attributes to form corresponding user feature matrix; then perform network structure embedding and user feature aggregation, and then project the two social networks into the same latent vector space based on the linear projection of constrained canonical correlation analysis, so that the distance between matching users in the space is the shortest; finally through Comparing the distance between any user and all users in another network in the same latent vector space, and then determining the matching user of the user; the present invention is applicable to situations where user attributes are missing or the network structure is sparse; The amount of prior information is reduced, which solves practical problems in the case of lack of prior information, and saves the cost of data collection and method training.

Description

technical field [0001] The invention relates to the technical field of user identity linking, in particular to an identity linking method based on multi-level attribute embedding and constrained canonical correlation analysis. Background technique [0002] User Identity Linkage, also known as "User Alignment", "User Identification", etc., aims to identify the same natural person on different social networks. Security and other fields are becoming more and more important; a large number of social network applications, including friend recommendation, information diffusion, link prediction, network dynamic analysis, etc., demonstrate the necessity and benefits of user identity linking. [0003] Early research on user identity linking across social networks mainly uses publicly available user attribute information to obtain user characteristics, including user basic information (such as username, gender, location), user-generated content (such as microblogs, posts, articles) an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/9536G06F40/289G06N3/08G06Q50/00
CPCG06F16/9536G06F40/289G06Q50/01G06N3/088
Inventor 陈晓亮陈白杨李显勇杜亚军
Owner XIHUA UNIV