A cross-site user association method based on user name similarity

A username-similarity technique, applied in the computer field, that addresses the poor generality of existing association methods and their reliance on user information that may be incomplete or untruthful. Usernames are easy to obtain, and the method offers high accuracy, broad applicability, and practicality.

Active Publication Date: 2019-10-18
INST OF INFORMATION ENG CHINESE ACAD OF SCI

AI Technical Summary

Problems solved by technology

Most websites cannot obtain a user's personal email address or mobile phone number, so associating users by email address or phone number is not generally applicable.
[0009] Methods that extract features and build models from user profiles and published content depend on the authenticity and completeness of user-related information. Each website requires different information at registration, and some users deliberately enter false information to protect their privacy, so this information may be untrue or incomplete. This degrades the association quality of such methods, which therefore also have limitations.

Examples

Embodiment 1

[0060] Embodiment 1: Judging whether user names a=ye2dai and b=ye8023dai belong to the same user

[0061] Randomly select 1,657,320 usernames from the data set as the username set U, and the threshold τ is 0.15.

[0062] First, username a is expressed as a self-information vector. Its substring content features are ye, e2, 2d, da, and ai; its letter-digit sequence feature is "English letters + numbers + English letters"; and it has no numeric date feature and no keyboard layout feature. The self-information value of each feature of username a is calculated as shown in the following table:

[0063]

[0064] Table 2
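
The features listed in paragraph [0062] can be reproduced with a short sketch. It assumes that the substring content features are adjacent two-character substrings (which matches ye, e2, 2d, da, ai) and that the sequence feature is the order of letter runs and digit runs. The function names are hypothetical and are not taken from the patent.

```python
import re


def bigram_features(username: str) -> list[str]:
    """Return every adjacent two-character substring, in order of appearance."""
    return [username[i:i + 2] for i in range(len(username) - 1)]


def sequence_pattern(username: str) -> str:
    """Describe the order of letter runs and digit runs in the username."""
    runs = re.findall(r"[A-Za-z]+|[0-9]+", username)
    labels = ["English letters" if run[0].isalpha() else "numbers" for run in runs]
    return " + ".join(labels)


print(bigram_features("ye2dai"))   # ['ye', 'e2', '2d', 'da', 'ai']
print(sequence_pattern("ye2dai"))  # English letters + numbers + English letters
```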

[0065] Since most entries of the self-information vector are 0, only the nonzero entries are listed, in the form "feature: self-information value". The self-information vector of username a is:

[0066...
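
As an illustration of the sparse "feature: self-information value" form, the sketch below uses the standard information-theoretic definition I(f) = -log2 P(f). It assumes that P(f) is estimated as the fraction of usernames in U that contain feature f; the excerpt does not state how the patent estimates it. The four-name set is a toy stand-in for the 1,657,320-name set, so the resulting values are not those of Table 2.

```python
import math
from collections import Counter


def bigrams(name: str) -> set[str]:
    """Adjacent two-character substrings of a username."""
    return {name[i:i + 2] for i in range(len(name) - 1)}


def self_information(feature_sets: list[set[str]]) -> dict[str, float]:
    """I(f) = -log2(P(f)), with P(f) = share of usernames containing f (assumption)."""
    total = len(feature_sets)
    counts = Counter(f for features in feature_sets for f in features)
    return {f: -math.log2(c / total) for f, c in counts.items()}


def sparse_vector(features: set[str], info: dict[str, float]) -> dict[str, float]:
    """Keep only nonzero entries, keyed by feature."""
    return {f: info[f] for f in features if info.get(f, 0.0) > 0}


# Toy username set; hypothetical, not the patent's data set.
U = ["ye2dai", "ye8023dai", "asdfjk", "as1001"]
info = self_information([bigrams(u) for u in U])
vec_a = sparse_vector(bigrams("ye2dai"), info)
print(vec_a)
```

In this toy set, features shared by two of the four usernames (ye, da, ai) get a value of 1.0, while features unique to one username (e2, 2d) get 2.0, so rarer features carry more weight.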

Embodiment 2

[0076] Embodiment 2: Judging whether user names a=asdfjk and b=as1001 belong to the same user

[0077] Randomly select 1,657,320 usernames from the data set as the username set U, and the threshold τ is 0.15.

[0078] First, username a is expressed as a self-information vector. Its substring content features are as, sd, df, fj, and jk; its letter-digit sequence feature is "only English letters"; it has no numeric date feature; and it matches keyboard layout feature ①. The self-information value of each feature of username a is calculated as shown in the following table:

[0079]

[0080] Table 4
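
This excerpt does not define the keyboard layout check mentioned in paragraph [0078]. Purely as an illustration, the sketch below tests one plausible rule, namely that every character of the username lies on a single row of a QWERTY keyboard. Both the rule and the function name are assumptions and may differ from the patent's actual definition.

```python
# Letter rows of a standard QWERTY keyboard.
QWERTY_ROWS = ("qwertyuiop", "asdfghjkl", "zxcvbnm")


def same_keyboard_row(username: str) -> bool:
    """True if every character is a letter from one single QWERTY row (hypothetical rule)."""
    name = username.lower()
    return bool(name) and any(all(ch in row for ch in name) for row in QWERTY_ROWS)


print(same_keyboard_row("asdfjk"))  # True
print(same_keyboard_row("as1001"))  # False
```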

[0081] Since most entries of the self-information vector are 0, only the nonzero entries are listed, in the form "feature: self-information value". The self-information vector of username a is:

[...

Abstract

The invention provides a cross-site user association method based on username similarity. The steps comprise: 1) filtering the characters of a plurality of usernames, retaining only English letters and digits; 2) extracting features from the processed usernames, computing the self-information value of each feature, and building self-information vectors from those values; 3) computing the similarity among the usernames from their self-information vectors and, if the similarity is greater than a given threshold τ, determining that the usernames belong to the same user. By judging from username similarity whether usernames belong to the same user, the method can associate accounts held by the same user on different websites.
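
Steps 1 and 3 of the abstract can be sketched as follows. The abstract does not name the similarity measure, so cosine similarity over sparse self-information vectors is an assumption here. The function names and the example vectors are hypothetical, and the default τ = 0.15 is the value used in the embodiments.

```python
import math
import re


def filter_username(raw: str) -> str:
    """Step 1: keep only English letters and digits."""
    return re.sub(r"[^A-Za-z0-9]", "", raw)


def cosine_similarity(u: dict[str, float], v: dict[str, float]) -> float:
    """Cosine similarity of two sparse vectors stored as feature -> value dicts."""
    dot = sum(weight * v[f] for f, weight in u.items() if f in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def same_user(u: dict[str, float], v: dict[str, float], tau: float = 0.15) -> bool:
    """Step 3: associate two usernames when their similarity exceeds tau."""
    return cosine_similarity(u, v) > tau


print(filter_username("ye_2-dai!"))  # ye2dai

# Hypothetical self-information vectors, for illustration only.
vec_a = {"ye": 1.0, "da": 1.0}
vec_b = {"ye": 1.0, "zz": 1.0}
print(same_user(vec_a, vec_b))
```

Storing vectors as dictionaries mirrors the "feature: self-information value" form used in the embodiments, where only nonzero entries are written out.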

Description

Technical field

[0001] The invention relates to the computer field, in particular to a cross-site user association method based on user name similarity.

Background

[0002] At present, more and more companies establish their own websites to provide users with network services such as information retrieval, resource download, and virtual social networking. To use these services, people usually need to register an account on each website and obtain a corresponding user name as a public identity. If the accounts of the same user on different websites can be associated, the user experience of many website applications can be improved. For example, if a user's account on a shopping website without social functions, such as Dangdang.com or JD.com, can be associated with the same user's account on a social networking site such as Sina Weibo or Renren, the social network structure can improve the accuracy of personalized recommendation of s...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/951, G06F16/953, G06F16/9535, G06Q50/00
Inventors: 柳厅文, 王玉斌, 时金桥, 亚静, 李全刚
Owner: INST OF INFORMATION ENG CHINESE ACAD OF SCI