Method and system for identifying same user among different platforms
A user and platform technology, applied in character and pattern recognition, data processing applications, special data processing applications, etc. It can solve problems such as uncommon accounts and the difficulty of judging whether two Weibo accounts belong to the same user, with the effects of good development prospects and high accuracy.
Examples
Embodiment 1
[0041] This embodiment provides a method for identifying the same user across different platforms. Figure 1 shows a flow chart of this embodiment, which includes:
[0042] Step S101: Collect a preset number of pieces of text information published by users on the first platform and the second platform;
[0043] Designate two platforms, such as Sina Weibo and Tencent Weibo, and collect a preset number of pieces of text information published by users of the two Weibo platforms. The specific collection process is as follows:
[0044] Step S201: Build a user queue;
[0045] Step S202: Select a user as a seed user and add it to the user queue;
[0046] Step S203: Take a user out of the user queue and fetch that user's profile information and published text information through the API provided by Weibo. The profile information includes the users this user follows and the users who follow this user; add both groups to the user queue;
[0047] Step S204: Repeat the above p...
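The queue-driven collection in steps S201 to S203 amounts to a breadth-first traversal of the follow graph. The sketch below illustrates that idea only. It is not the patent's implementation: `fetch_profile` is a hypothetical stand-in for the Weibo API call, the `seen` set for skipping already-queued users is an added assumption, and because step S204 is truncated in the source, stopping after `max_users` users is also an assumption.

```python
from collections import deque
from typing import Callable, Dict, List, Tuple

# Hypothetical profile record: (users followed, followers, published posts).
Profile = Tuple[List[str], List[str], List[str]]


def crawl_users(
    seed: str,
    fetch_profile: Callable[[str], Profile],
    max_users: int,
) -> Dict[str, List[str]]:
    """Collect posts breadth-first, starting from a seed user."""
    queue = deque([seed])  # S201 and S202: build the queue, add the seed user
    seen = {seed}          # assumption: avoid queueing the same user twice
    posts_by_user: Dict[str, List[str]] = {}

    # Assumed stopping rule: the source's step S204 is truncated.
    while queue and len(posts_by_user) < max_users:
        user = queue.popleft()  # S203: take a user out of the queue
        following, followers, posts = fetch_profile(user)
        posts_by_user[user] = posts
        # S203: add followed users and followers to the queue
        for neighbor in following + followers:
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append(neighbor)
    return posts_by_user


# In-memory stand-in for the platform API, for illustration only.
FAKE_GRAPH: Dict[str, Profile] = {
    "alice": (["bob"], ["carol"], ["post a1", "post a2"]),
    "bob": (["alice"], ["dave"], ["post b1"]),
    "carol": (["alice"], [], ["post c1"]),
    "dave": (["bob"], [], ["post d1"]),
}


def fake_fetch(user: str) -> Profile:
    return FAKE_GRAPH[user]


collected = crawl_users("alice", fake_fetch, max_users=3)
```

Passing the fetch function in as a parameter keeps the traversal logic separate from any particular platform API, so the same loop could be run once per platform.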
Embodiment 2
[0063] This embodiment provides a system for identifying the same user across different platforms. Figure 2 shows a schematic structural diagram of this embodiment, which includes:
[0064] A collection module 101, configured to collect a preset number of pieces of text information published by users on the first platform and the second platform;
[0065] An annotation module 102, configured to annotate a part of the text information;
[0066] A first sample acquisition module 103, configured to use the annotated text information as labeled samples and the unannotated text information as samples to be tested;
[0067] A second sample acquisition module 104, configured to extract topic features from the labeled samples and the samples to be tested using the LDA model, perform cosine similarity calculations on the extracted topic features of each, and use the obtained similarity values as training samples respectively with the test s...
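The cosine similarity step named for module 104 can be illustrated with a short sketch. It assumes each account's text has already been reduced to a topic-proportion vector by an LDA model; the LDA fitting itself is not shown, and the three example vectors are made-up values for illustration, not data from the patent.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length topic vectors."""
    if len(a) != len(b):
        raise ValueError("topic vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen here for an all-zero vector
    return dot / (norm_a * norm_b)


# Hypothetical topic proportions over three topics.
account_a = [0.7, 0.2, 0.1]  # an account on the first platform
account_b = [0.6, 0.3, 0.1]  # second-platform account with similar topics
account_c = [0.1, 0.2, 0.7]  # second-platform account with different topics

sim_ab = cosine_similarity(account_a, account_b)
sim_ac = cosine_similarity(account_a, account_c)
```

With these values `sim_ab` is larger than `sim_ac`, which is the kind of signal a downstream classifier could use as a feature when deciding whether two accounts belong to the same user.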