User portrait method and system of Dirichlet process based on word
A user and word technology, applied in special data processing applications, instruments, unstructured text data retrieval, etc., can solve problems that affect the performance of the method, are not comprehensive, cannot respond to users, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0047] Such as figure 2 As shown, this embodiment provides a user profiling method based on word-to-Dirichlet process, which is used for profiling users by extracting user data from Sina Weibo. The method may include the following steps:
[0048] S101. Extract short documents in user data.
[0049] In specific implementation, such as image 3 The information panel of a Sina Weibo user as shown provides the user’s account information including basic information, work information, education information, and label information identified by himself or others through network social activities. These information are user data. part. In this embodiment, the user data of the user also includes content information such as microblogs and public messages posted or updated by the user daily, and each microblog or public message is a short document. A data table including all short documents is established, and a field of the data table includes at least a short document id correspond...
Embodiment 2
[0060] This embodiment provides a user portrait method based on word-pair Dirichlet process, which is used to profile users by extracting user data in Sina Weibo. The difference between this embodiment and Embodiment 1 is that short documents extracted from user data are segmented according to the time axis, each segment is used as a short document set to extract keywords, and user portraits are performed according to changes in the probability distribution of key words. If it is found that the value of the keyword "food" becomes lower, it can be judged that the user is on a diet.
Embodiment 3
[0062] Corresponding to all the method embodiments of the present application, this embodiment provides a user portrait system based on the word pair Dirichlet process. All or part of the data used by the system to generate the user portrait comes from the keywords obtained through the method of the present application or from any process data obtained during the implementation of the method.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com