Method and device for recognizing class of social contact short texts and method and device for training classification models
A classification model and short text technology, applied in text database clustering/classification, unstructured text data retrieval, special data processing applications, etc. Accuracy and the effect of enriching user experience
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0036] figure 1 It is a flow chart showing the method for identifying categories of social short texts according to Embodiment 1 of the present invention. The method can be performed, for example, on a microblog server.
[0037] refer to figure 1 , in step S110, acquire social short text data.
[0038] For example, the obtained social short text data is shown in Table 1 below:
[0039] Table 1
[0040]
[0041] It can be seen that the category information of the social short text data in Table 1 is unknown, and subsequent processing is required to identify the category of the social short text.
[0042] In step S120, text feature data is extracted from the social short text data.
[0043] Here, the text feature data may include, but not limited to, at least one of the following: plain text feature data, writing habit feature data, social feature data and user feature data.
[0044] Wherein, the plain text feature data may include the data of the importance index of the ...
Embodiment 2
[0062] figure 2 It is a flow chart showing the training method of the short text classification model in Embodiment 2 of the present invention. The short text classification model is used to identify the categories of social short texts.
[0063] refer to figure 2 , in step 210, a plurality of labeled sample data is acquired, each of the labeled sample data includes social short text data, labeled text feature data and category information.
[0064] Here, the text feature data may include, but not limited to, at least one of the following: plain text feature data, writing habit feature data, social feature data, and user feature data.
[0065] In addition, considering that social short texts have both media and social attributes, reasonable categories need to be set for social short texts. Therefore, the category information can be news events, advertisements, non-commercial sharing or private conversations. Among them, news events, advertisements, and non-commercial shar...
Embodiment 3
[0081] image 3 It is a logical block diagram showing the device for identifying categories of social short texts according to Embodiment 3 of the present invention. can be used to execute as figure 1 The method steps of the illustrated embodiment.
[0082] refer to image 3 , the device for identifying the category of the social short text includes a text data acquisition module 310 , a feature data extraction module 320 , a category information acquisition module 330 and a category information determination module 340 .
[0083] The text data acquiring module 310 is used for acquiring social short text data.
[0084] The feature data extraction module 320 is used to extract text feature data from the social short text data.
[0085] Here, the text feature data may include at least one of the following: plain text feature data, writing habit feature data, social feature data and user feature data.
[0086] Specifically, the plain text feature data may include the data of...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com