Feature screening method and device, terminal and medium
Feature screening and feature word technology, applied in special data processing applications, instruments, electrical digital data processing, etc., which can solve the problems of limited manual processing speed, poor timeliness, and poor versatility.
Examples
Embodiment 1
[0051] Figure 1 is a flowchart of a feature screening method provided in Embodiment 1 of the present invention. This embodiment is applicable to extracting feature words from the data of at least one user, and in particular to extracting risk feature words from the data of multiple risk users. The method can be executed by a feature screening device, which can be implemented in software and/or hardware. Referring to Figure 1, the feature screening method provided in this embodiment includes:
[0052] S110. Acquire data of at least one user.
[0053] Here, the user is a user whose features are to be screened, and a feature represents a commonality of the at least one user and is determined by the user data. For example, if the user data is the user's interest data, then the feature is a user interest feature. The user data can be chosen as required; optionally, it can be user behavior data or data uploaded by the user.
[0054...
Embodiment 2
[0102] Figure 2 is a flowchart of a feature screening method provided in Embodiment 2 of the present invention. This embodiment is an optional solution proposed on the basis of the foregoing embodiments, and takes an Internet risk identification scenario as the example application. Referring to Figure 2, the feature screening method provided in this embodiment includes:
[0103] S210. Obtain data submitted by risk users in the risk seed set from complaint feedback and/or penalty history.
[0104] Here, the risk seed set is a set of at least one risk user of the same category.
[0105] Typically, the above data may be risk data uploaded by risk users for promotion. The category of the risk data can be gambling, pornography, violence, etc., and can be obtained from complaint feedback and/or penalty history.
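As an illustration of the risk seed set described in S210, the following is a minimal sketch that groups risk users by category. The record layout (pairs of user ID and category taken from complaint feedback or penalty history), the function name `build_risk_seed_sets`, and the sample values are assumptions made for this example and do not appear in the source.

```python
from collections import defaultdict


def build_risk_seed_sets(records):
    """Group risk users into one seed set per risk category.

    `records` is a hypothetical iterable of (user_id, category) pairs,
    e.g. derived from complaint feedback and/or penalty history.
    """
    seed_sets = defaultdict(set)
    for user_id, category in records:
        # A set keeps each user once, even if reported several times.
        seed_sets[category].add(user_id)
    return dict(seed_sets)


# Hypothetical sample records; "u1" is reported twice for the same category.
records = [
    ("u1", "gambling"),
    ("u2", "gambling"),
    ("u3", "violence"),
    ("u1", "gambling"),
]
seed_sets = build_risk_seed_sets(records)
```

Each value in `seed_sets` is then a set of risk users of the same category, whose submitted data would be the input to the following steps.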
[0106] S220. Directly determine words or phrases in the data as keywords.
[0107] S230. Perform word segmentatio...
Embodiment 3
[0129] Figure 3 is a schematic structural diagram of a feature screening device provided in Embodiment 3 of the present invention. Referring to Figure 3, the feature screening device provided in this embodiment includes an acquisition module 10, a keyword determination module 20, and a feature word determination module 30.
[0130] The acquisition module 10 is used to obtain the data of at least one user; the keyword determination module 20 is used to determine at least one keyword from the data; and the feature word determination module 30 is used to determine a feature word from the at least one keyword according to the user frequency of the keyword, where the user frequency indicates the number of users whose data contains the keyword.
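The user frequency defined above can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes keywords have already been extracted per user, and the selection rule (keeping keywords whose user frequency reaches a `min_users` threshold), the function names, and the sample data are all assumptions introduced for this example.

```python
from collections import Counter


def user_frequency(user_keywords):
    """Count, for each keyword, the number of users whose data contains it."""
    freq = Counter()
    for keywords in user_keywords.values():
        # Deduplicate per user so a keyword repeated by one user counts once.
        freq.update(set(keywords))
    return freq


def select_feature_words(user_keywords, min_users=2):
    """Keep keywords shared by at least `min_users` users (assumed rule)."""
    freq = user_frequency(user_keywords)
    return sorted(word for word, count in freq.items() if count >= min_users)


# Hypothetical keywords already extracted from three users' data.
user_keywords = {
    "user_a": ["casino", "bonus", "casino"],
    "user_b": ["casino", "poker"],
    "user_c": ["bonus", "casino"],
}
feature_words = select_feature_words(user_keywords, min_users=2)
```

Counting users rather than raw occurrences means a keyword that one user repeats many times does not outweigh a keyword that many users share, which matches the stated goal of finding what the users have in common.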
[0131] According to the technical solution of the embodiment of the present invention, the keywords representing the commonness of the users are determined from the user data as the characteristic words according to the u...


