Method for recognizing topics of nonequilibrium interactive texts based on example obtaining
A topic recognition, unbalanced technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problem of low topic recognition accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0063] A topic recognition method for unbalanced interactive text based on instance acquisition, comprising the following steps, referring to figure 1 It consists of three steps:
[0064] Step 1: Filter instances from the source dataset stage:
[0065] (1) Determine the feature set representing the instance in the common feature set, that is, from the source data set (denoted as Dset Source ) and the target data set (denoted as Dset Target ) from the common feature set to select the feature set that can represent the instance and tend to the minority class.
[0066] (2) Sort and filter source dataset instances by cosine similarity. Use the cosine function to calculate the similarity between each minority class target instance and the same class instance in the source data set, and sort in descending order according to the value of this similarity, and for each minority class target instance, obtain the first K similar to the target data set instance The source dataset inst...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 