Abnormal telephone recognition method and system based on feature selection and integrated learning

An ensemble learning and feature selection technology, applied in ensemble learning, telephone communication, character and pattern recognition, etc., can solve problems such as sample imbalance, reduced model recognition efficiency, poor performance of abnormal phone recognition models, and low accuracy. Effect

Pending Publication Date: 2019-07-30
UNIV OF JINAN
View PDF6 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The inventors found that a single sample feature has very limited ability to describe the original sample, and excessively high-dimensional features will increase the complexity of the model and reduce the recognition efficiency of the model, so it is very important to select effective features and contain rich information
[0005] For the first question of sample characteristics: how to obtain sufficient sample characteristics? Many abnormal phone models only use two or three features to judge the sample category, such a model is not advisable
[0011] (1) Insufficient mining of user call behavior features, lack of effective sample information;
[0012] (2) The sample dimension is inappropriate, too high or too low will affect the prediction results;
[0013] (3) In the actual phone samples, the samples of abnormal calls are far smaller than the normal phone samples, so there is a huge problem of sample imbalance, which affects the model results
[0014] (4) The performance of the abnormal phone identification model with a single classification algorithm is poor

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Abnormal telephone recognition method and system based on feature selection and integrated learning
  • Abnormal telephone recognition method and system based on feature selection and integrated learning
  • Abnormal telephone recognition method and system based on feature selection and integrated learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0055] In one or more implementations, a method for identifying abnormal phone calls based on feature selection and integrated learning is disclosed, such as Figure 6 As shown, it specifically includes the following steps:

[0056] (1) Construct a mixed data set;

[0057]In the data sample, the abnormal call behavior detection problem belongs to the category imbalance problem because the number of instances of normal call behavior is much larger than that of abnormal ones. And there are some "dirty data" of unknown categories and a small amount of abnormal calls in the normal call samples provided by the operator. The focus of the research in this example is the analysis of abnormal phone behaviors under high-dimensional small samples, so when sampling the sample set, it is necessary to construct a mixed data set to restore the real data set such as figure 1 shown.

[0058] (1) Mining sample features through the user's historical call behavior in the window from the start ...

Embodiment 2

[0138] In one or more embodiments, an electronic device is provided, including a memory, a processor, and computer instructions stored in the memory and executed on the processor. When the computer instructions are executed by the processor, the steps in the method are completed. Each operation, for the sake of brevity, will not be repeated here.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an abnormal telephone recognition method and system based on feature selection and integrated learning. The method comprises the steps of constructing a mixed data set; miningsample characteristics through historical call behaviors of the user in a window from the starting time to the ending time; combining and optimizing the characteristics based on the user call behavior, and mining the characteristics with behavior information from the aspects of time, frequency, short message, flow, position and contact person; performing oversampling based on user call behavior samples, increasing the number of few samples, and reducing the influence of sample imbalance on the model; performing feature dimension reduction processing on the user call behavior sample; and establishing a model by using the integrated learning training data set, and carrying out abnormal telephone identification. According to the method, the original information of the sample is fully restoredin a feature mining combination and dimensionality reduction mixing mode, so that the prediction precision is improved.

Description

technical field [0001] The invention belongs to the technical field of machine learning and data mining, and in particular relates to a method and system for identifying abnormal calls based on feature selection and integrated learning. Background technique [0002] The statements in this section merely provide background information related to the present invention and do not necessarily constitute prior art. [0003] Traditional identification models such as black and white list technology, abnormal traffic detection technology, etc. are currently the main forms of abnormal phone identification. With the rapid development of communication technology and the improvement of people's living standards, fraudulent calls have become more and more low-cost and diverse. Due to the defects of various aspects, the traditional abnormal phone identification model has great defects in the initiative and accuracy of prevention. In order to solve this problem, many schemes have been pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04M3/22G06N20/20G06K9/62
CPCH04M3/2281G06N20/20H04M2203/6027H04M2203/60G06F18/2135G06F18/24323
Inventor 纪科袁雅涵孙润元王琳陈贞翔马坤
Owner UNIV OF JINAN
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products