Method for obtaining APP service feature library and corresponding device

A method for building an APP service feature library, applied in the field of big data. It addresses shortcomings of existing approaches, such as features that cannot meet DPI identification requirements, common features misidentified as private features, and feature blind spots. It broadens coverage and applicable scenarios, eliminates data blind spots, and improves identification accuracy.

Active Publication Date: 2019-09-17
WUHAN GREENET INFORMATION SERVICE

AI Technical Summary

Problems solved by technology

[0003] In the prior art, there are several different solutions for APP feature recognition. For example, patent application CN201710453676.X discloses a method and device for obtaining APP recognition rules. The service feature data it generates has data blind spots. Features are collected by word segmentation, so each feature is a single value and no composite features are used, which cannot meet the requirements of DPI identification.
[0004] Patent application CN201810346473.5 discloses a method for building an automatic APP traffic identification model. Although it takes a locally simulated packet-capture data set into account, that data set contains no iOS service data. Because the services of iOS applications and Android applications are isolated from each other to some degree, the method cannot properly recognize the features of both systems, and its coverage is narrow.
[0005] Patent application CN201610994224.8 discloses an APP identification method and system. It analyzes only the URLs in the installation package, so identification is weak and feature blind spots arise easily, causing common features to be identified as private features.


Examples


Embodiment 1

[0063] This embodiment provides a method for obtaining an APP service feature library. With this method, a service feature library labeled with APP names can be constructed, and the library can then be used to determine which APP a piece of user data of unknown service type belongs to. This lays the foundation for analyzing the network behavior of users who use APPs on the Internet. The method is applicable to various scenarios that involve APP service feature recognition, for example feature analysis of service data in DPI technology, precision marketing based on service data, or analysis of service data to obtain user portraits.
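The lookup described above, where user data of unknown service type is attributed to an APP through the feature library, can be sketched as follows. This is only an illustration: the library layout (a host suffix combined with a URL path prefix), the entry contents, and the function name `identify_app` are assumptions made for the example and do not come from the patent text.

```python
from typing import Optional

# Hypothetical feature library. Each entry is a composite feature
# (host suffix plus URL path prefix) labeled with an APP name.
FEATURE_LIBRARY = [
    {"host_suffix": "api.example-video.com", "path_prefix": "/v1/play", "app": "ExampleVideo"},
    {"host_suffix": "example-chat.net", "path_prefix": "", "app": "ExampleChat"},
]


def host_matches(host: str, suffix: str) -> bool:
    """True if host equals suffix or is a subdomain of it."""
    return host == suffix or host.endswith("." + suffix)


def identify_app(host: str, path: str, library=FEATURE_LIBRARY) -> Optional[str]:
    """Return the APP name of the most specific matching feature, or None."""
    best_app, best_score = None, -1
    for entry in library:
        if host_matches(host, entry["host_suffix"]) and path.startswith(entry["path_prefix"]):
            # Longer host suffix and path prefix means a more specific feature.
            score = len(entry["host_suffix"]) + len(entry["path_prefix"])
            if score > best_score:
                best_app, best_score = entry["app"], score
    return best_app
```

A record whose host and path match no entry returns `None`, which corresponds to data the library cannot yet attribute to any APP.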

[0064] As shown in Figure 1, the method for obtaining the APP service feature library in this embodiment includes the following steps:

[0065] In step 101, an APP installation ...

Embodiment 2

[0146] In an actual application scenario, the simulated service data generated according to the method of Embodiment 1 contains the device number of the simulator. Because the number of simulators is limited, a large amount of the simulated service data carries the same device number. During feature extraction, that device number will be mistaken for a service feature, which affects the identification of subsequent user data.

[0147] To solve this problem, this embodiment improves the method of Embodiment 1 to obtain a more accurate service feature library. Most of the implementation process is the same as in Embodiment 1 and is not repeated here; only the improvements are described below.

[0148] In this embodiment, the learning data set includes a first label data set, a second label data set, and a third label data set, where the establishment process of the first l...
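The problem can be reproduced with a small sketch. It assumes, purely for illustration, a frequency-based extractor that keeps any `key=value` query token present in most captured requests; the URLs, the `devid` parameter name, and the 0.8 threshold are all hypothetical. The remedy used by this embodiment is described in the paragraphs that follow and is not reproduced here.

```python
from collections import Counter
from urllib.parse import urlsplit, parse_qsl


def frequent_tokens(urls, min_share=0.8):
    """Return key=value query tokens present in at least min_share of the URLs."""
    counts = Counter()
    for url in urls:
        # A set ensures each token is counted at most once per URL.
        counts.update({f"{k}={v}" for k, v in parse_qsl(urlsplit(url).query)})
    threshold = min_share * len(urls)
    return sorted(tok for tok, n in counts.items() if n >= threshold)


# Hypothetical traffic captured from two different APPs on the same simulator.
SIMULATED = [
    "http://a.example.com/feed?appid=news01&devid=EMU-0001&page=1",
    "http://a.example.com/item?appid=news01&devid=EMU-0001&id=7",
    "http://b.example.com/play?appid=video02&devid=EMU-0001&q=hd",
]

print(frequent_tokens(SIMULATED))
```

The only token that passes the threshold is the simulator's device identifier, which belongs to the test environment and not to any APP.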

Embodiment 3

[0160] To show more clearly how the learning data set and the service feature library of Embodiment 1 are established, the concept and implementation process of that embodiment are briefly explained again with reference to Figure 7 and Figure 8.

[0161] Figure 7 gives a brief overview of how the learning data set is established. Obtain the APP installation package and determine the name of the APP to which it belongs; parse the installation package to obtain URL data; label the URL data with the APP name; and establish the first label data set.

[0162] Install the APP installation package on the simulator, capture the simulated service data generated by each APP, label the simulated service data with the APP name, and establish the second label data set.

[0163] Parse the APP installation package, obtain the package name, and extract the APP identifier...
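The first two steps (paragraphs [0161] and [0162]) can be sketched as below. The sketch assumes the installation package is a ZIP-format archive, as Android APK files are, and finds URLs with a simple byte-level regular expression. The function names and the tuple layout of the label sets are hypothetical, and the patent's own parsing procedure may differ.

```python
import io
import re
import zipfile

# Matches ASCII http/https URLs embedded in binary data.
URL_RE = re.compile(rb"https?://[A-Za-z0-9._~:/?#@!$&'()*+,;=%-]+")


def urls_from_package(package_bytes: bytes) -> set:
    """Scan every member of a ZIP-format installation package for URL strings."""
    found = set()
    with zipfile.ZipFile(io.BytesIO(package_bytes)) as archive:
        for name in archive.namelist():
            for match in URL_RE.findall(archive.read(name)):
                found.add(match.decode("ascii"))
    return found


def build_first_label_set(app_name: str, package_bytes: bytes) -> list:
    """Label each URL found in the package with the APP name."""
    return sorted((url, app_name) for url in urls_from_package(package_bytes))


def build_second_label_set(app_name: str, captured_requests: list) -> list:
    """Label each request captured on the simulator with the APP name."""
    return [(request, app_name) for request in captured_requests]
```

Both sets share the same `(sample, app_name)` shape in this sketch, so they can be concatenated into one learning data set.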



Abstract

The invention discloses a method for obtaining an APP service feature library and a corresponding device. The method comprises the steps of: obtaining an APP installation package, current network service data, and simulated service data; parsing the APP installation package, the current network service data, and the simulated service data respectively to obtain the APP name to which each belongs, and generating a learning data set; performing feature extraction on the current network service data and the simulated service data to obtain a service feature tree containing at least one service feature; and performing feature matching between the service feature tree and the learning data set to determine the APP name to which each service feature in the tree belongs, and generating a service feature library. Because current network service data has a certain complexity, using it eliminates data blind spots and safeguards the recognition rate. Because the service feature library is constructed from multiple feature dimensions, service data with low distinctiveness can be identified effectively, feature identification accuracy is improved, and manual effort is reduced.
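The extraction and matching steps can be illustrated with a minimal sketch. The abstract does not specify how the service feature tree is laid out, so the sketch assumes a trie keyed by reversed host labels followed by URL path segments, and it assumes well-formed URLs. All function names are hypothetical.

```python
from urllib.parse import urlsplit


def feature_path(url):
    """Split a URL into a root-to-leaf path: reversed host labels, then path segments."""
    parts = urlsplit(url)
    host_labels = list(reversed(parts.hostname.split(".")))
    segments = [s for s in parts.path.split("/") if s]
    return tuple(host_labels + segments)


def build_feature_tree(urls):
    """Build a nested-dict trie; every root-to-node path is a candidate feature."""
    tree = {}
    for url in urls:
        node = tree
        for label in feature_path(url):
            node = node.setdefault(label, {})
    return tree


def iter_features(tree, prefix=()):
    """Yield every root-to-node path in the trie."""
    for label, child in tree.items():
        path = prefix + (label,)
        yield path
        yield from iter_features(child, path)


def build_feature_library(tree, learning_set):
    """Keep only features whose matching labeled samples all belong to one APP."""
    library = {}
    for feature in iter_features(tree):
        apps = {app for url, app in learning_set
                if feature_path(url)[:len(feature)] == feature}
        if len(apps) == 1:  # shared or unlabeled features are left out
            library[feature] = next(iter(apps))
    return library
```

Features matched by samples from more than one APP are treated as common features and excluded, and features with no labeled sample stay unattributed. This is one simple way to avoid labeling a common feature as private to a single APP.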

Description

Technical field

[0001] The invention belongs to the field of big data and, more specifically, relates to a method and a corresponding device for obtaining an APP service feature library.

Background

[0002] In recent years, computer technology has developed rapidly, and fields such as big data and machine learning have attracted particular attention. On the Internet, user portraits and precision marketing have become industry buzzwords, and the basis of these technologies is labeled data. For DPI (Deep Packet Inspection) products, service traffic is the data, and making good use of service data is their top priority. Among these requirements, APP (Application) service identification is the key technology for analyzing and processing the basic data.

[0003] In the prior art, there are several different solutions for APP feature recognition. For example, the patent application number CN201710453676.X...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/901, G06F16/903, G06F16/951, H04L29/08
CPC: G06F16/9027, G06F16/90335, G06F16/90344, G06F16/951, H04L67/34, H04L67/146, H04L67/51
Inventor: 杨琨, 叶志钢, 张本军
Owner: WUHAN GREENET INFORMATION SERVICE