Intranet transaction identification method based on random forest and naive Bayes model

A technology of Bayesian model and random forest, which is applied in the fields of instrumentation, finance, and data processing applications, etc. It can solve the problems of slow optimization speed of genetic algorithm and difficulty in achieving precision in optimization results, achieving improved accuracy and good accuracy , easy-to-achieve effects

Pending Publication Date: 2019-09-06
CHINA THREE GORGES UNIV
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The genetic algorithm has a powerful global search ability, the initial search speed is very fast,

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Intranet transaction identification method based on random forest and naive Bayes model
  • Intranet transaction identification method based on random forest and naive Bayes model
  • Intranet transaction identification method based on random forest and naive Bayes model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] like Figure 1-3 As shown, the insider trading identification method based on random forest and naive Bayesian model, the specific steps are as follows,

[0060] Step 1: Use a web crawler to obtain the stock samples with insider trading announced by the China Securities Regulatory Commission and their corresponding characteristic indicators under different event time windows, including securities market micro indicators, company financial indicators and corporate governance indicators; obtain different event time window periods The insider trading sample dataset under;

[0061] Step 2: Use the Gini coefficient of the random forest model to calculate the importance of feature indicators, and select the best combination of feature indicators according to the importance score;

[0062] Step 3: Build a Bayesian identification model for insider trading based on the selected feature index set, and use the insider trading sample data set as the training data set to train the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an intranet transaction identification method based on a random forest and a naive Bayes model. The method comprises the following steps: acquiring an internal screen transaction sample data set under different time window periods, screening and constructing a characteristic index set by adopting a random forest model, constructing a Bayesian identification model of the internal screen transaction according to the screened characteristic index set, and performing internal screen transaction identification by adopting the Bayesian identification model to obtain a resultof whether the internal screen transaction exists or not; and after the event, supervising and verifying whether the internal transaction recognition result is correct, and training and updating the Bayesian recognition model according to the recognition result. According to the invention, the stock intranet transaction identification model is established, so that whether the test target is subjected to intranet transaction or not is accurately identified; a quasi-Newton method and a genetic algorithm are combined, so that parameters of the random forest model are quickly optimized to an optimal solution with high precision, and the solution of the optimal solution has small dependence on an initial value; the method is easy to implement and stable in performance, and robustness and accuracy can be further improved along with increase of sample data.

Description

technical field [0001] The invention belongs to the field of securities market supervision, and in particular relates to an insider trading identification method based on a random forest and a naive Bayesian model. Background technique [0002] Insider trading in the securities market seriously interferes with the normal operation of the stock market, especially harming the interests of small and medium investors. In recent years, how to identify insider trading has become a hot spot in academic circles. [0003] Compared with other classifiers, Naive Bayesian classifier is one of the classifiers with better learning efficiency and classification effect. Its advantages are simple algorithm logic, easy implementation, and stable algorithm performance. The basis of Naive Bayesian classification is probabilistic reasoning. The probability of occurrence of each condition is known, and the reasoning and decision-making tasks are completed. [0004] Genetic algorithm is a search...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06Q40/04
CPCG06Q40/04
Inventor 邓尚昆王晨光曹成航
Owner CHINA THREE GORGES UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products