The invention belongs to a
data information processing technology, and discloses a
deep learning adversarial sample generation method and
system based on a second-order method, and the method comprises the steps: carrying out the secondary Taylor expansion of a neural network function in a tiny neighborhood of an input sample X, i.e., Lp (p belongs to [2, 0, infinity]) norm constraint, and replacing a nonlinear part of a neural network; and constructing a
dual function through a
Lagrange multiplier method to calculate an extreme value to solve the optimal disturbance
delta, so that the confidence coefficient of the confrontation sample X '= X +
delta which is judged as a correct class is reduced to the minimum, or the confidence coefficient of the confrontation sample X' which is judged asa target class is increased to the maximum. According to the method, the operation of reducing the confidence coefficient of the correct output class is adopted for the target-free
attack; and for the target
attack, the operation of improving the confidence of the target class is adopted. The method provided by the invention can avoid falling into a local extreme value, generates a high-quality adversarial sample at extremely low cost, is applied to adversarial training of the deep neural network, and can effectively improve the defense effect.