The invention relates to a
black box voice confrontation sample generation method with auditory concealment, and relates to the technical field of
artificial intelligence safety. According to the main technical scheme, the method comprises the steps of initializing
simulated annealing parameters; reading in an original audio, and initializing an audio confrontation sample; calculating
black box noise according to an input audio, and performing concealment
processing, namely a time-varying
noise strategy based on
signal variance and concealment improvement based on an auditory effect of human ears; synthesizing a new confrontationsample by using the
black box noise; and inputting a black box voice recognition model, judging whether the
attack is successful or not, if the
attack is successful, stopping iteration, outputting an audio confrontation sample, and if the
attack is not successful, generating a new solution as an input audio according to a Markov criterion to continue iteration until the iteration is completed or the attack is successful. According to the invention, the audio confrontation sample generated by the method has high similarity with the original audio, is more in line with the auditory effect of human ears, has high concealment, and can be successfully attacked without being perceived.