The invention discloses a visual dialogue generation method based on a context perceptual map neural network. The visual dialogue generation method comprises the following steps of 1, preprocessing the text input in a visual dialogue and constructing a
word list; 2, extracting the features of a dialogue image and the features of a dialogue text; 3, obtaining a context
feature vector of the historical dialogue; 4, constructing a context perceptual map; 5, iteratively updating the context perceptual map; 6, carrying out attention
processing on the nodes of the context perceptual map based on a current problem; 7, performing multi-
modal semantic fusion and decoding to generate an answer feature sequence; 8, generating the parameter optimization of a
network model based on the visual dialogueof the context perceptual map neural network; 9, generating a prediction answer. According to the method, the context perceptual map neural network is constructed on the visual dialogue, and the
implicit relationship between different objects in the image can be reasoned by using the text
semantic information with finer
granularity, so that the reasonability and accuracy of the answers generated by an
intelligent agent for question prediction are improved.