The invention belongs to the technical field of
big data retrieval, and provides a multi-
modal retrieval method and
system based on weak supervision hash learning. In order to solve the problem of incomplete
pairing information among modals, the method comprises the following steps: acquiring a to-be-retrieved sample, and carrying out hash code calculation on the to-be-retrieved sample; performing 0 / 1 XOR operation on the hash code of the to-be-retrieved sample and the hash code in the retrieval
database to calculate a
Hamming distance, and returning similar data from small to large according to the
Hamming distance; the construction process of the retrieval
database comprises the following steps: establishing a semi-supervised and semi-paired cross-
modal hash target function based on intra-
modal pairing similarity, inter-modal
pairing similarity and complemented
label information of each modal; hash representation is obtained by optimizing an objective function of semi-supervised and semi-paired cross-modal Hash, sampling is carried out from the Hash representation, then corresponding partial cross-modal similarity information is embedded into
Hash function learning, and finally, a retrieval
database is generated by utilizing the embedded
Hash function. According to the method, the calculation complexity is reduced, and the retrieval precision is improved.