The invention belongs to the technical field of image recognition, and particularly relates to a text positioning method and system based on the visual structure attribute. Based on the visual attribute of a text, by means of color polarity difference transformation and edge neighborhood tail end bonding, abundant closed edges are detected so that abundant candidate connection elements can be obtained, then character stroke attributive character and text colony attributive character screening is conducted, the connection elements belonging to characters are extracted from the candidate connection elements, and then the final text is positioned through multi-channel blending and repeated connection element removal. The method is high in robustness and can be adapted to the situation that multiple word language categories are mixed, or various font styles exist, or arrangement directions are random, or background interference exists and other situations, the positioned text can be directly provided for OCR software for recognition, and OCR software recognition rate can be increased. The text positioning method and system based on the visual structure attribute can be applied to image video retrieval, junk information blocking, vision assisted navigation, street view positioning, industrial equipment automation and other fields.