The invention discloses a method and software for digitizing the full text of a standard document. It belongs to the technical field of standards documentation and information, addresses full-text retrieval and fine-grained retrieval of standard documents, and enables text mining of standards information. Starting from the intended applications of standard documents, the method performs three processes: imaging, conversion to character text, and structuring. The digitization is carried out by a scanned-image processing module, an OCR recognition and correction module, a standard title recording module, a structured full-text production module and the like. A format definition file (schema) for the standard full-text XML and the standard full-text XML file itself are defined. Based on these two files, the software is developed against the defined schema file and processes data including the standard title, single-layer PDF files, double-layer PDF files, the full-text XML file, tables, images and the like; it also supports image and table retrieval and data export within specified parts of a standard, such as the preface, foreword, scope, referenced documents, terms and the like.
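As an illustration of retrieval within specified parts of a standard, the following is a minimal sketch in Python. The abstract does not disclose the schema itself, so every element and attribute name here (`standard`, `foreword`, `scope`, `references`, `terms`, `table`, `image`, `id`) and the helper `find_in_sections` are hypothetical and stand in for whatever the format definition file actually specifies.

```python
import xml.etree.ElementTree as ET

# Hypothetical full-text XML. All element and attribute names are assumptions
# made for illustration; they are not taken from the disclosed schema.
SAMPLE_XML = """\
<standard>
  <title>Example standard</title>
  <foreword><p>Foreword text.</p></foreword>
  <scope>
    <p>Scope text.</p>
    <table id="t1"><caption>Size limits</caption></table>
  </scope>
  <references><p>Referenced document list.</p></references>
  <terms>
    <p>Term definitions.</p>
    <image id="f1" src="fig1.png"/>
    <table id="t2"><caption>Symbols</caption></table>
  </terms>
</standard>
"""


def find_in_sections(root, sections, kind):
    """Return (section, id) pairs for each `kind` element in the named sections."""
    hits = []
    for section in sections:
        # Sections are assumed to be direct children of the root element.
        for sec in root.findall(section):
            # Search at any depth inside the section.
            for el in sec.iter(kind):
                hits.append((section, el.get("id")))
    return hits


root = ET.fromstring(SAMPLE_XML)
tables = find_in_sections(root, ["scope", "terms"], "table")
images = find_in_sections(root, ["terms"], "image")
print(tables)
print(images)
```

Restricting the search to named sections is what makes the structured XML more useful than a double-layer PDF for this purpose: the PDF text layer supports keyword search, but only the XML records which part of the standard a table or image belongs to.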