The embodiment of the invention provides a Word document fragmentization method and device. In order to solve the problem that in the prior art, the retrieval efficiency is low when the target content is retrieved in a Word document, the Word document fragmentization method comprises the steps that firstly, all paragraphs of the Word document are obtained; secondly, according to the sequential order of the paragraphs in the Word document, the paragraph attributes of the paragraph are obtained in sequence, all the paragraph attributes which first appear in the Word document are extracted, and a paragraph attribute set of the Word document is generated; thirdly, a paragraph attribute recognition model is utilized to extract all headline paragraph attributes in the paragraph attribute set, and a headline paragraph attribute set is generated; fourthly, according to the headline paragraph attribute set, all headlines in the Word document are recognized, a headline tree of the Word document is generated, and the Word document fragmentization is achieved. Therefore, a user can directly retrieve the document paragraphs containing the target content in the fragmentized Word document or retrieve the target content in the headline tree of the Word document, and the retrieval efficiency is improved when the user retrieves the headlines of the Word document.