Method and device for structuring corpus
A structured, corpus technology, applied in the field of information processing, can solve the problem of low efficiency in extracting content, and achieve the effect of rapid acquisition
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0021] An embodiment of the present invention provides a method for structuring a corpus, the method comprising: obtaining a corpus file corresponding to the corpus to be structured, and adding segmentation tags between different specific contents of the corpus file according to the font attribute information of the characters in the corpus file to generate an intermediate file; according to the corresponding relationship between the font attribute information set in the preset automatic structuring rules and the specific content, extract the character information corresponding to the specific content from the intermediate file; according to the set in the automatic structuring rules The hierarchical relationship of the different specific contents of the system will combine the extracted character information and upload it to the server, so that the server can store structured corpus files.
[0022] like figure 1 As shown, the embodiment of the present invention provides a met...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 