Automatic identification ancient book introduction system and method based on deep learning model

A deep learning and automatic identification technology, applied in neural learning methods, biological neural network models, natural language data processing, etc., can solve problems such as weak and weak purely manual operation

Pending Publication Date: 2022-02-08
NANJING AGRICULTURAL UNIVERSITY
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the face of the vast amount of ancient books, pure manual operation is already weak. For large-scale or even super-large-scale ancient text corpora, if you want to dig deep into the mutual reference relationship, it is obviously difficult to manually mark them one by one as the predecessors did. Unrealistic

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] In order to more comprehensively and clearly demonstrate the specific implementation process and advantages of the method, the method will be further described in detail below. It is worth noting that the specific implementation process described here is only aimed at the specific objects used in this method, and its purpose is mainly to better explain the implementation of this method, and is not intended to limit the present invention. At the same time, the scope of protection of the present invention also includes other embodiments obtained by other researchers in the field without making substantive creative work on the basis of this case.

[0022] A system for automatically identifying ancient book citations based on a deep learning model, including:

[0023] Corpus preprocessing module. According to the test requirements and the model's requirements for the corpus, the corpus is cleaned and processed, and the training set and the verification set are divided acco...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a system and a method for automatically identifying ancient book introduction based on a deep learning model. The system comprises a corpus preprocessing module; a language model pre-training module; a book introduction entry identification test module; and a book introduction item identification effect evaluation module. The method comprises the following steps: S1, selecting a target research corpus, manually processing and proofreading the target research corpus, and determining a model category; S2, compiling related model codes, modifying and adjusting model parameters according to corpora and research purposes, and performing iterative training on the model; S3, selecting a model evaluation method, scoring the model evaluation method according to a result, and selecting an optimal model for storage; S4, selecting the optimal model as a tool for the user to directly carry out the task in the future, and realizing automatic identification of ancient book introduction. The method has the advantages that a large amount of manual labor is avoided, recognition and extraction of ancient book introduction can be automatically realized directly by means of a computer technology, and the problem that large-scale ancient book text introduction corpora are difficult to construct is effectively solved.

Description

technical field [0001] The present invention relates to the technical field of computer software, in particular to an ancient book quotation automatic recognition system and method developed based on deep learning models SIKU-BERT and SIKU-RoBERTa. Background technique [0002] Different from references in modern Chinese texts, whose citations have a unified description format and summary at the end of the text, citations in ancient texts are scattered throughout the text in various forms, with neither obvious citation signs nor standardized formats. Therefore, the identification of citations in ancient texts is an extremely challenging task. In previous studies, obtaining citation entries mostly relied on manually reading volume by volume. However, in the face of the vast amount of ancient books, pure manual operation is already weak. For large-scale or even super-large-scale ancient text corpora, if you want to dig deep into the mutual reference relationship, it is obviou...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/295G06F40/237G06N3/04G06N3/08
CPCG06F40/295G06F40/237G06N3/08G06N3/044
Inventor 黄水清周好王东波
Owner NANJING AGRICULTURAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products