Method, system, device and storage medium for constructing a named entity recognition corpus
Patent Information
- Authority / Receiving Office
- CN · China
- Patent Type
- Patents (China)
- Current Assignee / Owner
- SUZHOU UNIV
- Publication Date
- 2022-04-12
Abstract
Description
Technical Field
[0001] The present invention relates to the technical field of natural language processing, and in particular to a method, system, device and storage medium for constructing a named entity recognition corpus.
Background Art
[0002] The purpose of information extraction is to extract entities and the relationships between them from unstructured free text and to transform them into structured representations, thereby providing a data basis for the construction of knowledge bases.
[0003] In the prior art, research on Chinese named entity recognition mainly relies on high-quality manually annotated corpora, such as the January 1998 "People's Daily" corpus, the MSRA corpus from Microsoft Research Asia, the CityU corpus from City University of Hong Kong, and the ACE2005 Chinese corpus. These corpora differ in their named entity categories, annotation rules, and size, and in order to ensure the quality of the co...