Method for automatically identifying named entities of traditional Chinese medicine patent literatures
A technology of named entities and patent documents, applied in the field of natural language processing, can solve the problems that named entities cannot be correctly identified, illegal marking sequences, etc., and achieve the effect of avoiding illegal marking and high calculation efficiency.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0062] 1. In the delimiting module, decompose the sentence into several consecutive n-tuples
[0063] Given example sentences: "A method of interplanting three-dimensional cultivation of Houttuynia cordata and Longevity fruit". After being processed by the delimiting module, the sentence is divided into the following forms: "a method of |Houttuynia cordata| and |wanshou fruit|interplanting three-dimensional cultivation|". The symbol "|" indicates the dividing boundary of two character n-tuples, turning the sentence into a sequence of 6-character n-tuples. Each n-tuple may or may not be a named entity, so The boundary of named entities or non-named entities in the sentence is determined by the way of n-tuples.
[0064] 2. In the classification module, classify the n-tuples of characters
[0065] After being processed by the delimiting module, the obtained 6-character n-tuple sequence "a|Houttuynia cordata| and |wanshouguo|interplanting three-dimensional cultivation| method" is used ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


