Domain term automatic extraction method based on abnormal sub-graph detection
A technology for automatic term extraction, applied in natural language data processing, instrumentation, electrical digital data processing, etc., that addresses problems such as unstable term extraction.
Embodiment Construction
[0039] The principles, advantages and implementation steps of the present invention will be easier to understand in conjunction with the above algorithm description and the following examples.
[0040] The present invention addresses the existing problems through the following technical steps:
[0041] Step 1. Preprocess the text data: split it into sentences, segment each sentence into words, and tag each word with its part of speech. The THULAC word segmentation tool is used for this step.
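The preprocessing in Step 1 can be sketched as follows. This is a minimal illustration, not the patent's implementation: the set of sentence-ending punctuation marks and the helper names `split_sentences` and `tag_sentences` are assumptions, and the tagger is passed in as a callable so that the sketch does not depend on any particular tool.

```python
import re

# Assumed set of sentence-ending punctuation (Chinese and Western forms).
_SENTENCE_END = re.compile(r"(?<=[。！？!?])\s*")


def split_sentences(text):
    """Split raw text into sentences, keeping the closing punctuation."""
    return [s.strip() for s in _SENTENCE_END.split(text) if s.strip()]


def tag_sentences(sentences, tagger):
    """Segment and POS-tag each sentence.

    `tagger` is a hypothetical callable that maps a sentence string to a
    list of (word, tag) pairs. With THULAC, the `cut` method of a
    `thulac.thulac()` instance could presumably fill this role; that is
    not exercised here.
    """
    return [[(word, tag) for word, tag in tagger(s)] for s in sentences]
```

Passing the tagger as an argument keeps the sentence splitting testable on its own and lets a different segmentation tool be substituted without changing the later steps.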
[0042] Step 2. Generate all candidate terms using the n-gram method and grammatical rules, then filter them with a stop-word list and a word-frequency threshold (empirical threshold of 3). Additional linguistic rules can be added to the filter depending on the domain. For example, in "tool realization", "tool" is a noun and "realization" is a verb, and such a combination generally cannot form a valid phrase.
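The candidate selection in Step 2 can be sketched as follows. This is an illustration under stated assumptions: the stop-word list, the maximum n-gram length of 3, the noun/verb tag names `n` and `v`, and the choice to keep candidates whose count is at least the threshold are all assumptions, and `candidate_terms` is a hypothetical helper name. The only grammatical rule shown is the noun-followed-by-verb rule from the example above.

```python
from collections import Counter

STOP_WORDS = {"的", "了", "是"}       # illustrative stop-word list (assumed)
FORBIDDEN_TAG_PAIRS = {("n", "v")}   # noun followed by verb, as in "tool realization"
MIN_FREQ = 3                         # empirical threshold from Step 2
MAX_N = 3                            # assumed maximum n-gram length


def candidate_terms(tagged_sentences, max_n=MAX_N, min_freq=MIN_FREQ):
    """Return {candidate: count} for n-grams that pass all filters.

    `tagged_sentences` is a list of sentences, each a list of
    (word, tag) pairs as produced by the preprocessing step.
    """
    counts = Counter()
    for sentence in tagged_sentences:
        for n in range(1, max_n + 1):
            for i in range(len(sentence) - n + 1):
                gram = sentence[i:i + n]
                words = [w for w, _ in gram]
                tags = [t for _, t in gram]
                # Stop-word filter: drop n-grams containing any stop word.
                if any(w in STOP_WORDS for w in words):
                    continue
                # Grammatical filter: drop n-grams with a forbidden adjacent tag pair.
                if any(p in FORBIDDEN_TAG_PAIRS for p in zip(tags, tags[1:])):
                    continue
                # Words are joined without spaces, which suits Chinese text.
                counts["".join(words)] += 1
    # Frequency filter: keeping counts >= min_freq is an assumption.
    return {term: c for term, c in counts.items() if c >= min_freq}
```

Domain-specific rules could be added by extending `FORBIDDEN_TAG_PAIRS` or by adding further checks inside the loop.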
[0043] Step 3. Build a network. Use the set of candidate terms selected in step 2 as ...


