A Term Judgment Method Based on Decision Tree

A judgment method and decision tree technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve problems such as poor field term extraction effect, achieve high reliability and accuracy, and reduce workload.

Active Publication Date: 2018-01-26
IOL WUHAN INFORMATION TECH CO LTD
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] One of the purposes of the present invention is to provide a method for judging terms based on a decision tree, so as to solve the problem of poor field term extraction in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Term Judgment Method Based on Decision Tree
  • A Term Judgment Method Based on Decision Tree
  • A Term Judgment Method Based on Decision Tree

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0010] In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. However, it will be understood by those skilled in the art that the present invention may be practiced without these specific details. In other instances, well-known methods, procedures, components and circuits have not been described in detail so as not to obscure the present invention.

[0011] Such as figure 1 As shown, a method for determining a term based on a decision tree is disclosed, including:

[0012] S11. Segment the original corpus with any length in units of morphemes to obtain several candidate terms, wherein each of the candidate terms consists of at least two morphemes;

[0013] S12. Determine a plurality of features that affect term determination, and calculate the feature value of each feature of each of the candidate terms;

[0014] S13. Using multiple eigenvalues ​​of each of the candidate terms, in...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A method for judging a term based on a decision tree, comprising: segmenting the original corpus with arbitrary lengths in units of morphemes to obtain several candidate terms, wherein each of the candidate terms is composed of at least two morphemes; A plurality of features, calculate the feature value of each feature of each of the candidate terms; with each of the multiple feature values ​​of the candidate term, in the decision tree for term determination, according to the decision tree The order of generation is judged sequentially; the candidate term that is judged successful by the decision tree is used as a new term. The invention reduces the workload of manual processing and ensures high reliability and accuracy of the obtained terms.

Description

technical field [0001] The invention belongs to the technical field of data mining, in particular to a method for judging terms based on a decision tree. Background technique [0002] Field terms or professional terms are conventional symbols used to express or define professional concepts with voice or text as the carrier. With the vigorous development of science and technology, the continuous emergence of new technologies and the rapid change of Internet technology, the professional terms in some specific fields are constantly expanding and updating. Therefore, the traditional method of manually collecting field terms can no longer meet the actual needs. Automatic extraction of field terms (ATE, Automatic Term Extraction) has become inevitable. In practical applications, domain term extraction is of great significance in the construction of domain ontology, Chinese word segmentation, information extraction, dictionary compilation, information retrieval, machine translatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/27
Inventor 江潮张芃
Owner IOL WUHAN INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products