Word elimination process for extracting domain ontology concept

A word exclusion method in the field of domain ontology concept extraction, which addresses problems such as the difficulty of manually setting thresholds.

Inactive Publication Date: 2011-02-02
DALIAN UNIV OF TECH

AI Technical Summary

Problems solved by technology

[0008] The technical problem to be solved by the present invention is to provide a word exclusion method for extracting domain ontology concepts that overcomes the difficulty of manually setting thresholds during domain concept extraction.

Examples

Embodiment 1

[0039] As shown in the attached figure, in the figure:

[0040] 1) Corpus. This method uses a foreground corpus and a background corpus to obtain domain concepts. The foreground corpus is a domain document library that is rich in domain concepts and generally consists of several standardized domain text files. The background corpus is an electronic document library that is compared with the foreground corpus to highlight how the statistical characteristics of domain concepts differ between domain documents and non-domain documents; it is composed of domain documents from more than three different domains.

[0041] Corpus C is composed of the foreground corpora of m (m ≥ 3) domains. When studying the domain-specific concepts of domain D_k, the foreground corpus is Cf_k, and the background corpus Cb_k is formed from the foreground corpora Cf_l (1 ≤ l ≤ m, l ≠ k) of the other m-1 domains in the corpus. The foreground corpus (i.e. the domain corpus) Cf_k is required to fully contain all domain-specific concepts of D_k ...
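The corpus layout in [0041] can be illustrated with a short Python sketch. It assumes the corpus is held as a dictionary mapping a domain name to its list of documents; the function name `split_corpus` and that representation are hypothetical and do not come from the patent text.

```python
from typing import Dict, List, Tuple


def split_corpus(corpus: Dict[str, List[str]], target: str) -> Tuple[List[str], List[str]]:
    """Return (foreground, background) document lists for the target domain D_k.

    Assumption: `corpus` maps each domain name to that domain's foreground
    documents, so the background for one domain is everything else.
    """
    if len(corpus) < 3:
        raise ValueError("the corpus must cover at least 3 domains (m >= 3)")
    if target not in corpus:
        raise KeyError(f"unknown domain: {target}")

    # Cf_k: the foreground corpus of the domain under study.
    foreground = list(corpus[target])
    # Cb_k: the foreground corpora Cf_l of the other m-1 domains (l != k).
    background = [
        doc
        for name, docs in corpus.items()
        if name != target
        for doc in docs
    ]
    return foreground, background


if __name__ == "__main__":
    demo = {
        "medicine": ["doc a", "doc b"],
        "finance": ["doc c"],
        "sports": ["doc d", "doc e"],
    }
    fg, bg = split_corpus(demo, "medicine")
    print(len(fg), len(bg))
```

Each domain takes a turn as the foreground, so one corpus of m domains serves all m extraction runs.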

Abstract

The invention belongs to the technical field of artificial intelligence and relates to a method for extracting domain ontology concepts, in particular a word elimination process for extracting domain ontology concepts. The technical scheme adopts an elimination process to automatically extract the set of domain ontology concepts, solving the technical problem that thresholds are difficult to configure manually when extracting domain concepts. Given the set of words that appear in a domain corpus, the method first calculates the domain relevance of each word and deletes the words that are irrelevant to the domain; it then calculates the domain uniformity of the remaining words and deletes the words that are unevenly distributed in the domain corpus, yielding the set of domain ontology concepts. The method can automatically obtain the set of domain-specific concepts from a text corpus composed of a foreground corpus (i.e. domain corpus) and a background corpus (i.e. non-domain corpus), thereby reducing the disputes caused by subjective factors, such as the knowledge structure of domain experts, in the domain concept extraction process.
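The two elimination steps in the abstract can be sketched in Python as follows. The text available here does not give the patent's formulas for domain relevance or domain uniformity, so both criteria below are stand-in assumptions chosen only to show the order of the steps: relevance is approximated by comparing relative frequencies in the foreground and background corpora, and uniformity by requiring a word to occur in a majority of foreground documents. The names `tokenize` and `eliminate_words` are hypothetical.

```python
from collections import Counter
from typing import List, Set


def tokenize(doc: str) -> List[str]:
    """Whitespace tokenizer; a real system would use a proper word segmenter."""
    return doc.lower().split()


def eliminate_words(foreground: List[str], background: List[str]) -> Set[str]:
    """Two-pass elimination over the words of the foreground corpus.

    Both criteria are illustrative assumptions, not the patent's measures.
    """
    fg_counts = Counter(w for doc in foreground for w in tokenize(doc))
    bg_counts = Counter(w for doc in background for w in tokenize(doc))
    fg_total = sum(fg_counts.values())
    bg_total = sum(bg_counts.values())

    # Step 1 (assumed relevance test): delete words whose relative frequency
    # in the foreground does not exceed their relative frequency in the background.
    relevant = set()
    for word, count in fg_counts.items():
        fg_rate = count / fg_total
        bg_rate = bg_counts[word] / bg_total if bg_total else 0.0
        if fg_rate > bg_rate:
            relevant.add(word)

    # Step 2 (assumed uniformity test): delete remaining words that do not
    # occur in a majority of the foreground documents.
    doc_freq = Counter()
    for doc in foreground:
        doc_freq.update(set(tokenize(doc)))
    n_docs = len(foreground)
    return {w for w in relevant if 2 * doc_freq[w] > n_docs}


if __name__ == "__main__":
    fg = [
        "ontology concept extraction",
        "ontology concept corpus",
        "ontology term weather",
    ]
    bg = ["weather report today", "football match report", "weather football"]
    print(sorted(eliminate_words(fg, bg)))
```

In this toy run, "weather" is removed in the first step because it is relatively more frequent in the background, and "extraction", "corpus" and "term" are removed in the second step because each occurs in only one of the three foreground documents.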

Description

Technical field

[0001] The invention relates to a method for extracting domain ontology concepts, in particular to a word exclusion method for extracting domain ontology concepts.

Background technique

[0002] The concept of domain ontology (namely domain-specific concept, domain concept for short) is a knowledge unit that describes the common characteristics of a group of domain objects. The domain concept extraction method is mainly used to support the construction of word sets of domain concepts, and assist domain experts to collect domain concepts and unified concept words (domain terms), that is, to construct a set of terms that uniquely correspond to domain concepts. Domain terms are the most appropriate words that can describe the domain, and they are standardized terms that represent domain concepts.

[0003] The domain concept extraction method is a machine learning method and technology that uses a computer to simulate the behavior of human domain experts to obtai...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventors: 党延忠, 于娟
Owner: DALIAN UNIV OF TECH