Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Geological document lexical item grading method and device

A grading method and a grading device technology, applied in semantic analysis, instruments, electrical digital data processing, etc., can solve the problem of no differentiation of terms and achieve the effect of highlighting differences

Active Publication Date: 2020-04-28
CENT SOUTH UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In order to solve the problem in the prior art that purely using the number of times a term appears in a document to represent a term’s representation of the subject of a document without differentiated terms, the present invention provides a method for grading geological document terms and device

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Geological document lexical item grading method and device
  • Geological document lexical item grading method and device
  • Geological document lexical item grading method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0051] See attached figure 2 , the term grading method in the first embodiment, comprising the steps of:

[0052] C1. Obtain multiple first rule terms, multiple second rule terms, and multiple third rule terms.

[0053] C2. Based on the preset first level, third level, and fifth level corresponding to the first regular term, second regular term, and third regular term, obtain a plurality of first level terms, Multiple third-level terms and multiple fifth-level terms.

[0054] The multiple first-level terms include: the multiple first regular terms.

[0055] The plurality of third-level terms includes: the plurality of second rule terms.

[0056] The multiple fifth-level terms include: the multiple third regular terms.

[0057] C3. Judging whether there is a first-level term identical to a third-level term or a fifth-level term among the plurality of first-level terms. If yes, the plurality of first-level terms are processed to obtain the processed plurality of first-leve...

Embodiment 2

[0083] In this embodiment, according to the semantics expressed by different terms in the geological document, the terms are divided into multiple levels, and the level definitions of terms are shown in Table 1.

[0084] Table 1 Definition of term level

[0085]

[0086]

[0087] The "level" mentioned in Table 1, the larger the number, the higher the level, and the more important the role in characterizing geological documents.

[0088] (2) Term-level definition

[0089] (2-1) Initial level

[0090] The initial rank of a particular term is determined according to the source of the term, and the initial ranks of terms from different sources are defined as follows:

[0091] In this embodiment, the words, phrases or phrases obtained by the ordinary Chinese word segmentation are used as the first regular term, and its initial level is 1.

[0092] In this embodiment, the term extracted from the common named entity is used as the second rule term, and its initial level is 3...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a geological document lexical item grading method. The method comprises the steps of obtaining a plurality of target processing lexical items and length values of the target processing lexical items; acquiring a first type of target processing lexical items and a second type of target processing lexical items based on the target processing lexical items and a preset end word; obtaining a length value of a tail word of the second type of target processing lexical items; based on the length value of the tail word of the second type of target processing lexical item and the length value of the target processing lexical item to which the tail word belongs, obtaining a prefix length value of the target processing lexical item to which the tail word belongs; based on theprefix length value of the target processing lexical item to which the tail word belongs and a preset target level corresponding to the prefix length value, obtaining the target level of the target processing lexical item to which the tail word belongs; wherein the target level comprises a first target level or a second target level or a third target level or a fourth target level or a fifth target level or a sixth target level.

Description

technical field [0001] The invention relates to the field of language processing, in particular to a method and device for grading geological document terms. Background technique [0002] At present, most Chinese text classification systems use words as feature items, called feature words. These feature words are used as the intermediate representation of the document, and are used to realize the similarity calculation between documents and documents, documents and user targets. Usually, the score value of each feature is calculated according to a feature evaluation function, and then these features are sorted according to the score value, and several highest score values ​​are selected as feature words. There are many methods for text representation, the most common and effective method is to establish a term-document matrix. [0003] Each element value in the term-document matrix represents the weight of the term on the corresponding row to the document on the correspond...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/289G06F40/295G06F40/30
Inventor 邓吉秋路馥毓李晨菡
Owner CENT SOUTH UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products