Supercharge Your Innovation With Domain-Expert AI Agents!

Geological document feature lexical item sorting method and device based on graded lexical items

A technology of geological documents and sorting devices, which is applied in the fields of instruments, computing, and electrical digital data processing, etc., can solve problems such as the inability to reflect the differences in the importance of different terms to the subject, and achieve reliable sorting and effective calculations

Active Publication Date: 2020-05-01
CENT SOUTH UNIV
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the general term-document matrix, the number of occurrences of the term is purely used to represent the representation of the term to the topic of the document. The TextRank algorithm uses the relationship between local words (co-occurrence window) to sort the subsequent feature words. Reflect the difference in the importance of different terms to the topic

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Geological document feature lexical item sorting method and device based on graded lexical items
  • Geological document feature lexical item sorting method and device based on graded lexical items
  • Geological document feature lexical item sorting method and device based on graded lexical items

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0040] See attached figure 1 , the method of this embodiment specifically includes:

[0041] A1. Get range type parameter information.

[0042] A2. Determine whether the type parameter is the same as a preset first parameter, second parameter, or third parameter.

[0043] If so, get range parameter information.

[0044] The range parameter information includes: a first range parameter or a second range parameter.

[0045] A3. Based on the scope type parameter information and the scope parameter information, obtain a preset first document set, second document set, or third document set corresponding to the scope type parameter information and scope parameter information.

[0046] The first document set includes multiple first rule documents, and the first rule document in this embodiment is any document that may be extracted.

[0047] The second document set includes a plurality of second rule documents, and in this embodiment, the second rule document is any document belon...

Embodiment 2

[0058] (1) Input description

[0059] The input includes the term classification table words_list, the term level weight table levels_weights, the document term table files_words, and the feature value sorting parameter list orders.

[0060] (1-1) Words_list: It is a database table containing basic information of all words extracted from documents in a specific range. See Table 1 for specific field definitions.

[0061] Table 1 Definition of Term Grading Table

[0062]

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a geological document feature lexical item sorting method based on graded lexical items. The method comprises the steps that range type parameter information is acquired; judging whether the range type parameter is the same as a preset first parameter or a preset second parameter or a preset third parameter or not; if so, acquiring range parameter information; based on therange type parameter information and the range parameter information, obtaining a preset first document set or a preset second document set or a preset third document set corresponding to the type parameter information and the range parameter information; obtaining word frequencies of feature lexical items in the first document set or the second document set or the third document set; based on the word frequency of the feature lexical items in the first document set or the second document set or the third document set and preset lexical item levels and level weights corresponding to the feature lexical items, obtaining feature values of the feature lexical items in the first document set or the second document set or the third document set; and based on the feature values of the feature word items, obtaining the feature word items corresponding to the first N feature values in the feature values.

Description

technical field [0001] The invention relates to the field of language processing, in particular to a method and device for sorting geological document feature terms based on hierarchical terms. Background technique [0002] The subject (or feature) of a geological document is determined by all terms in the document and their grammar, context dependencies, etc., among which terms play an important role. [0003] The terms in geological documents include geological named entities such as "XX fault", "XX mine", "XX rock", geological property terms such as "normal fault" and "rhyoline structure", and "2019 Common named entities such as "October 10, Year" and "Hunan Academy of Geological Sciences", basic geological terms such as "stratum", "structure", and "rock mass", and "control", "basis", and "area" , "feature" and other common word segmentations, different terms have different characterization functions on geological documents. [0004] At present, most Chinese text classi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/284G06F40/295
Inventor 邓吉秋路馥毓李晨菡
Owner CENT SOUTH UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More