Method and device for excavating semantic keywords from text

A keyword and semantic association technology, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve problems such as the increase in the number of texts, mining of semantic keywords, and the lack of structure in texts

Inactive Publication Date: 2014-12-24
FUJITSU LTD
View PDF5 Cites 47 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the number of texts is growing explosively, and there are many types of texts, and a considerable part of the texts does not have a fixed structure
Therefore, there is a problem of how to mine semantic keywords from massive, unstructured texts

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for excavating semantic keywords from text
  • Method and device for excavating semantic keywords from text
  • Method and device for excavating semantic keywords from text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] Exemplary embodiments of the present invention will be described in detail below with reference to the accompanying drawings. In the interest of clarity and conciseness, not all features of an actual implementation are described in this specification. It should be understood, however, that in developing any such practical implementation, many implementation-specific decisions must be made in order to achieve the developer's specific goals, such as meeting those system- and business-related constraints and those Restrictions may vary from implementation to implementation. Moreover, it should also be understood that development work, while potentially complex and time-consuming, would at least be a routine undertaking for those skilled in the art having the benefit of this disclosure.

[0019] Here, it should also be noted that, in order to avoid obscuring the present invention due to unnecessary details, only the device structure and / or processing steps closely related ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and a device for excavating semantic keywords from a text. According to the invention, the method comprises the steps of: searching known words in the text to obtain multiple candidate keywords; calculating the probability of the candidate of the multiple candidate keywords based on the reference probability and/or the context of the known words, wherein the reference probability shows the probability of the known words as an anchor text, and the probability of the candidate shows the probability of the candidate keywords as the semantic keywords; determining whether the multiple candidate keywords are the semantic keywords of the text based on the probability of the candidate of the multiple candidate keywords.

Description

technical field [0001] The present invention relates generally to the field of natural language processing. Specifically, the present invention relates to a method and device for mining semantic keywords from text. Background technique [0002] Text is the most common processing object in the field of natural language processing. Faced with massive amounts of text, it is obviously not practical to directly use the text itself to operate. People usually use semantic keywords representing the semantic information of the text to help represent, index, share, retrieve, classify, and cluster text. [0003] However, the number of texts is growing explosively, and there are many types of texts, and a considerable part of texts does not have a fixed structure. Therefore, there is a problem of how to mine semantic keywords from massive, unstructured texts. [0004] Therefore, it is expected to mine semantic keywords from texts with high efficiency and accuracy. Contents of the i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/3334
Inventor 缪庆亮孟遥于浩
Owner FUJITSU LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products