Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Semantics-based Keyword Extraction Method for Specific Task Text

An extraction method and keyword technology, applied in the field of natural language processing, can solve problems such as complex structure, large amount of meaningless information, and large amount of information

Active Publication Date: 2020-07-10
北京东方科诺科技发展有限公司
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the large amount of information, complex structure, and many meaningless information make it impossible for people to process and process every piece of received information and identify the valuable part of it.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Semantics-based Keyword Extraction Method for Specific Task Text
  • A Semantics-based Keyword Extraction Method for Specific Task Text
  • A Semantics-based Keyword Extraction Method for Specific Task Text

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0035] The specific implementation method of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0036] A semantic-based keyword extraction method for a specific task text in the present invention firstly considers the semantic features of the keyword text to be extracted, calculates the semantic similarity between the text and the subject word of the specific task, and then considers the structural features of the keyword text to be extracted to construct Word network diagram, and finally use the search engine webpage sorting technology to calculate the importance of each word, and extract the words with higher importance in the network diagram according to the importance.

[0037] like figure 1 As shown, the details are as follows: firstly, use a search engine to search for a specific task-related corpus, extract subject words from the corpus related to a specific task, and use semantic representation technology to convert...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a semantic meaning-based specific task text keyword extraction method, and belongs to the field of natural language processing. The method comprises the following steps of: firstly, extracting a subject word of a certain specific task in a related text, and converting the subject word into a semantic vector by utilizing a semantic representation technology; secondly, carrying out word segmentation, part-of-speech tagging and screening on a text of a to-be-extracted keyword by utilizing a word segmentation tool; thirdly, converting the screened words into semantic vectors and calculating a similarity between each screened word and the subject word of the specific task; and finally, constructing a word network chart by taking the words as nodes, and calculating the importance degree of each word on the basis of the word similarity so as to extract important words in the word network chart. The method disclosed by the invention comprehensively considers the semantic features and structural features of the words in the texts, and is suitable for the extraction of specific-task oriented text keywords, so as to realize a function of obtaining important information from the texts and provide important technical support for the field of text mining, natural language processing, knowledge engineering and the like.

Description

technical field [0001] The invention belongs to the field of natural language processing and relates to information extraction technology, in particular to a method for extracting keywords from specific task text based on semantics. Background technique [0002] With the rapid development of social media, people receive and process a large amount of information from the physical world and the information world every moment. However, the large amount of information, complex structure, and many meaningless information make it impossible for people to process and process every piece of received information and identify the valuable part of it. Therefore, how to obtain useful information from text is the key to realize fast and accurate processing of information. [0003] In the real world, keywords are the most intuitive representation of useful information, so how to obtain keywords that people pay attention to from texts has become an urgent problem to be solved. Obtaining ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/284G06F40/30G06F40/216
CPCG06F40/216G06F40/284G06F40/30
Inventor 吴俊杰孙运动袁石
Owner 北京东方科诺科技发展有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products