Short Text Feature Extension Method Based on Semantic Graph

A semantic-graph-based extension method, applied in the field of short text feature extension, that addresses problems such as data sparseness and improves classification performance by alleviating the sparsity and semantic sensitivity problems.

Active Publication Date: 2017-12-01
INST OF AUTOMATION CHINESE ACAD OF SCI
4 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0006] To address the two main problems above, the present invention proposes a short text feature extension method based on semantic graphs. It solves the data sparsity and semantic sensitivity problems that arise when short text features are represented with the traditional bag-of-words model, and ultimately improves short text classification performance.
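
The sparsity problem mentioned above can be illustrated with a small sketch. The two example texts and the expansion words below are hypothetical and chosen only for illustration; they are not taken from the patent. Two short texts with related meaning but no shared words have zero cosine similarity under a plain bag-of-words model, and adding related keywords to one of them makes the overlap visible.

```python
from collections import Counter
import math

def bow_vector(text):
    """Bag-of-words term counts for a whitespace-tokenized text."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

# Hypothetical short texts: related in meaning, no words in common.
s1 = bow_vector("cheap laptop deals")
s2 = bow_vector("discount notebook offers")
similarity = cosine(s1, s2)

# Hypothetical expansion: append related keywords to the first text.
s1_expanded = s1 + Counter(["notebook", "discount"])
expanded_similarity = cosine(s1_expanded, s2)

print(similarity, expanded_similarity)
```

Before expansion the two vectors share no terms, so the similarity is zero; after expansion it becomes positive. How the related keywords are actually chosen is the subject of the method itself.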

Examples

Embodiment Construction

[0042] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in further detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0043] The present invention proposes a short text feature extension method based on a semantic graph, specifically a method based on a topic-keyword semantic graph and link analysis. It can mine the semantic relationships between topic words, quickly and accurately extract the information most relevant to a seed keyword, and thereby expand the feature representation of the target short text. The basic features of the present invention mainly include the following six aspects: First, it does not rely on an external large-scale auxiliary training corpus; it uses the short text data set directly for topic modeling, which improves modeling efficiency and ensures seman...

Abstract

The invention discloses a short text feature extension method based on a semantic graph, which includes the following steps: performing topic modeling on a short text training data set and extracting the topic word distributions; reordering the topic word distributions; constructing a candidate keyword dictionary and a topic-keyword semantic graph; and, using link analysis, computing a comprehensive similarity score between candidate keywords and seed keywords and selecting the most similar candidate keywords to expand the short text. Compared with short text feature representation methods based on language models, the method is simple to operate, executes efficiently, and makes full use of the semantic correlation information between keywords. Compared with the traditional short text feature representation method based on the bag-of-words model, it effectively alleviates the sparsity and semantic sensitivity problems, and it does not depend on an external large-scale auxiliary training corpus or on search engines.
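
The steps in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the patented procedure: the topic-word weights are hypothetical stand-ins for the output of topic modeling, the reordering step is reduced to a simple sort by weight, and a random walk with restart is used as one possible link-analysis score because the excerpt does not name the exact similarity measure. All function names are invented for this example.

```python
from collections import defaultdict

# Hypothetical topic-word weights standing in for topic modeling output.
topic_words = {
    "t0": {"laptop": 0.4, "notebook": 0.3, "battery": 0.2, "screen": 0.1},
    "t1": {"discount": 0.5, "deals": 0.3, "laptop": 0.2},
    "t2": {"football": 0.6, "league": 0.4},
}

def build_graph(topic_words, top_k=3):
    """Keep the top_k words of each topic and link each word to its topic."""
    graph = defaultdict(dict)
    for topic, words in topic_words.items():
        ranked = sorted(words.items(), key=lambda kv: kv[1], reverse=True)
        for word, weight in ranked[:top_k]:
            graph[topic][word] = weight  # topic -> keyword edge
            graph[word][topic] = weight  # keyword -> topic edge
    return dict(graph)

def random_walk_with_restart(graph, seed, restart=0.3, iterations=50):
    """Score every node by its proximity to the seed (assumed measure)."""
    scores = {node: 0.0 for node in graph}
    scores[seed] = 1.0
    for _ in range(iterations):
        nxt = {node: 0.0 for node in graph}
        for node, mass in scores.items():
            total = sum(graph[node].values())
            for neighbor, weight in graph[node].items():
                nxt[neighbor] += (1 - restart) * mass * weight / total
        nxt[seed] += restart  # jump back to the seed keyword
        scores = nxt
    return scores

def expand(graph, seed, topics, n=2):
    """Return the n candidate keywords closest to the seed keyword."""
    scores = random_walk_with_restart(graph, seed)
    candidates = [(node, score) for node, score in scores.items()
                  if node not in topics and node != seed]
    candidates.sort(key=lambda kv: kv[1], reverse=True)
    return [word for word, _ in candidates[:n]]

graph = build_graph(topic_words)
expansion = expand(graph, "laptop", topic_words)
print(expansion)
```

In this sketch, keywords that share a topic with the seed receive positive scores, while keywords that appear only in unrelated topics (here `football` and `league`) receive none and are never selected for expansion.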

Description

Technical field

[0001] The present invention relates to the technical field of text mining, and is a short text feature extension method based on a topic-keyword semantic graph and link analysis. It can be applied to feature representation in short text classification and clustering tasks, and ultimately to subfields such as knowledge question answering, user intent understanding, and intelligent retrieval.

Background

[0002] With the advent of the big data era, the Internet and various mobile terminals have generated a large amount of short text information, such as web search snippets, Weibo posts, product reviews, news headlines, and various other micro-messages, and information is increasingly overwhelmed by this vast volume of resources. Making systems manage and use these massive data resources intelligently is a major challenge. Therefore, a high-precision short text classification method can help the system to deepen the understa...

Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30
CPC: G06F40/216, G06F40/30, G06F18/24
Inventors: 徐博, 王鹏, 王方圆, 张恒, 郝红卫
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI