Sentence expression method based on Chinese sentence meaning structural model and topic model

A technology of sentence structure model and topic model, which is applied in the field of Chinese analysis of computer science and natural language processing, can solve the problems of large manpower and material resources, and achieve the effect of improving the classification effect

Inactive Publication Date: 2016-05-11
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF1 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, building a word knowledge base r...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sentence expression method based on Chinese sentence meaning structural model and topic model
  • Sentence expression method based on Chinese sentence meaning structural model and topic model
  • Sentence expression method based on Chinese sentence meaning structural model and topic model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] In order to better illustrate the purpose and advantages of the present invention, the implementation of the method of the present invention will be further described in detail below in conjunction with the accompanying drawings and examples.

[0023] Using the randomly selected texts of three categories of vehicles, finance and health in the sogou text classification corpus, 200 articles in each category with a total of 14357 sentences as data, the sentence classification test is carried out by using the ten-fold cross method.

[0024] Step 1. In order to obtain the basic item words, general item words, topic words and predicate words in the sentence, it is necessary to analyze the sentence structure first to obtain the sentence structure of the sentence.

[0025] In the above-mentioned steps, the basic term refers to the word as the basic item in the sentence structure of the sentence; the general term refers to the general term in the sentence structure of the sentenc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a sentence expression method based on a Chinese sentence meaning structural model and a topic model, and belongs to the technical field of computer science and natural language processing Chinese analysis. The method includes the steps that sentence meaning structural analysis is conducted on a sentence to obtain a sentence meaning structure of the sentence; elementary item words and general item words in the sentence are extracted, and the topic model is used for analysis to obtain an elementary item knowledge base and a general item knowledge base; according to words under a topic and a theme in the sentence meaning structure, the knowledge bases obtained in the previous step are used for expanding sentence content to obtain a sentence expression result. A new thought train is provided for solving the feature sparse problem of sentence expression, the sentence classifying effect is effectively promoted, and the sentence expression method has high theoretical value and an important practice function.

Description

technical field [0001] The invention relates to a sentence representation method based on a Chinese semantic structure model and a topic model, and belongs to the technical field of Chinese analysis of computer science and natural language processing. Background technique [0002] The purpose of sentence representation is to represent the content in a sentence into a computer-processable data form for classification, clustering, or sentence generation. As a basic research of natural language processing, it has been widely used in systems such as automatic question answering and automatic summarization. [0003] The bag-of-words model and the n-lattice model are currently the most commonly used long text representation methods due to their simplicity and efficiency. However, when analyzing and processing short texts such as sentences, these traditional methods will cause sparse representation of features due to the small data content in the text. To solve this problem, ther...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/27
CPCG06F40/205G06F40/30
Inventor 罗森林韩磊潘丽敏尚海
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products