Method for mapping Chinese problems on basis of LDA (latent Dirichlet allocation)

A question mapping method, applied in the field of computer software, that addresses the large classification errors of existing approaches to Chinese questions and, through a reasonable design, improves mapping accuracy.

Active Publication Date: 2017-12-01
识因智能科技(北京)有限公司
7 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

Statistical machine learning methods currently dominate question classification. A representative approach uses the SVM (Support Vector Machine) algorithm to classify questions. Analysis shows that classifying Chinese questions with this method introduces large errors.


Embodiment Construction

[0039]The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0040] Referring to Figure 1, an LDA-based method for mapping Chinese questions includes the following steps:

[0041] Step A: first train the LDA topic model on the document library D. For each document $d_t$ this yields its topics $z_j \in T$, $T = \{z_1, z_2, \ldots, z_k\}$, and their distribution $p(z_j \mid d_t)$; for each topic $z_j$ it also yields its terms $w_r \in v$, $v = \{w_1, w_2, \ldots, w_r\}$, and their distribution $p(z_j \mid w_r)$, by the definition of condition...
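The training in Step A can be illustrated with a minimal sketch. It assumes the documents are already segmented into tokens and uses a plain collapsed Gibbs sampler, which is one common way to fit LDA and not necessarily the procedure used in the patent. The function name `train_lda`, the hyperparameters `alpha` and `beta`, and the toy corpus are all hypothetical. The sketch returns the topic distribution of each document and, in the standard LDA form, the word distribution of each topic.

```python
import random

def train_lda(docs, num_topics, alpha=0.1, beta=0.01, iterations=200, seed=0):
    """Fit LDA on tokenized documents with collapsed Gibbs sampling (illustrative)."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    V = len(vocab)
    doc_topic = [[0] * num_topics for _ in docs]        # count of topic k in document d
    topic_word = [[0] * V for _ in range(num_topics)]   # count of word v in topic k
    topic_total = [0] * num_topics                      # total tokens assigned to topic k
    assignments = []

    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            z = rng.randrange(num_topics)
            zs.append(z)
            doc_topic[d][z] += 1
            topic_word[z][word_id[w]] += 1
            topic_total[z] += 1
        assignments.append(zs)

    # Resample each token's topic conditioned on all other assignments.
    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                z = assignments[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][wid] -= 1
                topic_total[z] -= 1
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][wid] + beta) / (topic_total[k] + V * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][wid] += 1
                topic_total[z] += 1

    # Smoothed estimates: topic given document, and word given topic.
    theta = [
        [(doc_topic[d][k] + alpha) / (len(doc) + num_topics * alpha)
         for k in range(num_topics)]
        for d, doc in enumerate(docs)
    ]
    phi = [
        [(topic_word[k][v] + beta) / (topic_total[k] + V * beta) for v in range(V)]
        for k in range(num_topics)
    ]
    return vocab, theta, phi

# Toy corpus (hypothetical): two finance-like and two sports-like documents.
docs = [
    ["bank", "loan", "interest", "bank"],
    ["loan", "interest", "rate", "bank"],
    ["match", "team", "goal", "team"],
    ["goal", "match", "player", "team"],
]
vocab, theta, phi = train_lda(docs, num_topics=2)
```

Each row of `theta` is the topic distribution of one document and each row of `phi` is the word distribution of one topic; both are normalized to sum to 1.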


Abstract

The invention discloses a method for mapping Chinese questions on the basis of LDA (latent Dirichlet allocation). The method classifies a document library with an LDA topic model and classifies the parts of speech of the words in a question with a Softmax regression model. Based on the part-of-speech category, content words receive high weights and function words receive low weights, and content words of different parts of speech receive different weights. Dependency-grammar syntactic analysis then finds the dependency relations between the terms in the sentence, and different weights are assigned according to the role each term plays in the sentence. The two weights are multiplied to obtain the weight of each word in the question. Finally, Bayes' rule is used to relate the weighted distribution of terms in the question to the distributions of topics and terms in the documents. Because the documents are classified with the LDA topic model and the weights reflect both the part of speech and the sentence role of each term in the interrogative sentence, important terms have more influence on classification and the accuracy of Chinese question mapping improves.
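The weighting described in the abstract can be sketched as follows. This is only an illustration under stated assumptions: the part-of-speech tags and sentence roles are taken as given input (the abstract obtains them from a Softmax regression model and a dependency parser), the weight tables `POS_WEIGHT` and `ROLE_WEIGHT` and their values are hypothetical, and the final step, a weighted average of per-word topic distributions, is one plausible way to combine the quantities, not the formula from the patent.

```python
# Hypothetical weight tables; the numeric values are illustrative only.
POS_WEIGHT = {"noun": 1.0, "verb": 0.8, "adjective": 0.6, "function": 0.1}
ROLE_WEIGHT = {"subject": 1.0, "predicate": 0.9, "object": 1.0, "other": 0.5}

def term_weights(tagged_question):
    """Multiply the part-of-speech weight by the sentence-role weight, then normalize."""
    raw = {}
    for word, pos, role in tagged_question:
        raw[word] = raw.get(word, 0.0) + POS_WEIGHT[pos] * ROLE_WEIGHT[role]
    total = sum(raw.values())
    return {word: value / total for word, value in raw.items()}

def topic_scores(weights, p_topic_given_word, num_topics):
    """Combine per-word topic distributions using the term weights (assumed scheme)."""
    scores = [0.0] * num_topics
    for word, weight in weights.items():
        dist = p_topic_given_word.get(word)
        if dist is None:
            continue  # words unseen in training contribute nothing
        for k in range(num_topics):
            scores[k] += weight * dist[k]
    total = sum(scores)
    return [s / total for s in scores] if total > 0 else scores

# Hypothetical tagged question: (word, part of speech, sentence role).
question = [
    ("what", "function", "other"),
    ("bank", "noun", "subject"),
    ("offers", "verb", "predicate"),
    ("loan", "noun", "object"),
]
# Hypothetical topic-given-word distributions over two topics.
p_topic_given_word = {
    "bank": [0.9, 0.1],
    "loan": [0.8, 0.2],
    "offers": [0.5, 0.5],
}
weights = term_weights(question)
scores = topic_scores(weights, p_topic_given_word, num_topics=2)
best_topic = max(range(len(scores)), key=scores.__getitem__)
```

In this toy input the function word "what" receives a much smaller weight than the content words, so the topic favored by "bank" and "loan" dominates the score.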

Description

Technical field

[0001] The invention relates to a Chinese question mapping method, in particular to an LDA-based Chinese question mapping method, and belongs to the field of computer software.

Background

[0002] With the rapid development of Internet technology, search engines can provide people with all kinds of online information quickly and conveniently. Early search engines required users to submit keywords as a query and then returned a list of web documents related to that query. Such a limited set of keywords sometimes cannot fully express the user's query intention, and not every user can accurately supply the keywords for what they want to find. People therefore urgently need a more efficient and convenient way to obtain information from the Internet. The question answering system was born against this background. It allows users to ask questions in natural language and then directly returns accurate answers.

[0003] Automatic question answe...


Application Information

IPC(8): G06F17/30
CPC: G06F16/3329, G06F16/35
Inventor: 王春辉
Owner: 识因智能科技(北京)有限公司