
Incremental training and learning method for support vector machine text classification integrating keyword learning

A support vector machine incremental training technology, applied in complex mathematical operations, special-purpose data processing applications, instruments, etc., that addresses the problem of low classification accuracy and eliminates the difference in classification accuracy between incremental training and one-time training.

Status: Inactive. Publication Date: 2006-03-15
北京大学计算机科学技术研究所 +1

AI Technical Summary

Problems solved by technology

[0004] In current incremental training for SVM text classification, only the optimization of the support vectors themselves is considered, so incremental training yields slightly lower classification accuracy than one-time training. This method aims to eliminate that difference, so that incremental training achieves the same classification accuracy as one-time training.


Examples


Detailed Description of the Embodiments

[0023] The invention is further described below by way of embodiments and with reference to the accompanying drawing:

[0024] As shown in Figure 1, a support vector machine text classification incremental training and learning method that integrates keyword learning includes the following steps:

[0025] First, read the incremental training document using a computer and related software, and segment the document.

[0026] Second, according to the word frequency characteristics in the document, keywords of the document are extracted.

[0027] Third, carry out keyword learning and adjustment. For each new incremental training document, update the original keyword set according to the keywords in the new incremental document. The steps are as follows:

[0028] 1) If the keyword t in the incremental training document k already exists in the original keyword set, then the number of training documents corresponding to th...
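
The first two steps above (segmenting a document and extracting its keywords by word frequency) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and does not come from the patent text: the regex tokenizer is a stand-in for a real word segmenter, and the `STOP_WORDS` list, `top_k` and `min_freq` parameters are hypothetical. The keyword-set update rule of step three is not sketched because the source text is truncated.

```python
import re
from collections import Counter

# Hypothetical stop-word list; the source does not specify one.
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for"}


def segment(text):
    """Split a document into lowercase tokens (stand-in for word segmentation)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def extract_keywords(text, top_k=5, min_freq=2):
    """Return up to top_k (term, frequency) pairs, most frequent first.

    top_k and min_freq are illustrative thresholds, not values from the source.
    Ties in frequency are broken alphabetically so the result is deterministic.
    """
    counts = Counter(t for t in segment(text) if t not in STOP_WORDS)
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(term, freq) for term, freq in ranked if freq >= min_freq][:top_k]


doc = (
    "Support vector machine training for text classification. "
    "Incremental training updates the support vector set as new text arrives."
)
print(extract_keywords(doc))
```

For Chinese documents, `segment` would need to be replaced by a dedicated word segmenter; the frequency-based ranking would stay the same.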


Abstract

The present invention belongs to intelligent information processing technology and in particular relates to an incremental training and learning method for text classification that combines keyword learning with a support vector machine. The invention exploits the important role of keywords in the training process and proposes a synchronized keyword "incremental" learning method: during incremental training, the classification keywords are learned and adjusted at the same time, so that the incremental learning method achieves the same classification accuracy as one-time training.

Description

Technical field

[0001] The invention belongs to intelligent information processing technology and further relates to text classification processing technology, in particular to a support vector machine text classification incremental training and learning method that integrates keyword learning.

Background

[0002] With the rapid development of network and information technology and the great abundance of digital document information, the classification and processing of texts, materials, web pages, etc. has become an important technical means of information processing. For text classification, the support vector machine (SVM) method is currently one of the most effective methods. In 1998, Joachims, in "Text Categorization with Support Vector Machines: Learning with Many Relevant Features" (In Proceedings of the European Conference on Machine Learning, Berlin, Springer, 1998), verified its excellent performance in text clas...

Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F17/21, G06F17/16
Inventor: 孙晋文
Owner: 北京大学计算机科学技术研究所