KNN algorithm based article translation method

A technology of KNN algorithm and optimization method, which is applied to computer components, calculations, instruments, etc., and can solve problems such as low efficiency and accuracy

Inactive Publication Date: 2015-10-28
HENAN UNIV OF SCI & TECH
View PDF2 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a manuscript translation optimization method based on the KNN algorithm to solve the problem of low efficiency and accuracy of the traditional manual classification method, and introduces the mutual information value into the genetic algorithm in the feature extraction step. The advantages of the two extraction methods can be combined to make the feature extraction results more reliable, so that the entire text classification can be better applied to the manuscript text information mining system

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • KNN algorithm based article translation method
  • KNN algorithm based article translation method
  • KNN algorithm based article translation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0072] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0073] see figure 1 and figure 2 , a manuscript translation optimization method based on the KNN algorithm. Firstly, the training manuscript is preprocessed, and then the preprocessed manuscript is represented by a vector space model, and then the feature extraction is performed on the representation result, and then the text classification model can be calculated. After preprocessing, text representation, and feature extraction are also performed on the mail data to be classified, the model is applied to the manuscripts to be classified, and finally the result is obtained.

[0074] A manuscript translation optimization method based on the KNN algorithm, the specific steps are as follows:

[0075] (1) The total number of predefined text categories is n, and n represents the number of categories of samples of known categories, that is, the nu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Provided is a KNN algorithm based article translation method. The method comprises the steps of: firstly, splitting a relatively large article, extracting a keyword from an article to be translated and classifying the article; and obtaining an optimal allocation result by using an algorithm to match a K value. The invention is mainly about preprocessing articles that are used for training and mail data to be classified; performing textual representation on the preprocessed articles that are used for training and mail data to be classified; using a genetic algorithm to perform feature extraction on the textually represented articles that are used for training and on mail data to be classified; performing classification training on the extracted features of the articles that are used for training; using the optimized sample set KNN algorithm to perform classification training, so as to construct a text classifier; and applying the text classifier to the articles to be classified after the feature extraction, so as to obtain a classification result of the articles to be classified. The method provided by the invention can be better applied to an article text information mining system.

Description

technical field [0001] A manuscript translation optimization method based on the KNN algorithm, which uses the K-nearest neighbor node algorithm to classify manuscripts by cutting and optimizing the training set, belongs to the fields of text mining, natural language processing, and computer technology. Background technique [0002] The information age and networking have brought about great changes in the way translation works. Use the translation process management platform to store talent data according to different objects. When there is a translation task, according to the language of the translation project, the type of article, the professional field, and the customer's requirements for translation quality and time limit, we can call the most suitable translators and reviewers to form a project team for translation, thereby improving translation efficiency and saving translation costs, ensure translation quality, and optimize project management. [0003] The current...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/2111G06F18/2413G06F18/214
Inventor 郑林涛史恒亮俞卫华董永生范庆辉
Owner HENAN UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products