Unlock instant, AI-driven research and patent intelligence for your innovation.

Sample classification method and apparatus

A classification method and sample technology, applied in the computer field, can solve problems such as infeasibility, low algorithm efficiency, high overhead, etc.

Inactive Publication Date: 2018-11-06
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] In the process of realizing the present invention, the inventor finds that there are at least the following problems in the prior art: the K nearest neighbor algorithm needs to calculate the distance of each training sample in the test sample and the training sample set one by one (or Similarity), when the training sample set is large data, the above calculation process will generate high overhead, resulting in very low efficiency of the algorithm, or even infeasible

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sample classification method and apparatus
  • Sample classification method and apparatus
  • Sample classification method and apparatus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] Exemplary embodiments of the present invention are described below in conjunction with the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and they should be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

[0032] figure 1 is a schematic diagram of the main steps of the sample classification method according to the embodiment of the present invention. Such as figure 1 As shown, the sample classification method in the embodiment of the present invention mainly includes the following steps:

[0033] Step S101: Calculate the similarity between the test sample and the cluster ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a sample classification method and apparatus, and belongs to the technical field of computers. The sample classification method includes the steps: calculating a similarity between a test sample and the cluster center of a plurality of sub-clusters, and determining the selection interval according to the similarity and the preset threshold, wherein the sub-clusters are obtained through clustering of a training sample set; selecting training samples having the similarity with the clustering center in the selection interval from the sub-clusters corresponding to the clustering center with the highest similarity; and taking the selected training samples as a new training sample set, and classifying the test samples. For each test sample, the sample classification method selects the training samples from the sub-clusters corresponding to the clustering center with the highest similarity according to the determined selection interval, and classifies each test sampleby using the selected training samples, thus reducing the number of subsequent training samples to be classified, and improving the sample classification efficiency in a big data environment.

Description

technical field [0001] The invention relates to the field of computers, in particular to a sample classification method and device. Background technique [0002] Because of its simplicity and ease of implementation, the K-nearest neighbor algorithm is widely used in many fields, such as face recognition, gene classification, decision support, etc. The basic idea of ​​the K-nearest neighbor algorithm is: for a given test sample x, find its K nearest neighbor samples in the training sample set, and determine the category of the test sample x according to the categories of the K nearest neighbor samples. [0003] In the process of realizing the present invention, the inventor finds that there are at least the following problems in the prior art: the K nearest neighbor algorithm needs to calculate the distance of each training sample in the test sample and the training sample set one by one (or Similarity), when the training sample set is large data, the above calculation proce...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62
CPCG06F18/2321G06F18/24147
Inventor 张明阳
Owner BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD