Validating training data of classifiers

A technique for validating the training data of classifiers that addresses problems such as the difficulty of collecting question statements.

Pending Publication Date: 2020-02-11
IBM CORP
Cites: 2; Cited by: 0

AI Technical Summary

Problems solved by technology

However, in the automatic answering system described above, the data portion consists of question statements written in natural language, which raises the following problems: (1) question statements are not easy to collect; (2) the intent of a question statement collected as training data usually involves content specific to a particular domain, so experts in each domain must verify and determine the intent individually.


Examples

Embodiment Construction

[0020] Embodiments of the present invention relate to an automatic answering system that determines the intent of a natural language question by classifying the question statement and returns a suitable answer according to that intent. The disclosed method validates the training data of the classifier and modifies that training data based on the validation results, which makes the training data easier to revise and improve. As a result, the method helps improve both the accuracy of the classifier used by the automatic answering system and the precision of the automatic answering system itself.

[0021] In the embodiment of the present invention, for the automatic answering system, in order for the classifier to correctly classify the intention of the question and provide an appropriate answer, it is necessary to prepare the question statement, the intention and the answer statement associated with the in...
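The three ingredients named above (question statement, intent, and the answer statement associated with the intent) can be pictured with a minimal sketch. Everything in it is an assumption made for illustration: the `TrainingExample` type, the sample intents and answers, and the keyword-based `toy_classifier`, which merely stands in for a trained classifier and is not the method of the patent.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class TrainingExample:
    """One piece of training data: a question statement labeled with its intent."""
    question: str
    intent: str


# Hypothetical training data and intent-to-answer table (not from the source).
TRAINING_DATA: List[TrainingExample] = [
    TrainingExample("How do I reset my password?", "reset_password"),
    TrainingExample("Where can I see my invoice?", "view_invoice"),
]
ANSWERS: Dict[str, str] = {
    "reset_password": "Use the 'Forgot password' link on the sign-in page.",
    "view_invoice": "Invoices are listed under Billing in your account.",
}


def respond(
    question: str,
    classify_intent: Callable[[str], str],
    answers: Dict[str, str],
    fallback: str = "Sorry, I did not understand the question.",
) -> str:
    """Classify the question's intent, then return the answer tied to that intent."""
    intent = classify_intent(question)
    return answers.get(intent, fallback)


def toy_classifier(question: str) -> str:
    """Stand-in for a trained classifier: a keyword lookup, for illustration only."""
    lowered = question.lower()
    if "password" in lowered:
        return "reset_password"
    if "invoice" in lowered:
        return "view_invoice"
    return "unknown"
```

Calling `respond("I forgot my password", toy_classifier, ANSWERS)` returns the answer stored for the hypothetical `reset_password` intent. If a training example carries the wrong intent label, the classifier learns to route similar questions to the wrong answer, which is why the labels need validation.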

Abstract

A method, a computer program product, and a computer system for evaluation and modification of training data for classifiers. A computer system applies each piece of training data of each class to multiple classifiers. The computer system performs evaluation and validation of the training data for each class and defines at least one representative class for each piece of the training data of each class if the classification precision of that piece of training data is greater than a predetermined threshold. The computer system modifies the training data based on the results of the evaluation and validation of the training data for each class. The computer system then performs evaluation and validation of the training data for the representative classes and modifies the training data based on the results of that evaluation and validation.
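The per-class step of the abstract can be sketched roughly as follows. The abstract does not say how the multiple classifiers are built, how classification precision is computed, or how a representative class is chosen, so this sketch makes loud assumptions: classifiers are plain callables, the "precision" of a piece is the share of classifiers voting for its most-voted class, and that most-voted class becomes the representative class when the share exceeds the threshold. The names `evaluate_piece`, `validate`, and `keyword_classifier` are hypothetical.

```python
from collections import Counter
from typing import Callable, Dict, List, Optional, Tuple

# Assumption: a classifier maps a question statement to a class label.
Classifier = Callable[[str], str]


def evaluate_piece(
    statement: str, classifiers: List[Classifier], threshold: float
) -> Tuple[Optional[str], float]:
    """Apply one piece of training data to every classifier and tally the votes."""
    votes = Counter(clf(statement) for clf in classifiers)
    top_class, top_count = votes.most_common(1)[0]
    precision = top_count / len(classifiers)  # assumed definition of precision
    representative = top_class if precision > threshold else None
    return representative, precision


def validate(
    training_data: Dict[str, List[str]],
    classifiers: List[Classifier],
    threshold: float = 0.5,
) -> List[Tuple[str, str, Optional[str], float]]:
    """Flag pieces whose representative class differs from their labeled class."""
    flagged = []
    for labeled_class, statements in training_data.items():
        for statement in statements:
            rep, precision = evaluate_piece(statement, classifiers, threshold)
            if rep != labeled_class:
                flagged.append((statement, labeled_class, rep, precision))
    return flagged


def keyword_classifier(keywords: Dict[str, str], default: str = "other") -> Classifier:
    """Toy classifier for the demo: first matching keyword decides the class."""
    def classify(statement: str) -> str:
        lowered = statement.lower()
        for keyword, label in keywords.items():
            if keyword in lowered:
                return label
        return default
    return classify


classifiers = [
    keyword_classifier({"invoice": "billing", "password": "account"}),
    keyword_classifier({"pay": "billing", "password": "account"}),
    keyword_classifier({"invoice": "billing", "forgot": "account"}),
]
training_data = {
    "billing": ["How do I pay my invoice?", "Reset my password"],  # second is mislabeled
    "account": ["I forgot my password"],
}
flagged = validate(training_data, classifiers, threshold=0.5)
```

In this toy run, `flagged` holds only the mislabeled statement "Reset my password": two of the three classifiers vote `account`, so `account` becomes its representative class and disagrees with the `billing` label. Flagged pieces are candidates for the modification step, for example relabeling or removal after review.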

Description

Technical field

[0001] The present invention relates generally to validating training data for classifiers, and more particularly to evaluating and modifying training data for classifiers of automated response systems.

Background technique

[0002] Large volumes of data including semi-structured and unstructured data are known as big data. In recent years, technologies that apply big data to business activities and medical services have attracted attention. In particular, technologies such as cognitive computing and deep learning have been applied in these fields. Cognitive computing provides the ability to recognize natural language, speech and image data. By learning from large amounts of data, deep learning has greatly improved the accuracy of machine learning.

[0003] An automatic answering system is a system applying those technologies. Auto-response systems provide answers to questions in natural language, such as speech or typed text. In such systems, by using c...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/35, G06F16/332, G06K9/62
CPC: G06F16/35, G06F16/3329, G06F18/214, G06F40/30, G06N20/00, G06N5/04, G06N3/006, G06F40/40
Inventor: 小林武彦, 松泽裕史
Owner: IBM CORP