Unlock instant, AI-driven research and patent intelligence for your innovation.

Systems and methods for constructing an artificially diverse corpus of training data samples for training a contextually-biased model for a machine learning-based dialogue system

a machine learning and dialogue system technology, applied in the field of machine learning and artificial intelligence dialogue systems, can solve the problems of severe limitations on the ability of virtual assistants to address queries or commands from users, and the modern virtual assistants implemented via a rule-based approach for categorizing user input and generating responses to users may not fully satisfy queries and commands

Inactive Publication Date: 2021-01-07
CLINC INC
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

While the rules-based approach for implementing a virtual assistant may be useful for addressing pointed or specific queries or commands made by a user, the rigid or finite nature of this approach severely limits a capability of a virtual assistant to address queries or commands from a user that exceed the scope of the finite realm of pointed and / or specific queries or commands that are addressable by the finite set of rules that drive the response operations of the virtual assistant.
That is, the modern virtual assistants implemented via a rules-based approach for categorizing user input and generating responses to users may not fully satisfy queries and commands posed by a user for which there are no predetermined rules to provide a meaningful response or result to the user.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Systems and methods for constructing an artificially diverse corpus of training data samples for training a contextually-biased model for a machine learning-based dialogue system
  • Systems and methods for constructing an artificially diverse corpus of training data samples for training a contextually-biased model for a machine learning-based dialogue system
  • Systems and methods for constructing an artificially diverse corpus of training data samples for training a contextually-biased model for a machine learning-based dialogue system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032]The following description of the preferred embodiments of the present application are not intended to limit the inventions to these preferred embodiments, but rather to enable any person skilled in the art to make and use these inventions.

1. System for a Machine Learning-Based Dialogue System

[0033]As shown in FIG. 1, a system 100 that automatically trains and / or configures machine learning models includes an artificial intelligence (AI) virtual assistant platform 110 (e.g., artificially intelligent dialogue platform), a machine learning configuration interface 120, a training / configuration data repository 130, a configuration data queue 135, and a plurality of external training / configuration data sources 140. Additionally, the system 100 may include artificial training corpus diversity sub-system 170 that may function to generate an artificially diverse training corpus based on training samples from an existing training corpus.

[0034]Generally, the system 100 functions to imple...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Systems and methods for constructing an artificially diverse corpus of training data includes evaluating a corpus of utterance-based training data samples, identifying a slot replacement candidate; deriving distinct skeleton utterances that include the slot replacement candidate, wherein deriving the distinct skeleton utterances includes replacing slots of each of the plurality of distinct utterance training samples with one of a special token and proper slot classification labels; selecting a subset of the distinct skeleton utterances; converting each of the distinct skeleton utterances of the subset back to distinct utterance training samples while still maintaining the special token at a position of the slot replacement candidate; altering a percentage of the distinct utterance training samples with a distinct randomly-generated slot token value at the position of the slot replacement candidate; and constructing the artificially diverse corpus of training samples based on a collection of the percentage of the distinct utterance training samples.

Description

CROSS-REFERENCE TO RELATED APPLICATIONS[0001]This application is a continuation of U.S. patent application Ser. No. 16 / 907,792, filed 22 Jun. 2020, which claims the benefit of U.S. Provisional Application No. 62 / 870,180, filed 3 Jul. 2019, US Provisional Application No. 62 / 870,156, filed 3 Jul. 2019, U.S. Provisional Application No. 62 / 890,461, filed 22 Aug. 2019, and U.S. Provisional Application No. 62 / 890,296, filed 22 Aug. 2019, which are incorporated herein in their entireties by this reference.GOVERNMENT RIGHTS[0002]The subject matter of the invention may be subject to U.S. Government Rights under National Science Foundation grants: NSF SBIR Phase 1 Grant—1622049 and NSF SBIR Phase 2 Grant—1738441.TECHNICAL FIELD[0003]The inventions herein relate generally to the machine learning and artificially intelligent dialogue systems fields, and more specifically to a new and useful system and method for intelligently classifying unstructured data into a machine learning-based conversat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(United States)
IPC IPC(8): G06F40/35G06N5/04G06N20/00G06F40/284
CPCG06F40/35G06F40/284G06N20/00G06N5/043G06N7/01
Inventor LEE, ANDREWLARSON, STEFANCLARKE, CHRISTOPHERLEACH, KEVINKUMMERFELD, JONATHAN K.HILL, PARKERHAUSWALD, JOHANNLAURENZANO, MICHAEL A.TANG, LINGJIAMARS, JASON
Owner CLINC INC