Machine learning of context of data fields for various document types

a technology of data fields and context, applied in the field of machine learning of context of data fields for various document types, can solve the problems of large amount of human and computing resources, difficult to efficiently update the electronic tax return preparation system to correctly, and functions that range from very simple to very complex, so as to achieve quick and accurate determination of correct functions, learn and incorporate new forms reliably, and efficiently.

Inactive Publication Date: 2018-01-18
INTUIT INC
View PDF5 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0007]Embodiments of the present disclosure address some of the shortcomings associated with traditional electronic document preparation systems by providing methods and systems for incorporating new or updated forms by utilizing machine learning in conjunction with training set data. In particular, embodiments of the present disclosure receive form data related to a new form that includes data fields to be completed in accordance with specific functions designated by the new form. Embodiments of the present disclosure determine one or more possible dependencies for each data field. Embodiments of the present disclosure utilize machine learning to quickly and accurately determine the correct function needed to complete each data field of the form. Embodiments of the present disclosure gather training set data that includes previously filled forms related to the new form in order to assist in the machine learning process. The machine learning process for learning and incorporating the new form includes generating candidate functions for each data field of the new form based on the possible dependencies. The candidate functions can include one or more operators selected from a set or superset of operators. The machine learning process applies the candidate functions to the training set data in order to determine the accuracy of the candidate functions. For each data field, embodiments of the present disclosure generate and apply candidate functions in successive iterations until a candidate function is found that produces test data that matches the data values in the corresponding completed data fields of the previously filled forms of the training set data within a threshold level of error. Embodiments of the present disclosure then output results data that indicates that the correct function for a particular data field has possibly been found. This process is repeated for each selected data field of the new form until all selected data fields of the new form have been learned and incorporated. In this way, embodiments of the present disclosure provide a more reliable electronic document preparation system that quickly, efficiently, and reliably learns and incorporates new forms.

Problems solved by technology

However, the processes that enable the electronic tax return preparation systems to incorporate new tax forms into the tax return preparation systems often utilize large amounts of human and computing resources.
If a tax form changes, or a new tax form is introduced, it can be very difficult to efficiently update the electronic tax return preparation system to correctly populate the various fields of the tax forms with the requested values.
These functions range from very simple to very complex.
This can lead to delays in releasing an updated version of the electronic tax return preparation system as well as considerable expenses.
These expenses are then passed on to customers of the electronic tax return preparation system, as are the delays.
Furthermore, these processes for updating electronic tax returns can introduce inaccuracies into the tax return preparation system.
These expenses, delays, and possible inaccuracies can have an adverse impact on traditional electronic tax return preparation systems.
Customers may lose confidence in the electronic tax return preparation systems.
These issues and drawbacks are not limited to electronic tax return preparation systems.
Any electronic document preparation system that assists users to electronically fill out forms or prepare documents can suffer from these drawbacks when the forms are updated or new forms are released.
In some cases, it may not be feasible to obtain relevant historical tax related data related to previously filed tax returns to assist in the machine learning process of a new tax form.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Machine learning of context of data fields for various document types
  • Machine learning of context of data fields for various document types
  • Machine learning of context of data fields for various document types

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025]Embodiments will now be discussed with reference to the accompanying FIG.s, which depict one or more exemplary embodiments. Embodiments may be implemented in many different forms and should not be construed as limited to the embodiments set forth herein, shown in the FIG.s, and / or described below. Rather, these exemplary embodiments are provided to allow a complete disclosure that conveys the principles of the invention, as set forth in the claims, to those of skill in the art.

[0026]Herein, the term “production environment” includes the various components, or assets, used to deploy, implement, access, and use, a given application as that application is intended to be used. In various embodiments, production environments include multiple assets that are combined, communicatively coupled, virtually and / or physically connected, and / or associated with one another, to provide the production environment implementing the application.

[0027]As specific illustrative examples, the assets...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A method and system learns new forms to be incorporated into an electronic document preparation system. The method and system receive form data related to a new form having a plurality of data fields that expect data values based on specific functions. The method and system gather training set data including previously filled forms having completed data fields corresponding to the data fields of the new form. The method and system utilize machine learning in conjunction with the training set data to identify the correct function for each of the data fields of the new form.

Description

RELATED CASES[0001]This application is a Utility application depending from the U.S. provisional patent application filed Jul. 15, 2016 having attorney docket number INTU169813, Ser. No. 62 / 362,688, and entitled “SYSTEM AND METHOD FOR MACHINE LEARNING OF CONTEXT OF LINE INSTRUCTIONS FOR VARIOUS DOCUMENT TYPES,” which is hereby incorporated herein by reference in its entirety as if the contents were presented herein directly.BACKGROUND[0002]Many people use electronic document preparation systems to help prepare important documents electronically. For example, each year millions of people use electronic tax return preparation systems to help prepare and file their tax returns. Typically, electronic tax return preparation systems receive tax related information from a user and then automatically populate the various fields in electronic versions of government tax forms. Electronic tax return preparation systems represent a potentially flexible, highly accessible, and affordable source ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): G06Q40/00G06N99/00G06N20/00
CPCG06N99/005G06Q40/123G06N5/041G06N20/00G06F40/174G06N5/013
Inventor UNSAL, CEMMUKHERJEE, SAIKATHALVORSEN, PER-KRISTIANMEIKE, ROGER CHARLES
Owner INTUIT INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products