Hybrid approach to approximate string matching using machine learning

a hybrid approach and string matching technology, applied in the field of hybrid approach to approximate string matching using machine learning, can solve the problems of inability to produce the desired information, incorrect or incomplete strings, etc., and achieve the effect of quick and accurate determination

Active Publication Date: 2018-10-25
VISA INT SERVICE ASSOC
View PDF8 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0008]Embodiments of the invention provide for advantages over conventional methods because a string stored in memory can quickly and accurately be determined from an incomplete input string. These and other embodiments of the invention are described in detail below. For example, other embodiments are directed to systems, devices, and computer readable media associated with methods described herein.

Problems solved by technology

A database which is queried with an incorrect or incomplete string cannot produce the desired information.
Human error is a common cause of producing incorrect or incomplete strings.
Likewise, digital transmission error, such as noise, interference or distortion can result in incorrect or incomplete strings.
These errors often result in cost or loss to businesses which rely on searching databases using strings.
However, language and string construction in general is typically very complicated.
As a result rule-based methods are costly to produce and implement.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hybrid approach to approximate string matching using machine learning
  • Hybrid approach to approximate string matching using machine learning
  • Hybrid approach to approximate string matching using machine learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042]String matching can be difficult to implement efficiently. The present disclosure provides for methods of identifying a corresponding string stored in memory based on an incomplete input string. These methods can enable computer systems to rapidly and accurately match strings without relying on currently used, cumbersome rule-based methods.

[0043]Embodiments can provide for the construction and use of similarity metrics, including phonetic and distance metrics. These similarity metrics can be used as inputs to a machine learning model. The machine learning model can be trained and used to provide a similarity score for any number of strings stored in memory relative to a possibly incomplete input string. The training of the model can use iterative techniques that optimize the predicted result based on a set of training data for which the result is known. The similarity scores can be used to identify a corresponding string stored in memory. Some embodiments allow for the use of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Systems, apparatuses, and methods are provided for identifying a corresponding string stored in memory based on an incomplete input string. A system can analyze and produce phonetic and distance metrics for a plurality of strings stored in memory by comparing the plurality of strings to an incomplete input string. These similarity metrics can be used as the input to a machine learning model, which can quickly and accurately provide a classification. This classification can be used to identify a string stored in memory that corresponds to the incomplete input string.

Description

BACKGROUND[0001]In recent years searching through databases has become an extremely ubiquitous part of business, government, and industrial operations. Outside of commercial search, most organizations maintain some sort of database which can be searched and accessed for information relevant to business operations, such as the names and phone numbers of employees and clients, sales records, and active projects.[0002]These databases receive inputs in the form of queries, typically taking the form of a “string,” or an ordered set of “characters.” Commonly strings take the form of a name of a person or business. Databases rely on accurate entry of string queries. A database which is queried with an incorrect or incomplete string cannot produce the desired information.[0003]Human error is a common cause of producing incorrect or incomplete strings. Over the course of thousands and thousands of keystrokes, a human typist will produce several incorrect or incomplete input strings. Likewise...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): G06N99/00G06F17/30G06N5/04G06F17/27G06N20/10
CPCG06N99/005G06F17/30657G06F17/30312G06F17/277G06F17/30241G06F17/276G06N5/04G06N20/10G06N20/20G06F40/274G06F40/284G06F16/3343G06F16/90344G06N5/01G06N20/00G06F16/3331G06F16/3329
Inventor SINGH, PRANJALBANERJEE, SOUMYAJYOTI
Owner VISA INT SERVICE ASSOC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products