Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data Mining Technique With Maintenance of Ancestry Counts

a data mining and ancestry technology, applied in the field of data mining, can solve the problems of premature convergence, difficult to test each individual against the entire database, and difficult to find useful knowledge from such data stores

Pending Publication Date: 2019-07-18
COGNIZANT TECH SOLUTIONS U S CORP
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This patent is about a computer-based system that uses data mining to improve the performance of individuals in a gene pool. The system analyzes information about the individuals and their fitness to determine which ones should be selected for further development. By taking into account the number of times an individual has been through the gene pool, the system can make informed decisions about which individuals to select. This helps to create a more efficient and effective gene pool, leading to better performance overall.

Problems solved by technology

A computer security environment may record a large number of software code examples that have been found to be malicious.
Despite the large quantities of such data, or perhaps because of it, deriving useful knowledge from such data stores can be a daunting task.
A common problem with evolutionary algorithms is that of premature convergence: after some number of evaluations the population converges to local optima and no further improvements are made no matter how much longer the algorithm is run.
When using genetic algorithms to mine a large database, it may not be practical to test each individual against the entire database.
The system therefore rarely if ever knows the true fitness of any individual.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data Mining Technique With Maintenance of Ancestry Counts
  • Data Mining Technique With Maintenance of Ancestry Counts
  • Data Mining Technique With Maintenance of Ancestry Counts

Examples

Experimental program
Comparison scheme
Effect test

example embodiment

[0041]FIG. 1 is an overall diagram of an embodiment of a data mining system incorporating features of the invention. The system is divided into three portions, a training system 110, a production system 112, and a controlled system 128. The training system 110 interacts with a database 114 containing training data, as well as with another database 116 containing the candidate gene pool. As used herein, the term “database” does not necessarily imply any unity of structure. For example, two or more separate databases, when considered together, still constitute a “database” as that term is used herein. The candidate gene pool database 116 includes a portion 118 containing the elitist pool. The training system 110 operates according to a fitness function 120, which indicates to the training system 110 how to measure the fitness of an individual. The training system 110 optimizes for individuals that have the greatest fitness, however fitness is defined by the fitness function 120. The f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Roughly described, a computer-implemented evolutionary data mining system includes a memory storing a candidate gene database in which each candidate individual has a respective fitness estimate; a gene pool processor which tests individuals from the candidate gene pool on training data and updates the fitness estimate associated with the individuals in dependence upon the tests; and a gene harvesting module for deploying selected individuals from the gene pool, wherein the gene pool processor includes a competition module which selects individuals for discarding in dependence upon their updated fitness estimate. The system maintains the ancestry count for each of the candidate individuals, and may use this information to adjust the competition among the individuals, to adjust the selection of individuals for further procreation, and / or for other purposes.

Description

CROSS-REFERENCE TO OTHER APPLICATIONS[0001]This application is a continuation of U.S. patent application Ser. No. 14 / 595,991, filed Jan. 13, 2015, entitled “Data Mining Technique With Maintenance of Ancestry Counts,” which claims priority to U.S. Provisional Application No. 61 / 932,659, filed Jan. 28, 2014, entitled “Data Mining Technique With Maintenance of Ancestry Counts,” which applications are incorporated herein by reference in their entirety.[0002]This application also relates to U.S. patent application Ser. No. 13 / 184,307, filed 15 Jul. 2011, entitled “DATA MINING TECHNIQUE WITH EXPERIENCE-LAYERED GENE POOL,” by Babak Hodjat, Hormoz Shahrzad and Greg S. Hornby, now U.S. Pat. No. 8,909,570, which application is incorporated by reference herein.BACKGROUND[0003]The invention relates generally to data mining, and more particularly, to the use of genetic algorithms to extract useful rules or relationships from a data set for use in controlling systems.[0004]In many environments, a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(United States)
IPC IPC(8): G06N3/12G06N5/02
CPCG06N3/126G06N5/025
Inventor FINK, DANIEL E.SHAHRZAD, HORMOZ
Owner COGNIZANT TECH SOLUTIONS U S CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products