Unlock instant, AI-driven research and patent intelligence for your innovation.

Detection and creation of appropriate row concept during automated model generation

A concept, technology of business intelligence system, applied in the field of data classification, can solve problems such as time-consuming

Active Publication Date: 2016-08-10
INT BUSINESS MASCH CORP
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But this is done by someone who understands the data they are modeling, and takes time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Detection and creation of appropriate row concept during automated model generation
  • Detection and creation of appropriate row concept during automated model generation
  • Detection and creation of appropriate row concept during automated model generation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] As noted above, systems to date have limited ability to convert tables into a form that can be used to answer queries. Instead of generating a data store of multiple tables from the customer's data, a single table (usually in a columnar database) is generated that matches the user's raw data, but with additional metadata describing what the system thinks each column represents.

[0019] In some embodiments, this is done for each column, as the column contains a tag for easy lookup in the classification system, and a set of data values ​​that generally represent the concept of the column. In this way, the columns themselves become query elements that can be matched to the user's question in order to generate an answer.

[0020] Unfortunately, columns are not the only meaningful elements of a data set. Instead, columns typically represent attributes of something (such as age, gender, or salary) and rows represent instances of that thing (such as person 1 or person 2). W...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A system and method for assigning concepts to sets of values. Data is received, wherein the data is organized as a plurality of named fields and as two or more sets of values associated with the named fields, wherein each named field is assigned to a category. For each category, determine whether there is at least one identifier field for that category, wherein each identifier field is a named field that acts as an identifier for that category, and identify identifier fields, if any, for each category that have a unique value in the identifier field for that category for each set of values. Then select one of the categories as a concept representing the sets of values. In some embodiments, the data is organized as a table, wherein the named fields are columns and the sets of values are rows.

Description

technical field [0001] The present invention relates to natural language analysis, and more particularly to classification of data in data sets. Background technique [0002] Accurately converting tabular data into useful query models can be difficult. Often, specialized modeling of the data is required, and the analysis tools required for conversion often require training and specialization that are not common among business users. [0003] Of course, there are many challenges in creating such a tool. If automated modeling does not reflect the data or knowledge it represents, the queries it may generate may not be useful in answering users' questions. If users' questions cannot be parsed and understood by the system, the system cannot accurately generate queries to answer their questions. Over the past 50 years, accurate natural language parsing has become a branch of computer science that is still considered in its infancy. [0004] In traditional analysis systems, the...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27
CPCG06F40/205G06F16/212G06F16/285
Inventor M.雷斯-加塞姆G.A.沃茨Q.魏
Owner INT BUSINESS MASCH CORP
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More