Patents

Literature

Patsnap Eureka AI that helps you search prior art, draft patents, and assess FTO risks, powered by patent and scientific literature data.

825results about "Semi-structured data indexing" patented technology

Filter

Efficacy Topic

Property

Owner

Technical Advancement

Application Domain

Technology Topic

Technology Field Word

Patent Country/Region

Patent Type

Patent Status

Application Year

Inventor

Real-time insurance policy underwriting and risk management

InactiveUS20060100912A1FinanceWeb data indexingPosition dependentEngineering

Provided is a technique for evaluating risk associated with underwriting an insurance policy (FIG. 1). At least one location to be covered under the insurance policy is received (FIG. 3, block 320). Risk associated with the location is automatically assessed (FIG. 3, block 338). It is determined whether to underwrite the location based on the assessed risk (FIG. 3, block 340). Also provided is a technique for proximity analysis (FIG. 54). Selection of a proximity center is received (FIG. 55). A function is executed with the proximity center to determine target data items that fall within a proximity area around the proximity center. The target data items are spatially represented.

Real-time insurance policy underwriting and risk management

Real-time insurance policy underwriting and risk management

Real-time insurance policy underwriting and risk management

Owner:THE PARADIGM ALLIANCE

Visual and interactive wrapper generation, automated information extraction from web pages, and translation into xml

InactiveUS20050022115A1Data efficientEfficient codingDigital computer detailsSemi-structured data indexingData transformationEngineering

A method and a system for information extraction from Web pages formatted with markup languages such as HTML [8]. A method and system for interactively and visually describing information patterns of interest based on visualized sample Web pages [5,6,16-29]. A method and data structure for representing and storing these patterns [1]. A method and system for extracting information corresponding to a set of previously defined patterns from Web pages [2], and a method for transforming the extracted data into XML is described. Each pattern is defined via the (interactive) specification of one or more filters. Two or more filters for the same pattern contribute disjunctively to the pattern definition [3], that is, an actual pattern describes the set of all targets specified by any of its filters. A method and for extracting relevant elements from Web pages by interpreting and executing a previously defined wrapper program of the above form on an input Web page [9-14] and producing as output the extracted elements represented in a suitable data structure. A method and system for automatically translating said output into XML format by exploiting the hierarchical structure of the patterns and by using pattern names as XML tags is described.

Visual and interactive wrapper generation, automated information extraction from web pages, and translation into xml

Visual and interactive wrapper generation, automated information extraction from web pages, and translation into xml

Visual and interactive wrapper generation, automated information extraction from web pages, and translation into xml

Owner:LIXTO SOFTWARE

Network based classified information systems

InactiveUS6397219B2Data processing applicationsWeb data indexingData displayClassified information

A system for automatically creating databases containing industry, service, product and subject classification data, contact data, geographic location data (CCG-data) and links to web pages from HTML, XML or SGML encoded web pages posted on computer networks such as the Internet or Intranets. The web pages containing HTML, XML or SGML encoded CCG-data, database update controls and web browser display controls are created and modified by using simple text editors, HTML, XML or SGML editors or purpose built editors. The CCG databases may be searched for references (URLs) to web pages by use of enquiries which reference one or more of the items of the CCG-data. Alternatively, enquiries referencing the CCG-data in the databases may supply contact data without web page references. Data duplication and coordination is reduced by including in the web page CCG-data display controls which are used by web browsers to format for display the same data that is used to automatically update the databases.

Owner:HANGER SOLUTIONS LLC +1

Indexing XML datatype content system and method

InactiveUS20050091188A1Data processing applicationsDigital data processing detailsPaper documentDocument preparation

Storing and querying XML data in a primary table or document utilizes an index of XML data and includes creating a primary table structure, creating a primary XML index commensurate with the primary table structure, populating the primary table and the primary XML index, and running a query on the XML data in a primary table by utilizing the XML index. The XML index can be implemented as a node table. The node table may have a B+-tree structure and be populated by shredding the XML values in the primary table. The XML data may be stored as binary large objects in an XML column of the primary table. Secondary XML indexes may be created to assist in the search and retrieval of XML data stored in the primary table. Both the primary XML index and the secondary XML index tables may be created using data definition language statements.

Indexing XML datatype content system and method

Indexing XML datatype content system and method

Indexing XML datatype content system and method

Owner:MICROSOFT TECH LICENSING LLC

Data collection system having reconfigurable data collection terminal

ActiveUS20070152058A1Semi-structured data indexingNatural language data processingNetwork packetExtensible markup

There is provided in one embodiment a data collection system including a data collection terminal having an encoded information reader device and a computer spaced apart from the data collection terminal. The data collection terminal in one embodiment can be configured to be responsive to configuration data expressed in an extensible markup language. The computer in one embodiment can use an existing extensible markup language document to create a data entry screen to received desired parameter settings for the data collection terminal within data entry fields of the data entry screen. The computer can further combine the extensible markup language document with the desired parameter settings to create configuration data and can initiate a transfer of the configuration data to the data collection terminal. The computer in one embodiment can be used to create for transfer to the data collection terminal a data package including file data corresponding to one or more selected files, together with additional data. The system provided can be used to transfer data, including but not limited to configuration data, between computers that are not data collection terminals and which are devoid of encoded information reader devices.

Data collection system having reconfigurable data collection terminal

Data collection system having reconfigurable data collection terminal

Data collection system having reconfigurable data collection terminal

Owner:HAND HELD PRODS

Stubbing systems and methods in a data replication environment

ActiveUS20110246416A1Web data indexingDigital data processing detailsData managementGoal system

Stubbing systems and methods are provided for intelligent data management in a replication environment, such as by reducing the space occupied by replication data on a destination system. In certain examples, stub files or like objects replace migrated, de-duplicated or otherwise copied data that has been moved from the destination system to secondary storage. Access is further provided to the replication data in a manner that is transparent to the user and / or without substantially impacting the base replication process. In order to distinguish stub files representing migrated replication data from replicated stub files, priority tags or like identifiers can be used. Thus, when accessing a stub file on the destination system, such as to modify replication dath or perform a restore process, the tagged stub files can be used to recall archived data prior to performing the requested operation so that an accurate copy of the source data is generated.

Stubbing systems and methods in a data replication environment

Stubbing systems and methods in a data replication environment

Stubbing systems and methods in a data replication environment

Owner:COMMVAULT SYST INC

Mapping and query service between object oriented programming objects and deep key-value data stores

ActiveUS20200057781A1Semi-structured data indexingSemi-structured data mapping/conversionEngineeringData mining

A mapping and query service for mapping between object-oriented programming objects and deep key-value data stores. The service to implement a store operation for a mapping and query service that supports the storage of a set of one or more objects having classes and fields written in source code of an object-oriented programming language in a deep key-value data store.

Mapping and query service between object oriented programming objects and deep key-value data stores

Mapping and query service between object oriented programming objects and deep key-value data stores

Mapping and query service between object oriented programming objects and deep key-value data stores

Owner:SALESFORCE COM INC

Method for scalable, fast normalization of XML documents for insertion of data into a relational database

InactiveUS20050097128A1Semi-structured data indexingSemi-structured data queryingRelational databaseData file

Disclosed is a method of transferring data from a hierarchical file (having a hierarchical structure, e.g., a markup language file) to a relational database structure (made up of columns and rows. Before processing the actual data, the invention first partitions the hierarchical structure into sections, where each section is dedicated to at least one node of the hierarchical structure. The partitioning process is based on the document type definition file, which is separate from, and different than the hierarchical file. After completing the partitioning, the invention then parses the actual data contained in the hierarchical data file to produce a stream of data pairs and end of section indicators. During the data parsing process, the invention loads the data pairs into corresponding “sections” (created prior to the parsing process) as the data pairs are output from the parsing process. The invention also transfers the node data from these sections to the columns and rows of the relational database structure.

Method for scalable, fast normalization of XML documents for insertion of data into a relational database

Method for scalable, fast normalization of XML documents for insertion of data into a relational database

Method for scalable, fast normalization of XML documents for insertion of data into a relational database

Owner:IBM CORP

Parent-child query indexing for XML databases

ActiveUS7171404B2Data processing applicationsSemi-structured data indexingXML databaseDocumentation

A method for processing queries for a document of elements is provided. The document includes a plurality of subsections where each subsection includes at least a portion of elements in the document. The method comprises: receiving a query for a npath of elements in the document of elements; determining a plurality of step queries from the query, each step query including at least a part of the path of elements; for each step query in the plurality of step queries, determining one or more subsections that include elements that correspond to a step query; and determining at least one subsection that includes the path of elements of the query. A result for the query is generated using the at least one subsection.

Parent-child query indexing for XML databases

Parent-child query indexing for XML databases

Parent-child query indexing for XML databases

Owner:MARKLOGIC

Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata

InactiveUS20040139091A1Efficient executionData processing applicationsDigital data processing detailsDatabaseMetadata

An index structure of metadata provided for searching for information on contents and a method for providing indices of the metadata, and a method and an apparatus for searching for the metadata using the index structure of the metadata are provided, in which the index structure of the metadata includes values of multi-keys and identification information of the metadata corresponding to the value of the multi-key, wherein the multi-keys are structured by combination of predetermined fields of the metadata.

Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata

Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata

Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata

Owner:SAMSUNG ELECTRONICS CO LTD

Visual and interactive wrapper generation, automated information extraction from Web pages, and translation into XML

InactiveUS7581170B2Efficient codingDigital computer detailsBiological modelsXMLWrapping procedure

A method and a system for information extraction from Web pages formatted with markup languages such as HTML [8]. A method and system for interactively and visually describing information patterns of interest based on visualized sample Web pages [5,6,16-29]. A method and data structure for representing and storing these patterns [1]. A method and system for extracting information corresponding to a set of previously defined patterns from Web pages [2], and a method for transforming the extracted data into XML is described. Each pattern is defined via the (interactive) specification of one or more filters. Two or more filters for the same pattern contribute disjunctively to the pattern definition [3], that is, an actual pattern describes the set of all targets specified by any of its filters. A method and for extracting relevant elements from Web pages by interpreting and executing a previously defined wrapper program of the above form on an input Web page [9-14] and producing as output the extracted elements represented in a suitable data structure. A method and system for automatically translating said output into XML format by exploiting the hierarchical structure of the patterns and by using pattern names as XML tags is described.

Visual and interactive wrapper generation, automated information extraction from Web pages, and translation into XML

Visual and interactive wrapper generation, automated information extraction from Web pages, and translation into XML

Visual and interactive wrapper generation, automated information extraction from Web pages, and translation into XML

Owner:LIXTO SOFTWARE

Efficient type annontation of XML schema-validated XML documents without schema validation

InactiveUS20050177578A1Efficiently annotate either entire XML document and XMLDigital data processing detailsSemi-structured data indexingXML schemaPaper document

Type annotation record information storage for annotated automaton encoding for high-performance XML schema validation is optimized in a space efficient aspect. Subsequent to type annotation record information organization, type annotation records are used for type annotation of validated XML documents, either by implementing annotation records and type annotation part of an algorithm only, or by skipping one or more validation steps in a full validation implementation. Given a schema context, a type annotation may be performed for a validated XML fragment as opposed to an entire document. In addition, default features such as attribute and type are supported.

Efficient type annontation of XML schema-validated XML documents without schema validation

Efficient type annontation of XML schema-validated XML documents without schema validation

Efficient type annontation of XML schema-validated XML documents without schema validation

Owner:IBM CORP

Parallel tree searches for matching multiple, hierarchical data structures

InactiveUS7058644B2Data processing applicationsSemi-structured data indexingData processing systemData set

Methods and systems in a data-processing system for matching data contained in a hierarchical data tree structure. One or more sets of data contained within a first data tree structure can be associated with one or more sets of data contained within a second data tree structure, such that the data associated with the first data tree structure is generally utilized to process the data associated with the second data tree structure. The first data tree structure can then be compared in parallel to the second data tree structure beginning with a first root thereof to thereby identify data similarities between the first and second data tree structures based on a predefined search criteria. Finally, one or more matching set of data between the first data tree structure and the second data tree structure can be identified, in response to comparing the first data structure to the second data structure.

Parallel tree searches for matching multiple, hierarchical data structures

Parallel tree searches for matching multiple, hierarchical data structures

Parallel tree searches for matching multiple, hierarchical data structures

Owner:IBM CORP +1

Electronic information management system for abstracting and reporting document information

InactiveUS7047491B2The process is convenient and fastFinanceSemi-structured data indexingDisplay deviceElectronic information

An electronic information management system abstracts and reports document information. The system utilizes a computer communicating with an electronic database. An input device is operatively connected to the computer for entering document information into the database. A plurality of electronic indexing tags are provided at predetermined locations within the stored document information and cooperate with a computer program to capture and retrieve selected portions of the document information. A display device is operatively connected to the computer for displaying the selected portions of document information apart from non-selected portions of document information.

Electronic information management system for abstracting and reporting document information

Electronic information management system for abstracting and reporting document information

Electronic information management system for abstracting and reporting document information

Owner:AMTDIRECT HLDG LLC

Systems and methods for generating longitudinal data profiles from multiple data sources

PendingUS20160292456A1Cryptography processingDigital data protectionData sourceData records

A computer-implemented method for generating a longitudinal data profile from multiple disparate data sources is provided. The method includes storing, at a central data hub, first de-identified data received from a first data source, the first de-identified data including a plurality of data records having encrypted identifying data and an anonymous ID assigned to each record, wherein the anonymous ID is assigned based on a master list that includes a list of identifiers and corresponding anonymous IDs for each identifier. The method further includes storing second de-identified data received from a second data source, and storing third de-identified data received from a third data source. The method further includes processing the first, second, and third de-identified data to link the first, second, and third de-identified data using the anonymous ID, and generating the longitudinal data profile from the linked first, second, and third de-identified data.

Systems and methods for generating longitudinal data profiles from multiple data sources

Systems and methods for generating longitudinal data profiles from multiple data sources

Systems and methods for generating longitudinal data profiles from multiple data sources

Owner:ABBVIE INC

Method and apparatus for semantic search of schema repositories

InactiveUS20070185868A1Leveling precisionSemi-structured data indexingSpecial data processing applicationsWeb serviceThe Internet

Mechanisms for searching XML repositories for semantically related schemas from a variety of structured metadata sources, including web services, XSD documents and relational tables, in databases and Internet applications. A search is formulated as a problem of computing a maximum matching in pairwise bipartite graphs formed from query and repository schemas. The edges of such a bipartite graph capture the semantic similarity between corresponding attributes of the schema based on their name and type semantics. Tight upper and lower bounds are also derived on the maximum matching that can be used for fast ranking of matchings whilst still maintaining specified levels of precision and recall. Schema indexing is performed by ‘attribute hashing’, in which matching schemas of a database are found by indexing using query attributes, performing lower bound computations for maximum matching and recording peaks in the resulting histogram of hits.

Method and apparatus for semantic search of schema repositories

Method and apparatus for semantic search of schema repositories

Method and apparatus for semantic search of schema repositories

Owner:IBM CORP

Efficiently storing indented threads in a threaded discussion application

InactiveUS20070106635A1Data processing applicationsDigital data processing detailsDatabase

A method, apparatus, and computer-readable media for retrieving records in a hierarchical set of the records having a plurality of hierarchical levels and a plurality of hierarchical depths, each of the records having a tag that is unique within the hierarchical set of the records, comprises identifying one of the records in the hierarchical set of the records; modifying the tag, thereby producing a key; indexing the hierarchical set of the records only once, thereby selecting one or more of the records within the hierarchical set of the records, wherein indexing the hierarchical set of the records only once comprises applying the key to the hierarchical set of the records; and retrieving the selected records.

Efficiently storing indented threads in a threaded discussion application

Efficiently storing indented threads in a threaded discussion application

Efficiently storing indented threads in a threaded discussion application

Owner:ORACLE INT CORP

Method and system for storing, retrieving, and managing data for tags

ActiveUS20070273518A1Semi-structured data indexingSensing record carriersData storeUsage data

This invention relates generally to a method and system for storing, retrieving, and managing data for tags that are associated in some manner to any type of object. More particularly, the present invention writes data to these tags, reads data from these tags, and manages data that is written to and / or read from these tags. In addition, the invention accesses and / or stores data associated with tags from or into repositories, constructs and maintains data structures from these repositories and responds to queries using the data structures.

Method and system for storing, retrieving, and managing data for tags

Method and system for storing, retrieving, and managing data for tags

Method and system for storing, retrieving, and managing data for tags

Network based classified information systems

InactiveUS20010021935A1Data processing applicationsWeb data indexingData displayClassified information

A system for automatically creating databases containing industry, service, product and subject classification data, contact data, geographic location data (CCG-data) and links to web pages from HTML, XML or SGML encoded web pages posted on computer networks such as the Internet or Intranets. The web pages containing HTML, XML or SGML encoded CCG-data, database update controls and web browser display controls are created and modified by using simple text editors, HTML, XML or SGML editors or purpose built editors. The CCG databases may be searched for references (URLs) to web pages by use of enquiries which reference one or more of the items of the CCG-data. Alternatively, enquiries referencing the CCG-data in the databases may supply contact data without web page references. Data duplication and coordination is reduced by including in the web page CCG-data display controls which are used by web browsers to format for display the same data that is used to automatically update the databases.

Owner:HANGER SOLUTIONS LLC +1

Information processing method, information processing device, and information processing program

InactiveUS20090106298A1Semi-structured data indexingSpecial data processing applicationsInformation processingArray data structure

A partial tree is efficiently removed from a tree, or another tree is inserted into a tree. A flag is set for the node identifier of a node constituting a partial tree to be removed in an erasure flag array. Referencing the value of the erasure flag array, node identifiers which are consecutive integers given to the nodes while giving precedence to the nodes in the same generation over the child nodes are given to the nodes in the tree from which the partial tree has been removed. The given node identifiers are associated with those of the nodes in the tree before the partial tree is removed and stored in a new node identifier array, which is stored in a storage device. According to a parent-child relation definition array and the node identifier array, the parent-child definition array of the tree from which the partial tree is removed where the node identifiers of the parent node of the nodes having the node identifiers are stored in association with the new node identifiers is stored in the storage device.

Information processing method, information processing device, and information processing program

Information processing method, information processing device, and information processing program

Information processing method, information processing device, and information processing program

Owner:TURBO DATA LAB INC

Indexing profile for efficient and scalable XML based publish and subscribe system

InactiveUS7062507B2Efficiently providedFast informationData processing applicationsSemi-structured data indexingGraphicsConfigfs

The present invention provides a system and method for the efficient indexing and delivery of information to interested users who have expressed an interest in or “subscribed” to information items that are continuously released or “published” by some data source in XML format. Previously, publish and subscribe systems accepted keyword-based subscription profiles and did not support subscription to XML documents according to their structures. Direct approach to implement XML-based publish and subscribe system by checking each user profile against an XML document is very time consuming. The presentation invention, though, provides an efficient method to identify interested subscribers for each XML document by indexing queries utilizing a graphical structure of nodes. When an XML document is published, the index identifies all matched expressions in the index and delivers at least a portion of an XML document to a user who has expressed an interest in receiving this information.

Owner:THE BOEING CO

Indexing and querying semi-structured data

ActiveUS9507848B1Semi-structured data indexingText database indexingSemi-structured dataData mining

Generating an inverted index is disclosed. Semi-structured data from a plurality of sources is parsed to extract structure from at least a portion of the semi-structured data. The inverted index is generated using the extracted structure. The inverted index includes a location identifier and a data type identifier for one or more entries of the inverted index.

Indexing and querying semi-structured data

Indexing and querying semi-structured data

Indexing and querying semi-structured data

Owner:VMWARE INC

Structured-document search apparatus and method, recording medium storing structured-document searching program, and method of creating indexes for searching structured documents

InactiveUS6853992B2Increase speedData processing applicationsDigital data processing detailsDocument preparationDocumentation

A query in which a sibling relationship among document parts, which are elements of a structured document, can be designated as a search condition is input, and a query tree which represents the query in a tree structure is created. A query converting unit refers to a hierarchical index in which a hierarchical relationship among document parts of each structured document to be searched is expressed in a tree structure, and converts the query tree to a Boolean expression. A text-index referring unit refers to a text index in which is registered information representing a relationship between each set including a character string in text data and a part-ID of a meta part and a document-ID of a document, thereby searching a document corresponding to the Boolean expression converted from the query tree.

Structured-document search apparatus and method, recording medium storing structured-document searching program, and method of creating indexes for searching structured documents

Structured-document search apparatus and method, recording medium storing structured-document searching program, and method of creating indexes for searching structured documents

Structured-document search apparatus and method, recording medium storing structured-document searching program, and method of creating indexes for searching structured documents

Owner:FUJITSU LTD

Mechanism for efficiently searching XML document collections

ActiveUS8650182B2Digital data processing detailsSemi-structured data indexingDocument preparationDocumentation

The techniques presented herein are directed towards providing a user-directed keyword-based search on a large collection of XML documents, and displaying a summary of results to the user. Prior to receiving search requests from a user, an offline analysis of a large collection of XML documents is performed to construct an inverted index of keywords. For each keyword, the index stores a set of location indicators that identify all the instances of the keyword found in the collection of documents. Once the index is constructed, keyword searching can be done efficiently by a keyword lookup in the index. Various display strategies enable the user to see the specific portion of a large XML document containing the keyword and / or path frequency information allowing the user to easily refine the search to specific paths within the collection of documents.

Mechanism for efficiently searching XML document collections

Mechanism for efficiently searching XML document collections

Mechanism for efficiently searching XML document collections

Owner:ORACLE INT CORP

Automated tagging of objects in databases

ActiveUS20100082575A1Difficult to searchMaintain relationshipDigital data processing detailsMetadata multimedia retrievalWeb siteSubject matter

Embodiments of the present invention provide systems and methods for automatically generating tag terms (or tags) for objects in databases of a web site. The metadata of the objects (or data) of the web site are processed and parsed to automatically generate tag terms for the corresponding objects. Information (or data, or content) downloaded from the Internet often comes with metadata, which can exist in titles, description, sources, and authors of the information, etc. The metadata of downloaded information can be process and parsed to generate tag terms for the corresponding objects. The system can automatically generate tag terms for the data, which are stored as objects in the databases, and make the data (or objects) searchable. In addition, the automatically generated tag terms allow associated data to maintain their relationship. For example, data from the same sources, same authors, or same subjects can be identified based on the common tag terms. Automatically generated tag terms enable searching and association of data (or objects) in databases in a web site.

Automated tagging of objects in databases

Automated tagging of objects in databases

Automated tagging of objects in databases

Owner:R2 SOLUTIONS

Keyword based evaluation expert intelligent search and recommendation method

ActiveCN103605665AIntelligent searchAchieve recommendationSemi-structured data indexingSpecial data processing applicationsFull text searchInformation index

The invention discloses a keyword based evaluation expert intelligent search and recommendation method. The keyword based evaluation expert intelligent search and recommendation method specifically comprises step 1, segmenting an expert information main text into substring sequences, performing ICTCLAS word segmentation of Chinese academy of sciences and performing stop word filtering on the result of the word segmentation to obtain the word collection; step 2, extracting feature words of the expert information according to fields; step 3, building an expert knowledge representation model based on the fields and the weight of the feature words and establishing an expert information index database; step 4, performing automatic prompting according to a search term thesaurus when a user inputs keywords and meanwhile performing real-time update on the search term thesaurus through a search term counter; step 5, calculating the search relevance between the keywords and the expert information based on the semantic information and the like; step 6, listing relevant experts from high to low according to the matching degree. According to the keyword based evaluation expert intelligent search and recommendation method, the intelligent full-text search and recommendation of the expert information can be achieved through the keyword input and accordingly the experts which are matched with a pended science and technology project can be searched out accurately.

Keyword based evaluation expert intelligent search and recommendation method

Keyword based evaluation expert intelligent search and recommendation method

Keyword based evaluation expert intelligent search and recommendation method

Owner:HANGZHOU DIANZI UNIV

Index structure for supporting structural XML queries

InactiveUS20050114314A1Data processing applicationsSemi-structured data indexingPaper documentWildcard character

The present invention provides a ViST (or “virtual suffix tree”), which is a novel index structure for searching XML documents. By representing both XML documents and XML queries in structure-encoded sequences, it is shown that querying XML data is equivalent to finding (non-contiguous) subsequence matches. A variety of XML queries, including those with branches, or wild-cards (‘*’ and ‘ / / ’), can be expressed by structure-encoded sequences. Unlike index methods that disassemble a query into multiple sub-queries, and then join the results of these sub-queries to provide the final answers, ViST uses tree structures as the basic unit of query to avoid expensive join operations. Furthermore, ViST provides a unified index on both content and structure of the XML documents, hence it has a performance advantage over methods indexing either just content or structure. ViST supports dynamic index update, and it relies solely on B+Trees without using any specialized data structures that are not well supported by common database management systems (hereinafter referred to as “DBMSs”).

Index structure for supporting structural XML queries

Index structure for supporting structural XML queries

Index structure for supporting structural XML queries

Owner:IBM CORP

Associating objects in databases by rate-based tagging

InactiveUS20100082576A1Increase user ' confidenceWeight moreDigital data processing detailsMetadata multimedia retrievalWeb siteObject store

Embodiments of the present invention provide automatic systems and methods for associating objects in databases of a web site by rate-based tagging. The frequencies of users entering specific tag terms for objects stored in the databases of the web site are used to determine hard associations between objects and tag terms and between objects. When the frequencies of user tags exceed established thresholds, hard associations between objects and tag terms are established. When objects are identified or determined to have hard association with tag terms, the objects are determined to be more clearly associated with the corresponding tag terms. Therefore, they should be highlighted or featured in more prominent locations on web pages of the web site to increase users' confidence in content of the web site. To identify hard-associated objects, more weights can be assigned to the hard-associated objects, which allows them to be more likely to be selected for display in prominent locations. In addition, objects that are determined to have hard associations with tag terms can also have hard associations with one another due to the common tag terms they share. The hard association relationship between objects can be displayed through links to associated objects when an object is selected for display.

Associating objects in databases by rate-based tagging

Associating objects in databases by rate-based tagging

Associating objects in databases by rate-based tagging

Owner:R2 SOLUTIONS

Element query method and system

InactiveUS20080010256A1Efficient processingSemi-structured data indexingSpecial data processing applicationsPosition dependentPaper document

Methods, systems, and computer-readable media for representing and querying positional information for a hierarchical document (such as an XML document) are disclosed. In one set of embodiments, at least one word in the hierarchical document is associated with one or more word positions, and at least one element in the hierarchical document is associated with one or more word position ranges. The word positions and word position ranges are analyzed to determine whether a particular word or phrase is a direct or indirect descendant of a particular element in the hierarchical document. In various embodiments, the word positions are indexed in a first index and the word position ranges are indexed in a second index. Thus, the analysis may be efficiently performed by intersecting the first and second indexes. In further embodiments, the word position ranges may be encoded in a space efficient format for storage or transmittal.

Element query method and system

Element query method and system

Element query method and system

Owner:MARKLOGIC

Automated method for detecting and repairing configuration conflicts in a content management system

InactiveUS8417676B2Digital data processing detailsMultimedia data retrievalConfigfsDocument type definition

Embodiments of the invention provide for detecting and (in at least some cases) repairing XML configuration conflicts in a content management system (CMS). One method allows a CMS to evaluate various configuration components and determine when those components may conflict with one another. If a conflict is detected, the CMS may be configured to notify an administrator of the problem, and in some cases, correct the problem. As a result, administrators may not have to carefully evaluate each configuration file associated with a document type definition for a given document type before creating or modifying a content processing rule.

Automated method for detecting and repairing configuration conflicts in a content management system

Automated method for detecting and repairing configuration conflicts in a content management system

Automated method for detecting and repairing configuration conflicts in a content management system

Owner:INT BUSINESS MASCH CORP

Popular searches

Unstructured textual data retrieval Multiple digital computer combinations Input/output processes for data processing Sensing by electromagnetic radiation Database indexing Two-way working systems Selective content distribution Memory systems Office automation Document management systems