Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

261 results about "Data layout" patented technology

A data layout is a structure applied to a system that defines how the data fields are organized. Data layouts can be different for each system. Different systems organize their data based on their core functionality. A CRM systems data layout may focus mainly on customer and account data where an eCommerce platform...

Media aware distributed data layout

A storage system comprises a plurality of vdisks, with each vdisk containing a plurality of storage segments, and each segment providing a specific class of service (CoS). Each vdisk stores files with data and meta data distributed among its storage segments. A storage system comprises a memory having multiple classes of service. The system comprises an interface for storing a file as blocks of data associated with a class of service in the memory. The interface chooses the class of service for a block on a block by block basis. A file system for storing a file comprises a plurality of vdisks, with each vdisk having a plurality of inodes. Each inode of each vdisk stores data on one or more segments, with each segment having a different class of service. The system comprises a controller which stores data of a file in an inode of a vdisk, in one or more segments of that vdisk. A file system for storing a file comprises a plurality of vdisks, and each vdisk having a plurality of inodes. The system comprises a controller including a plurality of processors, with each processor serving one or more of the vdisks. A file system for storing comprises a plurality of vdisks, with each vdisk having a plurality of inodes, a plurality of inodes of at least one vdisk storing data on a plurality of segments, each segment having a different class of service. The system comprises a controller which stores data of the file in one or more segments of one vdisk. A method for storing a file.
Owner:MICROSOFT TECH LICENSING LLC

Method to reduce I/O for hierarchical data partitioning methods

A method and system for generating a decision-tree classifier from a training set of records, independent of the system memory size. The method includes the steps of: generating an attribute list for each attribute of the records, sorting the attribute lists for numeric attributes, and generating a decision tree by repeatedly partitioning the records using the attribute lists. For each node, split points are evaluated to determine the best split test for partitioning the records at the node. Preferably, a gini index and class histograms are used in determining the best splits. The gini index indicates how well a split point separates the records while the class histograms reflect the class distribution of the records at the node. Also, a hash table is built as the attribute list of the split attribute is divided among the child nodes, which is then used for splitting the remaining attribute lists of the node. The method reduces I/O read time by combining the read for partitioning the records at a node with the read required for determining the best split test for the child nodes. Further, it requires writes of the records only at one out of n levels of the decision tree where n>/=2. Finally, a novel data layout on disk minimizes disk seek time. The I/O optimizations work in a general environment for hierarchical data partitioning. They also work in a multi-processor environment. After the generation of the decision tree, any prior art pruning methods may be used for pruning the tree.
Owner:IBM CORP

Intelligent hash data layout method, cluster storage system and cluster storage method

The invention discloses an intelligent hash data layout method, a cluster storage system and a cluster storage method. The intelligent hash data layout method is used for laying out storage nodes in a data volume. Data is distributed through a hash distribution method by taking a directory as a basic unit. The parent directory of a file utilizes extended attributes to record the mapping information of the storage nodes. Sub-files are distributed in the storage nodes to which the parent directory belongs. The distribution method of the sub-files in the storage nodes to which the parent directory belongs specifically comprises one of the following situations that: the sub-files are distributed in the storage nodes to which the parent directory belongs through the hash distribution method; the sub-files are distributed in the storage nodes to which the parent directory belongs through a zonal two-level distribution method; the sub-files are distributed in the storage nodes to which the parent directory belongs through a duplicate two-level distribution method; and the sub-files are firstly distributed in the storage nodes to which the parent directory belongs through the zonal two-level distribution method and then are distributed through a duplicate three-level distribution method. The system and the methods provided by the invention have the advantages that the extensibility, the performance, the availability and the applicability of the cluster storage system can be remarkably improved, and the load pressure of a storage server can be greatly decreased.
Owner:中关村科技租赁股份有限公司

Method and structure for improving processing efficiency in parallel processing machines for rectangular and triangular matrix routines

A computerized method (and structure) of linear algebra processing on a computer having a plurality of processors for parallel processing, includes, for a matrix having elements originally stored in a memory in a rectangular matrix AR or especially of one of a triangular matrix AT format and a symmetric matrix AS format, distributing data of the rectangular AR or triangular or symmetric matrix (AT, AS) from the memory to the plurality of processors in such a manner that keeps all submatrices of AR or substantially only essential data of the triangular matrix AT or symmetric matrix AS is represented in the distributed memories of the processors as contiguous atomic units for the processing. The linear algebra processing done on the processors with distributed memories requires that submatrices be sent and received as contiguous atomic units based on the prescribed block cyclic data layouts of the linear algebra processing. This computerized method (and structure) defines all of its submatrices as these contiguous atomic units, thereby avoiding extra data preparation before each send and after each receive. The essential data or AT or AS is that data of the triangular or symmetric matrix that is minimally necessary for maintaining the full information content of the triangular AT or symmetric matrix AS.
Owner:IBM CORP

Gray-code-based distributed data layout method and query method

InactiveCN102890678AImplement aggregation queryImprove the efficiency of sequential accessSpecial data processing applicationsTheoretical computer scienceEngineering
The invention belongs to the technical field of a database, and discloses a gray-code-based distributed data layout method. The method comprises the steps of: dividing a range of each attribute into a plurality of equal portions; encoding according to a gray code order; marking a certain attribute value of a tuple comprising a plurality of attributes through the gray codes of the equal portions of the attribute value, namely an index code of the attribute value; forming an index key value of the tuple by mixing the index code of each attribute value in the tuple, wherein the tuple achieves the distributed data layout according to the order of the gray codes, the distributed data layout is deployed on a distributed system, the bitmap index of content perception is achieved at a host computer terminal of the system and the content perception is stored in a file name, and physical storage of data and statistical index of the data are achieved on a slave terminal of the system. The invention also discloses a query method employing the database formed by means of the method. The data layout obtained by the method can meet the requirements of data processing such as exact matching search, range search, multi-dimensional search, multi-attribute search and aggregated analysis, and the method is high in disc access efficiency.
Owner:EAST CHINA NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products