Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

154 results about "Uncertain data" patented technology

In computer science, uncertain data is data that contains noise that makes it deviate from the correct, intended or original values. In the age of big data, uncertainty or data veracity is one of the defining characteristics of data. Data is constantly growing in volume, variety, velocity and uncertainty (1/veracity). Uncertain data is found in abundance today on the web, in sensor networks, within enterprises both in their structured and unstructured sources. For example, there may be uncertainty regarding the address of a customer in an enterprise dataset, or the temperature readings captured by a sensor due to aging of the sensor. In 2012 IBM called out managing uncertain data at scale in its global technology outlook report that presents a comprehensive analysis looking three to ten years into the future seeking to identify significant, disruptive technologies that will change the world. In order to make confident business decisions based on real-world data, analyses must necessarily account for many different kinds of uncertainty present in very large amounts of data. Analyses based on uncertain data will have an effect on the quality of subsequent decisions, so the degree and types of inaccuracies in this uncertain data cannot be ignored.

Data cleaning system and method for aiming at big data

ActiveCN104317801ARealize uncertain sampling functionSolve the problem of not being able to adapt to large-scale data operationsSpecial data processing applicationsDistributed File SystemData preparation
The invention discloses a data cleaning system and method for aiming at big data. A system application layer comprises a data analysis and extraction module, a similar joins module, a similar subgraph gathering module, an entity sampling module and a probability and entity query module, a storage layer stores a structural data record, a similar data record pair and a similar communication subgraph generated in a data cleaning process by utilizing a distributed storage tool HDFS (Hadoop Distributed File System) provided by Hadoop, and the cleaned structural data record is stored by utilizing a distributed storage tool HBase provided by the Hadoop. The method comprises the following steps: obtaining data to be cleaned, carrying out similar joins, enabling the similar subgrpahs to be gathered, sampling an entity, and carrying out probability calculation and entry query. The invention is a data cleaning system for aiming at the big data and an uncertain data certainty method, solves a problem that traditional centralized similar joins can not adapt to large-scale data operation, fully utilizes graphs and relevant knowledge to creatively finish big data cleaning, and provides a data preparation for the analysis of mass data.
Owner:BEIJING INSTITUTE OF TECHNOLOGYGY

Pulmonary nodule image classification method when uncertain data is contained in data set

The invention relates to the technical field of computer vision, and provides a pulmonary nodule image classification method when uncertain data is contained in a data set. The method comprises the following steps: firstly, collecting a pulmonary nodule CT image set, determining the category of the image through a majority voting principle by utilizing an expert voting method, and preprocessing toobtain a pulmonary nodule CT image data set; then, based on a knowledge distillation method, constructing a pulmonary nodule image classification model comprising a teacher model and a student model;next, obtaining a determined tag data set, training a teacher model on the determined tag data set, and calculating a soft tag on the pulmonary nodule CT image data set; then, training a student model on the data set combining the hard label and the soft label; and finally, inputting the preprocessed CT image to be classified into the trained lung nodule image classification model to obtain the category of the lung nodule image classification model. According to the method, the uncertain label data in the data set can be effectively utilized, the accuracy and efficiency of pulmonary nodule diagnosis are improved, and the usability and robustness are high.
Owner:沈阳铭然科技有限公司

Network security situation fuzzy evaluation method based on uncertain data

The invention provides a fuzzy evaluation method used for evaluating a network security situation value of a hierarchical model, and the method comprises the steps that: an evaluation factor set and a judgment set used for evaluating network security situation indexes of a top-level node of the hierarchical model are determined; a membership function is established, and the probability that attribute data of each child node in the evaluation factor set belong to different judgment ranks in the judgment set is determined according to the membership function; a fuzzy evaluation matrix is established on the basis of the determined probability that each child node belongs to different judgment ranks; the importance weight of each child node over the top-level node of the hierarchical model isdetermined; and the network security situation indexes of the top-level node is calculated according to the established fuzzy evaluation matrix and the determined importance weight of each child nodeover the top-level node of the hierarchical model, and thereby the network security situation value of the hierarchical model is finally calculated. The fuzzy evaluation method has the advantages that the security situation of a network is evaluated through introducing fuzzy mathematics and a fuzzy analytical hierarchy process (FAHP) into the fuzzy evaluation method, and thereby the problem of data uncertainty during the evaluation of the network security situation is well solved.
Owner:NAT UNIV OF DEFENSE TECH

Maximal pattern mining method for uncertain data based on depth-first

The invention relates to a maximal pattern mining method for uncertain data based on depth-first. The maximal pattern mining method comprises three major technical parts of uncertain data processing, frequent item set judgment and the maximal pattern mining method. The uncertain data processing refers to converting an uncertain data horizontal format of which the main key is an affair ID into an uncertain data vertical format of which the main key is an item ID by virtue of data vertical format conversion. The frequent item set judgment refers to the process of calculating whether the support degree of an item set is greater than or equal to a given support degree threshold and whether the confidence degree of the item set is greater than or equal to a given confidence degree threshold. The maximal pattern mining method is the process of mining the maximal frequent item set, and in the mining process, the converted vertical-format data is taken as the input, and all the uncertain data maximal pattern frequent item sets are mined out according to the given support degree and confidence degree thresholds. The maximal pattern mining method for the uncertain data based on depth-first is capable of effectively obtaining the value information in the uncertain data, and also has high mining efficiency.
Owner:WUXI SIKURUI TECH INFORMATION

Method and device for determining physical cellular identification

ActiveCN103327505ASolve the problem of unreasonable selectionNetwork planningTelecommunicationsTraffic load
The invention discloses a method and device for determining a physical cellular identification. According to the method, total bidirectional telephone traffic of a cell, load reference signal receiving power (RSRP) of the cell in a measurement report and load RSRP, which is received by the cell, of adjacent cells are loaded to no-load RSRP of the cell and no-load RSRP, which is received by the cell, of all the adjacent cells in a weighting mode to obtain the RSRP of the cell and all interference RSRP of the cell, wherein the RSRP of the cell and the interference RSRP of the cell are weighted and reflect the actual load and network operation conditions of the cell. PCI selection is conducted under the situation that the RSRP of the cell and the interference RSRP of the cell meet predetermined adjustment conditions according to judgment after the RSRP of the cell and the interference RSRP of the cell are weighted. By means of the method and device for determining the physical cellular identification, the network no-load conditions, the actual traffic load conditions of the cell and the actual network operation conditions can be combined for PCI planning and selection, and the problem that the PCI selection is unreasonable due to the fact that PCI planning is conducted according to uncertain data in the prior art can be solved.
Owner:CHINA MOBILE GROUP DESIGN INST

Internet-of-vehicles big data cross-domain analysis fusion method

The invention relates to an Internet-of-vehicles big data cross-domain analysis fusion method, which is mainly characterized in that an Internet-of-vehicles cloud data mining architecture is established, and the Internet-of-vehicles cloud data mining architecture comprises a distributed data access engine, a parallel mining engine, proxy nodes and a Web server cluster; performing data mining by adopting an Internet-of-vehicles data mining algorithm; and realizing a parallel function of the shared memory by adopting a shared memory parallel computing technology. According to the method, a clouddata mining architecture which is composed of a distributed data access engine, a parallel mining engine, a Web server cluster and agent nodes and can support parallel computing is adopted, so that the supporting capability for mass data is improved; through a data preprocessing technology, an uncertain data preprocessing technology and an Internet-of-vehicles industry data processing and fusiontechnology, support of Internet-of-vehicles specific data such as streaming data is optimized; based on novel data mining algorithms such as mining, analysis, clustering technology, behavior recognition and anomaly detection of the Internet-of-vehicles streaming data, the intelligent level of the system is improved.
Owner:天津神舟通用数据技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products