Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

81 results about "CURE data clustering algorithm" patented technology

CURE (Clustering Using REpresentatives) is an efficient data clustering algorithm for large databases. Compared with K-means clustering it is more robust to outliers and able to identify clusters having non-spherical shapes and size variances.

Reactive voltage partitioning method based on spectral clustering

The invention relates to the voltage control of the electric power field and especially relates to a reactive voltage partitioning method based on spectral clustering. A topologic matrix with a weight is used to construct a simplified power grid model. According to a spectral clustering definition, a Laplace matrix is acquired. Through an improved K-means clustering algorithm, clustering is performed on different characteristic vectors in a characteristic matrix. During clustering, modularity Q is introduced to be taken as an index of measuring an area partitioning quality. A partitioning scheme with a largest modularity Q value is selected as an initial partitioning scheme. Connectivity verification and reactive verification are performed on each area of the initial partitioning scheme. If the area can not simultaneously satisfy two conditions of area static state reactive balance and an enough reactive reserve margin, under the condition that a value of partitioning modularity Q does not change greatly, node adjusting is performed till that all the verification conditions are satisfied. In the invention, a topology structure of a complex power grid is embodied, calculating complexity is reduced, an integration evaluation system is established based on the modularity, the reactive balance and a reactive reserve index, and integration verification is performed on a partitioning result so as to ensure feasibility of the partitioning scheme.
Owner:XIHUA UNIV

Box separation method based on k-means clustering

The invention discloses a box separation method based on k-means clustering. The box separation method comprises the following steps that continuous variables are preprocessed; normalization processing is carried out on the preprocessed data, a k-means clustering algorithm is applied on the data obtained after the normalization processing is carried out to divide the data into a plurality of sections; the equal interval method is adopted for setting the initial center of the k-means clustering algorithm to obtain clustering centers; after the clustering centers are obtained, the midpoint of the adjacent clustering centers is used as a classification division point, each object is added into the closest class, and therefore the data are divided into the multiple sections; each clustering center is calculated again, then the data are divided again until each clustering center does not change any more, and the final clustering result is obtained. According to the box separation method, the technical problem that errors are likely to be caused for a data set with the obvious data density distribution bias according to an existing box separation method is solved, the k-means clustering algorithm does not select the initial center randomly any more, and the data separation result is accurate.
Owner:GUANGDONG POWER GRID CO LTD INFORMATION CENT

Brain MR image segmentation method based on superpixel fuzzy clustering

The invention relates to a brain MR image segmentation method based on superpixel fuzzy clustering. The method comprises the following steps: 1) obtaining an MR image; 2) performing superpixel segmentation on the MR image and thus a plurality of atom regions are obtained; 3) performing secondary refinement segmentation on the atom regions whose grey value variance is big; 4) performing fuzzy clustering on the atom regions, so that category membership degree of each atom region is obtained; 5) defining the atom regions, whose membership degrees are not clear enough, as fuzzy blocks and realizing affiliation category judgment for the fuzzy blocks through a function iterative method; and 6) performing superpixel merging operation on the atom regions and thus image segmentation results are obtained. The method is a combination of a superpixel method and a fuzzy c-mean clustering algorithm; advantages of the superpixel method and the fuzzy c-mean clustering algorithm are effectively utilized to targetedly overcome the defect that the fuzzy c-mean clustering algorithm is sensitive to noise and a bias field during pixel level clustering; and compared with the traditional fuzzy c-mean clustering algorithm, the method is higher in segmentation accuracy and robustness.
Owner:山东幻科信息科技股份有限公司

Differential privacy protection-oriented k-means clustering method adopting

The invention discloses a differential privacy protection-oriented k-means clustering method. The K-means clustering method comprises the following steps: performing data preprocessing; ensuring thatC indicates a clustered centered point set, and C indicates a sum of error square of a given data set and a cluster center C; judging the volume of C; performing cyclic execution until retry is greater than a maximum value retrymatx of given retry times, and then returning to the best central point Cbest; traversing each point of the data set X, classifying the points to the nearest central point;setting added random noises; renewedly calculating the sum of the data points of each cluster and the quantity of the points, and adding the noises and finally updating the quality center of the cluster; and repeatedly carrying out the steps until the sum of error square is converged or iteration times reach the upper limit. According to the differential privacy protection-oriented k-means clustering method disclosed by the invention, the appropriate random noises which are specially distributed are added in an iteration process of a k-means clustering algorithm, so that a clustering result is distorted to a certain extent, the aim of privacy protection is fulfilled, and meanwhile, the availability of data is ensured.
Owner:DONGGUAN MENGDA INDAL INVESTMENT

Thunderstorm kernel identification and tracing method based on hybrid clustering algorithm

The invention discloses a thunderstorm kernel identification and tracing method based on a hybrid clustering algorithm. The method specifically includes the following steps of utilizing deployed thunder and lightning monitoring points to conduct exploration and record cloud-to-ground lightning data, conducting preprocessing on the recorded cloud-to-ground lightning data and dividing the data intolightning data sets of each equal time interval; adopting a GPS clock synchronization technology and an algorithm of time differences of arrival for figuring out spatial positioning coordinates of lightning according to the time differences of arrival of changeable radiation pulses of an electric field generated by the lightning to each station; on the basis of thunder and lightning positioning data figured out by means of the procedures above, utilizing a DBSCAN algorithm and a KMEANS algorithm to figure out the relevance of the thunder and lightning positioning data among a thunderstorm kernel center-of-mass coordinate position, the lightning frequency and a thunderstorm kernel. An experimental result indicates that the method can accurately reflect the change tendency of thunder and lightning on thunderstorm days, and great effects of identifying the thunderstorm kernel and movably tracing a thunderstorm are achieved.
Owner:安徽佳讯信息科技有限公司

Dynamic weighted hybrid clustering algorithm based circuit breaker fault diagnosis method

The invention discloses a dynamic weighted hybrid clustering algorithm based circuit breaker fault diagnosis method. The method includes the following steps: (1) capturing the energy changes of mechanical drive during the operation of a circuit breaker by utilizing three-axis vibration and two-way sound signals, decomposing the signals through local mean, and extracting the approximate entropy ofeach PF component as the characteristic quantity of a circuit breaker vibration signal; (2) optimizing the initialized clustering center of fuzzy kernel clustering by utilizing the maximum density peak decision of a density peak clustering algorithm, and considering different influences of different characteristics and different samples on clustering results; (3) performing checking on a clustering number K through a cluster validity index MIA; (4) inputting correctly classified characteristics into a multi-level classifier of a support vector machine to perform training; and (5) finding the optimal parameter of the support vector machine through mesh generation, and inputting test data samples to perform final fault classification prediction so that classification accuracy rates can be obtained;. The method has advantaged of being fast in fault diagnosis speed and high in accuracy rate.
Owner:JIYUAN POWER SUPPLY COMPANY OF STATE GRID HENAN ELECTRIC POWER

An online traffic identification method based on incremental clustering algorithm

The invention belongs to the network technical field, in particular to an online traffic identification method based on an incremental clustering algorithm. The method includes: an offline recognitionstage and an online recognition stage, wherein in the offline recognition stage,a semi-supervised learning flow algorithm based on an improved K-means algorithm is used to perform preliminary clustering and mapping work on the prepared training data sets, and the data sets which are preliminarily classifiedare obtained; in the online recognition stage,based on the completed clustering and mappingdata sets formed in the offline identification phase, incremental clustering is used to determine the network application type of the newly added data streams online, so as to achieve the purpose oftraffic identification. According to the method,based on machine learning technology, by constructing a suitable recognition model to learn the prepared data, the online traffic can be incrementally clustered in real time, and the preliminary semi-supervised classification can be carried out by combining the prepared training set, which can realize the online recognition of network traffic, and has good real-time performance and high recognition rate.
Owner:HARBIN ENG UNIV

SAR image segmentation method based on manifold distance two-stage clustering algorithm

InactiveCN103136757ASmall amount of calculationReflect the distribution characteristicsImage analysisCanopy clustering algorithmCluster algorithm
The invention discloses an SAR image segmentation method based on a manifold distance two-stage clustering algorithm. The method mainly solves the problem that an existing clustering segmentation algorithm is unstable in result and big in calculated amount. Achieving steps includes: (1) setting an ending condition and an operation parameter; (2) inputting an image to be segmented, and conducting pre-segmentation on the image to be segmented; (3) extracting characteristics of an image block obtained by pre-segmentation; (4) taking Euclidean distance to be used as similarity measurement to conduct first stage clustering for the characteristics of the image block; (5) taking a clustering center of a first stage and a point farthest from the center as representative points; (6) calculating a manifold distance between any two representative points; (7) taking the manifold distance as the similarity measurement to conduct clustering of a second stage of the representative points; (8) refreshing a clustering center; and (9) judging whether an end condition is achieved, if the end condition is not achieved, returning the step (7), or outputting a segmentation result image. The method has the advantages of being accurate in segmentation result, stable, short in time, and capable of being used in the technical fields such as image strengthening, pattern recognition and target tracking.
Owner:XIDIAN UNIV

Network table semantic recovery method

The invention provides a network table semantic recovery method. The method comprises the steps that based on a Probase lexeme database, preliminary semantic recovery is conducted on a network table to be recovered, and a candidate concept set of each column in the network table is obtained; according to combination distances among different tuples in the network table, each initial clustering center in a clustering algorithm is determined, the tuples in the network table are involved into clusters where the initial clustering centers are located, the clustering centers of the clusters are adjusted, and according to the final clustering center of the clusters, a network table after the shrinkage is conducted is obtained; according to the candidate concept set of each column in the network table and the network table after the shrinkage is conducted, column tags and column entities of all the columns of the network table are recovered out. According to the method, by selecting the initial clustering centers and calculating the similarity based on the combination distances, the K-means clustering algorithm can be improved, the scale of the network table is effectively shrunk, the complexity to fulfill a task is reduced, and the accuracy of recovering the column tags and the column entities of the network table is improved.
Owner:BEIJING JIAOTONG UNIV

Automatic segmentation method for fuzzy spectral clustering brain tumor images based on super pixel

The invention comprises invention discloses an automatic segmentation method for fuzzy spectral clustering brain tumor images based on super pixel, comprising the following steps of : firstly, performing super pixel segmentation on a FLAIR mode image of magnetic resonance imaging containing brain tumors, and extracting gray histogram features of super pixel blocks as input of an algorithm, calculating a fuzzy similarity matrix of images through the input features; then performing clustering through NJW spectral clustering algorithm to obtain a final segmentation result. According to the automatic segmentation method for fuzzy spectral clustering brain tumor images based on super pixel, fuzzy theory is used to optimize similarity measurement mode of spectral clustering, fuzzy weight parameters are introduced in Gaussian distance measurement method of spectral clustering, and a fuzzy similarity measurement mode based on super pixel features is defined. The invention is an automatic imagesegmentation method, human intervention is not needed, and time complexity of spectral clustering algorithm is greatly reduced and segmentation accuracy can be improved by utilizing fuzzy spectral clustering algorithm based on super pixel.
Owner:ANHUI UNIVERSITY +1

Physiological data preclinical processing method and system

The invention discloses a physiological data preclinical processing method and system. The physiological data preclinical processing method comprises preprocessing physiological data based on the time series; carrying out association rule analysis according to the calculated abrupt change score and by means of a multidimensional abrupt change detection model and an integrated learning algorithm fused with multiple classifiers, and obtaining a disease associated network according to the result of association rule analysis; selecting a disease associated network characteristic from the disease associated network by means of an improved clustering algorithm, obtaining a disease diagnosis result according to the disease associated network characteristic and historical data, wherein the improved clustering algorithm is based on a nonnegative matrix decomposition theory and a self-learning mechanism, and extracting a corresponding connected subgraph from large graph data of the disease associated network as the disease associated network characteristic through adjusting the subgraph density. The physiological data preclinical processing method and system are advantaged in that the method and system are wide in applicability, and high in efficiency and precision, and is flexible and convenient, and can be widely applied to the field of data processing.
Owner:广东速创数据技术有限公司

Internet data analysis system

The invention discloses an internet data analysis system which comprises a data preprocessing module and a data analysis module, wherein the data preprocessing module extracts main content from webpage information of the internet, a text corresponding to each webpage is obtained through filtration, the obtained texts are firstly segmented by a segmentation device to obtain a plurality of segmentation words, and segmentation words highlighting characteristics of the texts are reserved through dimensionality reduction of characteristic values; and the data analysis module selects one or more categories of algorithms from a classification algorithm, a clustering algorithm, an association rule algorithm and special rule matching algorithm according to analysis requirements, each category of algorithm adopts one or more algorithms for processing the segmentation words which are subjected to dimensionality reduction and correspond to the webpages output by the data preprocessing module, and the analysis result is stored. With the adoption of the internet data analysis system, the defect of inaccurate data analysis result caused by a single data mining algorithm is overcome, or the time cost due to the need of secondary system development when other algorithms are used on the basis of one algorithm is saved, and efficiency and accuracy of data analysis are improved.
Owner:SHANGHAI CHRUST INFORMATION TECH

Image division method based on biogeography optimization

The invention belongs to the technical field of image processing, in particular to an image division method based on biogeography optimization capable of being used for image enhancement, mode identification, target tracking and the like. The method comprises the following steps that parameters are initialized; pictures to be divided are input, and clustering centers are initialized; a fuzzy matrix is calculated; the clustering centers are calculated again; the fitness value of each clustering center is extracted; the immigration rate and the emigration rate of each emigration center are extracted; the emigration centers are updated according to mutation operators; and division results are output. Because the biogeography migration strategy is adopted for optimizing the clustering centers, the calculation quantity is reduced, the selected clustering centers after the optimization have the overall situation characteristics, the problem of initialization sensitivity of the traditional clustering algorithm is solved, and the stability and the clustering performance of the clustering algorithm are improved. Because the immigration and emigration strategies are introduced for optimizing the clustering centers, the data center distribution characteristics can be more accurately reflected, in addition, the corresponding clustering center updating rules are designed, and the calculation quantity is reduced.
Owner:HARBIN ENG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products