Patents
Literature
Patsnap Copilot is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Patsnap Copilot

56results about How to "Improve clustering quality" patented technology

Wave band selection method for hyperspectral remote-sensing image

The invention discloses a wave band selection method for a hyperspectral remote-sensing image. The wave band selection method improves the traditional wave band selection method for the hyperspectral remote-sensing image, which is analyzed on the basis of the important point of a time sequence. The wave band selection method for the hyperspectral remote-sensing image comprises the following steps: firstly, on the basis of visual clustering tendency evaluation, clustering by a spectral clustering algorithm to reduce a clustering number search range and improve the clustering quality; then, when an important redundant wave band is finally reduced, removing parts of a high-redundancy wave band according to the condition mutual information among wave bands; and searching an optimal wave band combination with a branch and bound method to improve the classification precision and reduce a final wave band number. Compared with the prior art, the wave band selection method for the hyperspectral remote-sensing image, which is disclosed by the invention, has a bigger advantage on the aspects of finally-selected wave band numbers and the corresponding classification correction rate, and the required calculation time is far lower than calculation time required with most traditional methods.
Owner:HOHAI UNIV

Privacy protection clustering method for big data analysis and computer storage medium

InactiveCN110334757AHigh cluster availabilityGood clustering qualityCharacter and pattern recognitionDigital data protectionCluster resultPrivacy protection
The invention discloses a privacy protection clustering method for big data analysis and a computer storage medium. The method comprises the following steps: normalizing data and selecting a central point; calculating a minimum privacy budget and distributing a privacy budget sequence, dividing a sample point to a nearest center point, generating Laplace noise, adding noise to parameters in the process of updating the center point, and performing continuously iterating until the difference of error quadratic sums of two adjacent iterations is smaller than a threshold value or the maximum iteration frequency is reached. According to the method, the sensitive information in the data set is protected by adding the noise obeying the Laplace distribution to the intermediate parameter in the clustering algorithm execution process; the problem that sensitive information of a data set is leaked in the execution process of the clustering algorithm is solved, the privacy budget allocation mode of the differential privacy protection clustering algorithm is improved, the availability of clustering results is improved under the same privacy protection degree, and the privacy leakage problem inbig data clustering mining is solved.
Owner:NANJING UNIV OF POSTS & TELECOMM

Cluster classification method for wind power plant

The invention belongs to the field of simulation of electric power systems, and particularly relates to a cluster classification for a wind power plant. Clusters are classified in a unit of the wind power plant according to the actually measured operating data of the wind power plant. In the process of acquiring the data, the actually measured data probably contain noise data because of the factors like the defect or the execution error of a measurement system. In order to reduce the interference of the noise data, the isolated point data in the actually measured operating data of the wind power plant are firstly processed according to the potential value of a sample point. When the central initial positions of the two clusters are nearer during the cluster classification, more redundant information is contained, and the classification result easily becomes the locally best. Aiming at the problem, a sample group with the smallest Euclidean distance moves towards the mean value point, the mean value of the moved sample group replaces the original sample group, so that the method acquires the central position of the diversified initial clusters, and the global searching ability is improved. By the adoption of the cluster classification for the wind power plant, provided by the invention, wind turbine generators having the near operating points are classified in the same cluster, and the equivalent modeling approach for the wind power plant is optimized.
Owner:NORTH CHINA ELECTRIC POWER UNIV (BAODING)

Dynamic social network community structure evolution method based on incremental clustering

The present invention discloses a dynamic social network community structure evolution method based on incremental clustering to solve the problems of community structure detection and communication evolution tracking in a large scale network. The method comprises a step of extracting the core node of a whole network to form a core sub graph, a step of running a hierarchical clustering algorithm on the core sub graph at a time t=0 to obtain the initial structure of a core community, and using an extended algorithm on the above basis to obtain the community structure of the whole network, and a step of using an incremental clustering algorithm to obtain the core community structure of the whole network at present time according to the dynamic evolution condition of an adjacent time network at a time t which is larger than 0 and extending the core community structure to obtain a whole community structure. Through introducing the core sub graph, the incremental calculation in the whole network is avoided, the processing speed is accelerated, and thus the method is suitable for the community discovery in the large scale network. In addition, through introducing the concept of a community structure shift, the large error of the community structure after long time evolution is avoided, and the accuracy of community evolution tracking is improved.
Owner:UNIV OF ELECTRONICS SCI & TECH OF CHINA

Clustering method, system and medium for automatically confirming cluster number based on coefficient of variation

The invention discloses a clustering method, a system and a medium for automatically confirming the number of clusters based on a coefficient of variation, wherein, the density value of each data point in a data set is calculated, the density index is calculated according to the density value, and the data point with the largest density index is selected as a first clustering center; Calculating the shortest distance between each data point and the existing clustering center, calculating the probability that each data point is selected as the clustering center according to the shortest distance, and preselecting the clustering center according to the roulette disc method; Until the set cluster centers are selected, the initial cluster centers selected are used for k-means clustering to generate a corresponding number of clusters; Calculate the average intra-cluster coefficient of variation and the minimum inter-cluster coefficient of variation, then calculate the difference between theaverage intra-cluster coefficient of variation and the minimum inter-cluster coefficient of variation, compare the difference with the set value, and if the difference is less than the set value, merge the two clusters with the minimum inter-cluster coefficient of variation; Until the difference is greater than or equal to the set value, the clustering result is output.
Owner:UNIV OF JINAN

Improved density peak clustering-based social network community discovery method

The invention discloses an improved density peak clustering-based social network community discovery method. The method comprises the following steps of: firstly calculating two indexes for each userin a network, wherein the two indexes comprise local densities and relative distances, the local densities are calculated by adoption of Gaussian kernel density estimation, and the relative distancesrepresent a distances between users and points which are greater than the users in the aspect of density and which are close to the users; selecting a point which has a large local density and relatively large relative distance as a community center on the basis of Gaussian distribution, and distributing the residual non-center points to communities of points which are greater than the non-centerpoints in the aspect of density and which are closest to the non-center points; and finally, measuring distance between every two communities on the basis of combination factors, wherein the communities, the combination factors of which are greater than a given threshold value, are combined into one community. Compared with the prior art, the method is capable of discovering spherical and non-spherical community structures in social networks at the same time, so that fewer parameters are needed under the premise of obtaining relatively high correctness and then the problem of clustering communities with any shapes is solved.
Owner:HUAZHONG UNIV OF SCI & TECH

Multi-view clustering and mining oriented personal privacy protection method

The invention discloses a multi-view clustering and mining oriented personal privacy protection method, and belongs to the technical field of information safety. The multi-view clustering and mining oriented personal privacy protection method has the advantages that privacy partial-order topological classification algorithms (PT, privacy topology) are proposed, privacy relations are defined at first for representing sensitivity difference of different privacy data, privacy partial-order sets are constructed for representing the sensitivity difference of the different privacy data, topologicalclassification algorithms are accordingly designed for the privacy data, and privacy line order sets are solved; multi-view clustering is carried out on views of original data, privacy degrees, tuplesensitivity, the privacy line order sets and the like for multiple views of the privacy data; clustering oriented personal anonymity algorithms (PPOC, personal privacy oriented classtering) are proposed, personal protection operation can be carried out on different clusters by variable k-anonymity strategies by the aid of multi-view clustering oriented privacy protection algorithms which can meetpersonal requirements, and personal protection operation with different exertion degrees can be carried out on different tuples in the same clusters by the variable k- anonymity strategies by the aidof the multi-view clustering oriented privacy protection algorithms.
Owner:HARBIN ENG UNIV

Semi-supervised image clustering subspace learning algorithm based on local linear regression

InactiveCN102968639ATotal Forecast Error OptimizationImprove clustering effectCharacter and pattern recognitionData setInner class
The invention discloses a semi-supervised image clustering subspace learning algorithm based on local linear regression. Firstly, a local linear regression model is used for predicting a coordinate of a training sample in a clustering subspace, a local prediction error between a predicted value and a true value is obtained, and then a minimized objective function of a total predicted error is obtained; then according to two constrain conditions of inter-class dispersion maximization and inner-class dispersion minimization, and a marked sample and an unmarked sample are used for calculating an inter-class dispersion matrix and a total dispersion matrix; and finally, the inter-class dispersion matrix and the total dispersion matrix are blended in the minimized objective function of the total predicted error to obtain an objective function for solving clustering subspace, and function solving is performed through generalized characteristic root to obtain the optimal clustering subspace. The semi-supervised image clustering subspace learning algorithm based on the local linear regression makes full use of the marked sample, the unmarked sample and a local adjacent relation in a training data set to obtain good clustering results.
Owner:WUHAN UNIV OF SCI & TECH

Intention recognition method and device based on multi-round K-means algorithm, and electronic equipment

PendingCN111966798APrecise intent classification and identificationImproving intent clustering qualityNatural language data processingSpeech recognitionCluster resultEngineering
The invention provides an intention recognition method and device based on a multi-round K-means algorithm, and electronic equipment. The method comprises steps that a sample data set is established,the sample data set comprises a plurality of semantic vectors obtained through conversion of a dialogue text, and the dialogue text is converted from voice input when a user performs dialogue with anintelligent voice robot; multiple rounds of clustering processing are conducted on the sample data set by using a K-means algorithm, and an initial clustering result is outputted; fusion denoising isconducted on all the initial clustering results to form a final clustering result; and based on the final clustering result, intention recognition is conducted on the voice input when the current userperforms conversation with the intelligent voice robot. According to the method, an improved K-means algorithm is adopted to perform multiple rounds of clustering processing, and fusion denoising isperformed on clustering results of the multiple rounds of clustering so that more accurate intention classification and identification can be realized, the intention clustering quality can be enhanced, and the method can further be optimized.
Owner:北京奇保信安科技有限公司

Kmeans clustering method for efficacy of traditional Chinese medicinal materials based on node similarity

The invention discloses a Kmeans clustering method for the efficacy of traditional Chinese medicinal materials based on node similarity. The method comprises the following steps: collecting related traditional Chinese medicine data, and processing the data to form a prescription composition library, a medicinal material efficacy library and a channel-tropism binary table of the nature and taste ofmedicinal materials; summarizing and classifying the efficacy of the traditional Chinese medicinal materials according to 23 efficacy tables, and constructing a medicinal material efficacy matrix; constructing a prescription-medicinal material bipartite network based on the prescription composition library; calculating expected values of medicinal material pairs based on degree distribution, andtaking the expected values of the medicinal material pairs as the similarity of the traditional Chinese medicinal materials; establishing a Kmeans clustering model based on the similarity of the traditional Chinese medicinal materials; and clustering the traditional Chinese medicinal materials based on the clustering model to obtain potential effects possibly possessed by the traditional Chinese medicinal materials. According to the method, the accuracy of Kmeans clustering via a medicinal material similarity matrix can reach 0.728. Meanwhile, Kmeans is used for clustering the nature-taste channel-tropism data of traditional medicinal materials, an obtained final result is 0.646, which is about 0.08 higher; and therefore, clustering result is allowed to be more accurate through the method.
Owner:HANGZHOU NORMAL UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products