Unlock instant, AI-driven research and patent intelligence for your innovation.

Information management method and device

A document and collection technology, applied in the video field, can solve problems such as high algorithm time overhead, inability to obtain clustering results, and difficulty in estimation, achieving the effect of strong versatility, low overhead, and good scalability.

Active Publication Date: 2018-09-04
XIAOMI INC
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, in the K-means algorithm, the K value is used to describe the number of initial cluster centers, which is a pre-specified value and is usually difficult to estimate, so it is not possible to know in advance how many categories a given data set should be clustered into. most suitable
Secondly, in the K-means algorithm, it is necessary to determine an initial division according to the initial clustering center, and then optimize the initial division. Therefore, the selection of the initial clustering center has a greater impact on the clustering results. Once the initial value is selected Not good, may not be able to get effective clustering results
In addition, the K-MEANS algorithm needs to continuously adjust the sample classification and continuously calculate the adjusted new cluster centers. Therefore, when the amount of data is very large, the time overhead of the algorithm is very large.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Information management method and device
  • Information management method and device
  • Information management method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numerals in different drawings refer to the same or similar elements unless otherwise indicated. The implementations described in the following exemplary examples do not represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatuses and methods consistent with aspects of the present disclosure as recited in the appended claims.

[0069] The terminology used in the present disclosure is for the purpose of describing particular embodiments only, and is not intended to limit the present disclosure. As used in this disclosure and the appended claims, the singular forms "a", "the", and "the" are intended to include the plural forms as well, unless the context clearly dictates otherwise. It should also be understood...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an information management method and device. The invention provides a text clustering method. The method comprises the following steps: calculating the similarity between inputted search files of a preset quantity and files in a database; respectively clustering the files with the similarity to the search file in the database reaching a threshold value to obtain a first collection cluster; clustering collections under the same file in the first collection cluster to obtain a clustering result. By changing the traditional clustering concept, the quantity of initial clustering centers is not designated, the initial classification is not carried out, the clustering is completed by virtue of a search way, and the universality is higher; meanwhile, the clustering center does not need to be continuously adjusted in the clustering process, so that the expenditure is small, and the expandability is better.

Description

technical field [0001] The present disclosure relates to the field of video technology, in particular to a user text clustering method and device. Background technique [0002] Cluster analysis is one of the main tasks of data mining. The so-called data mining is usually related to computer science, through statistics, online analysis and processing, intelligence retrieval, machine learning, and pattern recognition and many other methods, from a large amount of data through algorithms to search for information hidden in it. [0003] At present, the commonly used clustering algorithm in the field of data mining is the K-MEANS algorithm. The K-MEANS algorithm randomly selects K documents from N documents as centroids, measures the distance from each remaining document to each centroid, and puts It classifies to the nearest centroid, and then recalculates the centroid of each class that has been obtained, and then repeats this process until the new centroid is equal to or smal...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/30
CPCG06F16/334G06F16/35
Inventor 于亮王海洲韩爱君
Owner XIAOMI INC