Unlock instant, AI-driven research and patent intelligence for your innovation.

A clustering method and system for user personal files

A user file, user-oriented technology, applied in the direction of the file system, file access structure, relational database, etc., can solve the problem of increasing the amount of calculation

Active Publication Date: 2022-04-01
INST OF INFORMATION ENG CHINESE ACAD OF SCI
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Directly processing files ignores the user's habit of saving similar files (for example, files in the same directory are more likely to belong to the same cluster), and will significantly increase unnecessary calculations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A clustering method and system for user personal files
  • A clustering method and system for user personal files
  • A clustering method and system for user personal files

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] In order to make the above-mentioned features and advantages of the present invention more comprehensible, the following specific embodiments are described in detail in conjunction with the accompanying drawings.

[0047] This embodiment provides a clustering method for user personal files, such as figure 2 shown, including the following 3 steps:

[0048] 1. File grouping

[0049] Files are grouped by using the user's saving habits for similar files. The goal is to ensure that the files in each file group are classified into as few clusters as possible, and there are no duplicate files between file groups.

[0050] The following grouping strategies can be used:

[0051] 1) Use the file tree directory structure for grouping:

[0052] Use NodeSet to represent all directory nodes in the tree, calculate the "distance to the root node" and "the distance to the farthest leaf node" of each directory node, and use symbols X and Y to represent them respectively, then for eac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a clustering method for user's personal files. The steps include: using the user's habit of saving similar files to group the user's files to obtain multiple file groups; and clustering the files in the file group to obtain one or There are multiple local clusters, and the contents of files in each local cluster are similar; each local cluster is regarded as a file, and all local clusters are clustered to generate a global cluster. The present invention also provides a clustering system for user personal files, including a clustering calculation unit, a clustering result storage unit, and a clustering result search unit, wherein the clustering calculation unit includes a batch file clustering calculation unit and an incremental file clustering unit. class computing unit.

Description

technical field [0001] The invention relates to the field of user personal file management, which can realize clustering of user files according to content, and in particular relates to file clustering based on user usage habits. Background technique [0002] With the popularity of computer office applications, users edit and generate a large number of files (such as WORD / WPS / PDF, etc.) in their work. According to the design of the existing file system, these files are stored in a tree structure in the computer, which is significantly different from the way people store (memorize) files. People's memory of documents is often organized by task (or topic). One problem caused by this difference is that users need to invest a lot of effort in searching and managing files, especially when multiple versions of the same theme are stored in different locations, making subsequent searches more difficult. [0003] At present, Personal Information Management (PIM) studies how to retr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/13G06F16/172G06F16/28
CPCG06F16/137G06F16/172G06F16/285
Inventor 李鹏王斌齐保元周美林郭莉梅钰
Owner INST OF INFORMATION ENG CHINESE ACAD OF SCI