User personal file-oriented clustering method and system

A user file, user-oriented technology, applied in relational databases, special data processing applications, instruments, etc., can solve problems such as increasing the amount of calculation, and achieve the effect of improving calculation efficiency

Active Publication Date: 2018-08-14
INST OF INFORMATION ENG CHINESE ACAD OF SCI
View PDF6 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Directly processing files ignores the user's habit of saving similar files (for example, files in the same directory are more likely to belong to the same cluster), and will significantly increase unnecessary calculations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • User personal file-oriented clustering method and system
  • User personal file-oriented clustering method and system
  • User personal file-oriented clustering method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] In order to make the above-mentioned features and advantages of the present invention more comprehensible, the following specific embodiments are described in detail in conjunction with the accompanying drawings.

[0047] This embodiment provides a clustering method for user personal files, such as figure 2 shown, including the following 3 steps:

[0048] 1. File grouping

[0049] Files are grouped by using the user's saving habits for similar files. The goal is to ensure that the files in each file group are classified into as few clusters as possible, and there are no duplicate files between file groups.

[0050] The following grouping strategies can be used:

[0051] 1) Use the file tree directory structure for grouping:

[0052] Use NodeSet to represent all directory nodes in the tree, calculate the "distance to the root node" and "the distance to the farthest leaf node" of each directory node, and use symbols X and Y to represent them respectively, then for eac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a user personal file-oriented clustering method. The method comprises the steps of grouping user files by utilizing storage habits of users to similar files, thereby obtaining multiple file groups; clustering the files in the file groups to obtain one or more local clusters, wherein file contents in each local cluster are similar; and by regarding each local cluster as a file, clustering all the local clusters to generate a global cluster. The invention furthermore provides a user personal file-oriented clustering system. The system comprises a clustering calculation unit, a clustering result storage unit and a clustering result search unit, wherein the clustering calculation unit comprises a batch file clustering calculation unit and an incremental file clustering calculation unit.

Description

technical field [0001] The invention relates to the field of user personal file management, which can realize clustering of user files according to content, and in particular relates to file clustering based on user usage habits. Background technique [0002] With the popularity of computer office applications, users edit and generate a large number of files (such as WORD / WPS / PDF, etc.) in their work. According to the design of the existing file system, these files are stored in a tree structure in the computer, which is significantly different from the way people store (memorize) files. People's memory of documents is often organized by task (or topic). One problem caused by this difference is that users need to invest a lot of effort in searching and managing files, especially when multiple versions of the same theme are stored in different locations, making subsequent searches more difficult. [0003] At present, Personal Information Management (PIM) studies how to retr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/137G06F16/172G06F16/285
Inventor 李鹏王斌齐保元周美林郭莉梅钰
Owner INST OF INFORMATION ENG CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products