Incremental processing method of hierarchical concept vectorization in personal big data management

A concept vector and processing method technology, applied in the management, organization, query and retrieval of personal big data, can solve problems such as large amount of calculation, large amount of calculation in the process of concept vectorization, deviation, etc.

Active Publication Date: 2020-02-21
ZHEJIANG UNIV OF TECH
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Generally speaking, since the feature items of concepts are usually thousands, the calculation of the concept vectorization process is relatively large.
If the traditional document vectorization method is used for concept vectorization, when the number of concepts changes, such as the addition of new concepts or the deletion of old concepts, all existing concept vectors will be biased; if the vector space is rebuilt, the amount of calculation is usually bigger
[0006] In addition, traditional document vectorization techniques are mostly based on single-level document classification structures, which are not suitable for direct application to concept trees.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Incremental processing method of hierarchical concept vectorization in personal big data management
  • Incremental processing method of hierarchical concept vectorization in personal big data management
  • Incremental processing method of hierarchical concept vectorization in personal big data management

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] In the following, with reference to the accompanying drawings, the method for vectorized incremental processing of hierarchical concepts in personal big data management of the present invention will be further described in detail.

[0058] Reference figure 1 , A concept vectorization method based on hierarchical conceptual structure, which is applied to the conceptual space layer of personal big data management model. The personal big data management model is used to complete a series of functions such as the organization, storage, management, and processing of personal information, including the resource layer, the conceptual space layer, and the application layer:

[0059] F1. The resource layer includes personal information stored in the DBMS, file system, and other systems. Personal information in the file system includes text data and non-text data. Text data includes email, pdf files, office files, html files and other data, and non-text data includes video, audio, an...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Provided is a personal big data management hierarchy concept vectorizing incrementation processing method. The method comprises the following steps that 1, when a system run for the first time, all concepts are vectorized, and all branching nodes are subjected to concept vector merging operation; 2, when a user operates a concept tree, the substeps of 2.1 obtaining concept vectors and total word number of operated nodes and father nodes thereof, 2.2 modifying the concept vectors of the father nodes according to a formula, 2.3 conducting recursive implementation from the substep 2.1 by taking the father nodes as the operated nodes till a root node and 2.4 updating an inverse document frequency vector are executed; 3, when errors are accumulated to a certain degree, the substeps of 3.1 obtaining current inverse document frequency vector and an inverse document frequency initial value vector, 3.2 updating all vector weights in a vector space in a batched mode and 3.3 updating the inverse document frequency initial value vector are executed. According to the method, the personal big data management hierarchy concept vectorizing incrementation calculation method is achieved, the concept vectors in the concept space can be rapidly adjusted, and the execution efficiency is improved.

Description

Technical field [0001] The invention relates to the management, organization, query and retrieval technology of personal big data, in particular to a hierarchical concept vectorization method based on a vector space model and an incremental calculation method thereof. Background technique [0002] With the development of information technology, personal data has exploded, including personal documents (text, images, voice), emails, health data, personal mobile phone contact information (WeChat, QQ), Internet data, etc., which have entered personal big data ( Personal big data) era; the development of wearable devices will further intensify the growth of data. People can record what they hear and see, and collect physiological health data throughout the day. How to manage and organize personal big data, through simple operations, can always get accurate, appropriate, complete, and high-quality information in the right place is a goal of the personal information management system. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/33
CPCG06F16/3347
Inventor 杨良怀汪庆顺庄慧范玉雷龚卫华方文菲
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products