Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Personal big data management hierarchy concept vectorizing incrementation processing method

A technology of concept vectors and processing methods, which is applied in the fields of organization and personal big data management, query and retrieval, and can solve problems such as deviation, large amount of calculation, and not suitable for concept trees

Active Publication Date: 2017-05-17
ZHEJIANG UNIV OF TECH
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Generally speaking, since the feature items of concepts are usually thousands, the calculation of the concept vectorization process is relatively large.
If the traditional document vectorization method is used for concept vectorization, when the number of concepts changes, such as the addition of new concepts or the deletion of old concepts, all existing concept vectors will be biased; if the vector space is rebuilt, the amount of calculation is usually bigger
[0006] In addition, traditional document vectorization techniques are mostly based on single-level document classification structures, which are not suitable for direct application to concept trees.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Personal big data management hierarchy concept vectorizing incrementation processing method
  • Personal big data management hierarchy concept vectorizing incrementation processing method
  • Personal big data management hierarchy concept vectorizing incrementation processing method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] In the following, the vectorized incremental processing method of hierarchical concept in personal big data management of the present invention will be further described in detail in conjunction with the accompanying drawings.

[0058] refer to figure 1 , a concept vectorization method based on a hierarchical concept structure, which is applied to the concept space layer of a personal big data management model. The personal big data management model is used to complete a series of functions such as the organization, storage, management, and processing of personal information, including the resource layer, conceptual space layer, and application layer:

[0059] F1. The resource layer includes personal information stored in DBMS, file system, and other systems. Personal information in the file system includes text data and non-text data. Text data includes data such as email, pdf files, office files, and html files. Non-text data includes data such as video, audio, and p...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Provided is a personal big data management hierarchy concept vectorizing incrementation processing method. The method comprises the following steps that 1, when a system run for the first time, all concepts are vectorized, and all branching nodes are subjected to concept vector merging operation; 2, when a user operates a concept tree, the substeps of 2.1 obtaining concept vectors and total word number of operated nodes and father nodes thereof, 2.2 modifying the concept vectors of the father nodes according to a formula, 2.3 conducting recursive implementation from the substep 2.1 by taking the father nodes as the operated nodes till a root node and 2.4 updating an inverse document frequency vector are executed; 3, when errors are accumulated to a certain degree, the substeps of 3.1 obtaining current inverse document frequency vector and an inverse document frequency initial value vector, 3.2 updating all vector weights in a vector space in a batched mode and 3.3 updating the inverse document frequency initial value vector are executed. According to the method, the personal big data management hierarchy concept vectorizing incrementation calculation method is achieved, the concept vectors in the concept space can be rapidly adjusted, and the execution efficiency is improved.

Description

technical field [0001] The present invention relates to the management, organization, query and retrieval technology of personal big data, in particular to a vector space model-based hierarchical concept vectorization method and its incremental calculation method. Background technique [0002] With the development of information technology, personal data has exploded, including personal documents (text, images, voice), emails, health data, personal mobile phone contact information (WeChat, QQ), Internet data, etc., which have entered personal big data ( Personal big data) era; the development of wearable devices will further intensify the growth of data, people can record what they hear and see, and collect physiological health data throughout the day. How to manage and organize personal big data, through simple operations, can always get accurate, suitable, complete and high-quality information in the right place, is a goal of the personal information management system. Ho...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/3347
Inventor 杨良怀汪庆顺庄慧范玉雷龚卫华方文菲
Owner ZHEJIANG UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products