Establishment method of text data classification and classification model for data sharing and exchange
A text data, data-oriented technology, applied in text database clustering/classification, unstructured text data retrieval, electronic digital data processing, etc., to achieve the effect of high degree of automation and high accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0018] A method for establishing a text data classification and classification model for data sharing and exchange, such as figure 1 shown.
[0019] Include the following steps:
[0020] 1) Through the text vectorization technology, the text data is quantified and identified mainly by keywords to form a structured description of the data;
[0021] 1.1) input text set P={P 1 ,P 2 ,...,P t ,...,P M}, where P t is the tth text, and M is the total number of texts. Set the number of keywords for each text extraction to M key , damping factor d, sliding window width ω, iteration stop threshold σ, maximum number of iterations G max ;
[0022] 1.2) Segment each text in the text set P using a word segmentation algorithm and remove stop words;
[0023] 1.3) Utilize the TF-IDF algorithm to calculate the TF-IDF value of each word corresponding in the text set P;
[0024] 1.4) For text P t Perform vectorization, t=1, 2, ..., M;
[0025] 1.4.1) Based on the width ω, the sliding...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


