Text structuring technology based on sliding window and random discrete sampling
A discrete sampling and sliding window technology, applied in the field of natural language processing and deep learning, can solve the problems of short text and unclear semantic representation, and achieve the effect of improving semantic representation and classification accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0025] The actual application environment of the present invention is aimed at short text classification, and the present invention will be further described in detail below in conjunction with the accompanying drawings.
[0026] When the present invention is implemented, such as figure 1 The following steps are shown:
[0027] S1: Input the text that needs to be classified, first perform word segmentation processing on the text, then perform word vector training on the word through Word2Vec, and then add word position information to obtain a new word vector;
[0028] S2: After obtaining the text matrix composed of word vectors, use the sliding window method to obtain multiple subsequences with close contexts to form a new text matrix;
[0029] S3: Use random discrete sampling to obtain multiple subsequences with long context distances but can enhance semantics to form a new text matrix;
[0030] S4: Input the matrices obtained by S2 and S3 to the Encoder layer belonging to ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


