Long text news automatic labeling method based on pre-training
A long-text, pre-training technology, used in natural language data processing, special data processing applications, network data retrieval, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
example 1
[0031] Example 1 The present invention is tested by the long text news collected by oneself
[0032] This data set is a data set composed of 90,000 news long texts. It is a data set for Chinese news classification, including financial, real estate, education, technology, military, automobile, sports, games, and entertainment data.
[0033] The present invention selects the Bert_CNN model as the basic model of the text representation model, and uses three indicators to evaluate its performance, which are completeness (completeness) Rand index MI (MRand index), mutual information AMI (MutualInformation based scores), and at the same time with 3 The three existing methods are compared, namely bertRCNN, bertRNN, and bert. The three existing methods all run under their respective optimal parameters. The relevant parameters of the method of the present invention are set as follows: the number of epochs is 5, the size of the mini-batch is 128, the learning rate is 0.00005, and the dr...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


