Webpage text classification method based on enhanced capsule network and storage medium

A text classification technology based on capsule networks, applied in network data retrieval, network data indexing, neural learning methods, etc. It addresses the problems of large loss and low overall accuracy, improves robustness and learning ability, and eliminates the vanishing gradient problem.
CN111460818A · Active · Publication Date: 2020-07-28 · CHINESE ACAD OF SURVEYING & MAPPING

Patent Information

Authority / Receiving Office: CN · China
Patent Type: Applications (China)
Current Assignee / Owner: CHINESE ACAD OF SURVEYING & MAPPING
Publication Date: 2020-07-28

Abstract

The invention discloses a webpage text classification method based on an enhanced capsule network, and a storage medium thereof. The method comprises the following steps: crawling webpage text data in a specific field, cleaning and structuring the obtained text data, and finally obtaining an experiment corpus; setting the architecture of an enhanced capsule network, wherein the architecture sequentially comprises a dense convolutional network, a main capsule layer and a digital capsule layer; and training the enhanced capsule network by taking the training data in the training set as its input to obtain a classifier, then verifying the accuracy of the classifier with the test data of the test set. In this method, the dense convolutional network is introduced to extract feature information, so that the features are more discriminative and the learning ability of the model on a data set is improved. The main capsule layer is further encoded by a dynamic routing mechanism, so that the obtained features are more directional and the capsule network is more robust.
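The dynamic routing step between the main capsule layer and the digital capsule layer can be illustrated with a small sketch. The abstract does not give the routing formulas, so the code below assumes the commonly used routing-by-agreement scheme with a squash nonlinearity; the function names (`squash`, `dynamic_routing`), the three routing iterations, and the toy prediction vectors are all illustrative assumptions, not details taken from the patent.

```python
import math

def squash(s):
    """Assumed squash nonlinearity: keeps the direction of s and maps its length into [0, 1)."""
    norm_sq = sum(x * x for x in s)
    if norm_sq == 0.0:
        return [0.0 for _ in s]
    scale = norm_sq / (1.0 + norm_sq) / math.sqrt(norm_sq)
    return [scale * x for x in s]

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(b - m) for b in logits]
    total = sum(exps)
    return [e / total for e in exps]

def length(vec):
    """Euclidean length of a capsule output vector."""
    return math.sqrt(sum(x * x for x in vec))

def dynamic_routing(u_hat, num_iters=3):
    """Route lower-capsule predictions to upper capsules by agreement.

    u_hat[i][j] is the prediction vector that lower capsule i makes for
    upper capsule j. Returns the list of upper-capsule output vectors.
    num_iters=3 is an assumed default, not a value from the source.
    """
    n_in, n_out, dim = len(u_hat), len(u_hat[0]), len(u_hat[0][0])
    b = [[0.0] * n_out for _ in range(n_in)]  # routing logits, start uniform
    v = []
    for _ in range(num_iters):
        # Each lower capsule distributes its output over the upper capsules.
        c = [softmax(row) for row in b]
        v = []
        for j in range(n_out):
            s_j = [sum(c[i][j] * u_hat[i][j][d] for i in range(n_in))
                   for d in range(dim)]
            v.append(squash(s_j))
        # Raise the logit wherever a prediction agrees with the resulting output.
        for i in range(n_in):
            for j in range(n_out):
                b[i][j] += sum(u_hat[i][j][d] * v[j][d] for d in range(dim))
    return v

# Toy input (hypothetical): 3 lower capsules, 2 classes, 2-dimensional vectors.
# Predictions for class 0 agree with each other; those for class 1 conflict.
u_hat = [
    [[1.0, 0.0], [0.5, 0.0]],
    [[0.9, 0.1], [-0.5, 0.0]],
    [[1.0, -0.1], [0.0, 0.5]],
]
v = dynamic_routing(u_hat)
lengths = [length(vec) for vec in v]
predicted = lengths.index(max(lengths))  # longest output vector wins
print(lengths, predicted)
```

In this formulation the length of each output vector acts as the class score, so a class whose lower-level predictions point in the same direction receives a longer vector than one whose predictions cancel out. A full model would produce `u_hat` from learned weight matrices applied to the features of the dense convolutional network, which this sketch leaves out.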

Description

Technical field

[0001] The present invention relates to the technical field of natural language processing, and specifically to a web page text classification method and storage medium based on an enhanced capsule network. The method is particularly suitable for related fields such as social public security incidents.

Background technique

[0002] With the development of Internet technology, the amount of data related to social public security events on the Internet has exploded. Public security incidents are incidents that endanger the life, health, and property of a large number of people (neither everyone nor isolated individuals) and may cause a series of public problems, which in turn can lead to the collapse of the value system and to social disorder. They are usually divided into natural disasters, accidents, public health incidents, and social security incidents. Collecting a large amount of relevant webpages and information of social public security event data from t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; Figure 2 is a flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; Figure 3 is the parameter map of the yarn covering machine