GCN-based text classification method
A text classification technology, applied in text database clustering/classification, neural learning methods, unstructured text data retrieval, and related fields, that addresses the inability of existing approaches to model sequence information because they do not take word order into account.
Embodiment
[0056] The present invention provides a GCN-based text classification method, comprising:
[0057] Step 1. Write a Python script that uses the Beautiful Soup library (an HTML and XML parsing library for Python) to extract the title, body text, publication time, article category (if present; the category is the one assigned by the author), and other data from CSDN blog pages. Crawling is distributed across multiple servers that fetch site data simultaneously, which speeds up collection. In short, crawler technology is used to collect content mainly in the Java, Python, front-end, database, and other categories from CSDN blogs and to build a text classification corpus. The corpus contains N samples in total, and each sample contains a title and a passage of body text.
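The extraction part of Step 1 might look roughly like the following minimal sketch. The HTML snippet, the CSS class names (`post-title`, `post-body`, `post-time`, `post-category`), and the helper names `parse_post` and `_text` are all hypothetical; the source does not describe the actual CSDN page structure or the script itself, and network fetching and distributed crawling are left out.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for a downloaded blog page.
# The real CSDN page structure is not given in the source.
SAMPLE_HTML = """
<html><body>
  <h1 class="post-title">Intro to Python decorators</h1>
  <span class="post-time">2020-01-15 10:30:00</span>
  <a class="post-category">python</a>
  <div class="post-body">
    <p>Decorators wrap a function.</p>
    <p>They return a new callable.</p>
  </div>
</body></html>
"""


def _text(soup, selector):
    """Return the stripped text of the first match, or None if absent."""
    node = soup.select_one(selector)
    return node.get_text(separator=" ", strip=True) if node else None


def parse_post(html):
    """Extract the fields named in Step 1 from one page (assumed selectors)."""
    soup = BeautifulSoup(html, "html.parser")
    return {
        "title": _text(soup, "h1.post-title"),
        "text": _text(soup, "div.post-body"),
        "published": _text(soup, "span.post-time"),
        # The category is optional, so a missing node yields None.
        "category": _text(soup, "a.post-category"),
    }


sample = parse_post(SAMPLE_HTML)
```

Each returned dictionary corresponds to one corpus sample; collecting N of them would give the corpus described above.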
[0058] Step 2. Preprocess the corpus built in Step 1. The preprocessing is as follows: load the dictionary through the jieba word segmentation component, and perform wo...