Science and technology news automatic writing system based on deep learning
A technology of deep learning and journalism, applied in neural learning methods, natural language data processing, biological neural network models, etc., can solve the problems of high cost of news reports, improve the generalization learning ability of deep learning, improve collection speed, and high recognition The effect of accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0042] Such as Figure 1 to Figure 4 As shown, a technology news automatic writing system based on deep learning is characterized in that it includes the following modules:
[0043] Web crawler module: This module collects technology channels, technology news, and relevant content of each technology website from various websites, extracts the text of the collected data, and stores them in the database;
[0044] Science and technology news preprocessing module: for the collected news, word segmentation, named entity recognition, entity relationship extraction, syntax analysis, semantic analysis;
[0045] Sci-tech news classification and clustering module: Mainly aiming at the content of sci-tech news, it is dedicated to further refinement, using intelligent classification and clustering technology to carry out detailed classification for sci-tech news. The generative memory model based on deep learning is used to train and learn news content, and finally realize a News generat...
Embodiment 2
[0057] Such as Figure 1 to Figure 4 As shown, in order to make the inner side occupy less, this embodiment has made further improvements on the basis of embodiment 1, specifically: the crf++ is implemented in c++ language, and a large number of stl data structures are applied. After reading the source code in depth Based on the stl-related part of the source code, the code is rewritten in c language, which can be expressed as: in the tagger.cpp source code file, use the vector structure:
[0058]
[0059] And there is no release of memory after the feature is encoded, the present invention uses char* instead of std::vector>TaggerImpl::x_, and the forced release of memory immediately obtains 10% by experiment and comparison In addition, it is modified for CRF++ and L-BFGS. The L-BFGS algorithm is an improvement of the quasi-Newton algorithm. Its name already tells us that it is based on the improvement of the quasi-Newton method BFGS algorithm. The basic idea of the L-B...
Embodiment 3
[0061] Such as Figure 1 to Figure 4 As shown, the news named entity recognition module uses the crf++ model to train the corpus and identify the names of people, places, organizations, products, professional terms and time of occurrence of news. The automatic news writing system is characterized in that: the automatic news generation module includes a user interaction module and a news generation module.
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com