Unlock instant, AI-driven research and patent intelligence for your innovation.

Epidemic news information extraction method and system

A technology of information extraction and news, applied in the direction of digital data information retrieval, special data processing applications, instruments, etc., can solve problems such as interference of sight, interference of news text information, etc.

Active Publication Date: 2020-11-20
SOUTH CHINA NORMAL UNIVERSITY
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Internet news webpage information is an important source of information for people, but in the face of massive webpage information, it is often difficult for people to quickly determine and obtain the content they need. Links, script programs, etc. These information greatly interfere with people's sight, which interferes with people's access to news text information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Epidemic news information extraction method and system
  • Epidemic news information extraction method and system
  • Epidemic news information extraction method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0096] Attached as follows Figures 1 to 12 , to further describe the application scheme:

[0097] The invention uses a related natural language processing method to analyze the epidemic notification information, and constructs an epidemic news information extraction system. With the help of the existing NLP toolkit and Baidu map development platform, we combined the text characteristics of the epidemic news, designed relevant rules, and extracted information from three aspects of the epidemic news: patient’s route information, residence / habitual residence information, and transportation information. Finally, we present this system in the form of a website, through which users can access the system and use related functions.

[0098] The specific process is as follows:

[0099] 1. Data crawling

[0100] Since the web pages related to the epidemic notification use dynamic page technology, the text content in the web page cannot be obtained directly by requesting ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an epidemic news information extraction method, which comprises the following steps of: extracting related information in a news text in a specific scene, namely an epidemic news webpage, converting the related information into structured data, and then storing and visually displaying the data; the method is characterized by comprising the following steps: a data crawling step; a data processing step; a path information extraction step; a residence place / permanent residence place information extraction step; a traffic taking information extraction step; outputting and displaying the information; loading a webpage through a crawler tool to obtain a news text; a sentence splicing and text segmentation algorithm is constructed, epidemic situation text characteristics are combined, entity name recognition, map API and other tools are comprehensively applied, three extraction modules of path information, residence / permanent residence information and traffic riding information are constructed, finally, a system is deployed into a user-friendly webpage, and convenience is provided for a user to independently extract information.

Description

technical field [0001] The invention relates to Internet information collection technology, in particular to a method and system for extracting epidemic news information. Background technique [0002] Internet news webpage information is an important source of information for people, but in the face of massive webpage information, it is often difficult for people to quickly determine and obtain the content they need. Links, script programs, etc. These information greatly interfere with people's sight, which interferes with people's access to news text information. In this regard, effective data cleaning methods are needed to filter noise information on news web pages to obtain relevant text information. Contents of the invention [0003] In order to meet the demand for extracting news text information, the present invention proposes a method and system for extracting epidemic news information, which extracts relevant information in the news text for the specific scene of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/951G06F16/9537G06F16/958G06F40/284G06F40/289
CPCG06F16/951G06F16/9537G06F16/986G06F40/284G06F40/289
Inventor 陈佳珊黄景浩杨坦
Owner SOUTH CHINA NORMAL UNIVERSITY