Supercharge Your Innovation With Domain-Expert AI Agents!

Stock information intelligent extraction method

An extraction method and intelligent technology, which can be used in other database retrieval, network data retrieval, instruments, etc., and can solve the problems of high cost and low efficiency.

Inactive Publication Date: 2019-07-16
武汉优品楚鼎科技有限公司
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of this method is to solve the technical defects, high cost and low efficiency of the current method of manually extracting abstracts for individual stock announcements and research reports, and to design a method that can directly generate customized abstracts quickly and effectively

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Stock information intelligent extraction method
  • Stock information intelligent extraction method
  • Stock information intelligent extraction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The system architecture diagram of this method is shown in figure 1 As shown, the function description of each module is as follows:

[0019] 1: Configure the crawling source URL and crawling rules;

[0020] 2: Crawl announcements according to the configured crawl source URL and crawl rules;

[0021] 3: Use the PDF2HTML open source library to convert the captured announcements into HTML format;

[0022] 4: Clean up redundant tags, styles, etc. in HTML;

[0023] 5: Extract the Table tag in HTML and store it in the form of tableList;

[0024] 6: Extract the plain text information of HTML, and divide it into lists according to the set punctuation marks to store the sentenceList;

[0025] 7: Structure each table, extract the entries and their data in the table, and store them in the form of ;

[0026] 8: According to the preset summary keyword module, extract the data in the tableList according to the keyword and fill the module. For the situation that cannot be extra...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for extracting abstract of individual share announcements and research reports through table extraction and text paragraph similarity. The method comprises; adopting aseparating first and then merging second strategy to separate the form of an announcement or research report from a plain text, carrying out structured processing on the form, carrying out paragraphdivision processing on the plain text, and then extracting keyword index data from the structured form and filling the template by using a predefined abstract template (keyword template); and searching for N top candidate paragraphs most similar to the template from the divided paragraphs as abstract candidate paragraphs, and if keywords cannot be matched in the structured table, searching the most similar paragraphs from the candidate paragraphs as a sub-abstract. According to the method, the accuracy of the abstract is greatly improved, the editing efficiency of an editor is improved, the extraction accuracy is improved through continuous feedback, and finally automation is truly achieved.

Description

technical field [0001] The present invention relates to the field of computer software, in particular to stock-related information, including scenarios of intelligent extraction of information such as announcements issued by listed companies and research reports issued by institutions. Background technique [0002] At present, there are many types of individual stock announcements and research reports, each type of announcement describes different key events, and each type of individual stock announcement is numerous. As an investor, it is urgent to keep abreast of the individual stock announcements disclosed by listed companies and the research reports issued by institutions for their own interests. However, there are many announcements and research reports for each type of stock, and the length is redundant. Investors only want to understand the core events and data (that is, the summary), instead of spending a lot of time and energy downloading and browsing the content o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/951G06F16/957G06F17/27G06Q40/06
CPCG06Q40/06G06F40/289G06F16/951G06F16/9577
Inventor 万雪婷
Owner 武汉优品楚鼎科技有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More