Unlock instant, AI-driven research and patent intelligence for your innovation.

Automatic text information extraction method

A technology of automatic extraction and text information, applied in the computer field, can solve problems such as time-consuming and labor-intensive, and achieve the effect of saving manpower

Pending Publication Date: 2021-04-02
WUHAN UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to solve the above technical problems, the present invention provides a method for automatic text information extraction, which is used to solve the problem of automatic extraction of subject matter parameter information in bidding texts, to replace the current time-consuming and labor-intensive manual extraction method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Automatic text information extraction method
  • Automatic text information extraction method
  • Automatic text information extraction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] In order to facilitate those of ordinary skill in the art to understand and implement the present invention, the present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the implementation examples described here are only used to illustrate and explain the present invention, and are not intended to limit this invention.

[0053] In this embodiment, the present invention is further described by automatically extracting target object parameter information oriented to the bidding text.

[0054] Bidding business is an important task for enterprises to carry out project management, and bidding documents have relatively standardized writing requirements and text content. Therefore, if the bidding document text is used as corpus for research, the management, application, feedback, and update iteration of standard bidding documents can be realized. These functions can significantly im...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text information automatic extraction method, and aims to solve the problems that parameter information of bid invitation document subject matter in the prior art is extracted manually, a large amount of labor and time are consumed, and time and labor are consumed. The method comprises the steps: automatically extracting parameter information of a bid inviting text through the natural language processing technology, designing a bid document text structuralization, extracting subject matter parameter information and report extraction system, wherein the bid document text structuralization comprises the steps: extracting the bookmark information through a pypdf2, recognizing a pdf bid document text through a pdflumber, and cleaning the text through a regular rule; and performing structured analysis processing on the text by utilizing rule matching. In the subject matter parameter information extraction, the technical parameter information of the subject matter in the structured bidding document text is accurately identified and extracted by using a regularization technology. And finally, an extraction report is established by utilizing the information in theprocess, and the whole extraction condition is visually reflected.

Description

technical field [0001] The invention belongs to the technical field of computers and relates to a method for automatically extracting text information, in particular to a method for automatically extracting object parameter information oriented to bidding texts. Background technique [0002] With the continuous development of intelligence and automation of information technology, it has brought a huge impact and convenience to people's lives. It can automatically convert text into pictures, and can also convert pictures into text, becoming more and more intelligent. , more and more convenient and convenient; but for some specific fields, the required information is specific, and the existing technology is difficult to extract information in a targeted manner, such as the automatic extraction of target parameter information for bidding texts. [0003] Bidding documents are a concentrated expression of procurement needs, and the quality of bidding documents directly determines...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/903G06F16/907G06F16/9038G06F16/11
CPCG06F16/90344G06F16/907G06F16/9038G06F16/116
Inventor 刘金硕王晨阳邓娟黄朔刘宁唐浩洲
Owner WUHAN UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More