Text data statistical analysis system and method based on natural language processing

A technology of natural language processing and text data, which is applied in the field of text data statistical analysis system, can solve problems such as inability to statistically analyze text document data, and achieve the effect of enhancing readability

Pending Publication Date: 2020-09-25
SHANGHAI POWER EQUIP RES INST
View PDF5 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The purpose of the present invention is to overcome the deficiency that the prior art cannot perform statistical analysis of text document data, and provide a text data statistical analysis system and method based on natural language processing

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text data statistical analysis system and method based on natural language processing
  • Text data statistical analysis system and method based on natural language processing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In the following, the present invention will be further explained in conjunction with the accompanying drawings and specific implementation examples, in order to understand its module composition and operation process more clearly, but this cannot limit the protection scope of the present invention.

[0031] Such as figure 1 as shown, figure 1 It is an organization chart of a text data analysis and statistics system based on natural language processing in the present invention. The text data analysis and statistics system based on natural language processing includes a natural language information extraction subsystem, a knowledge map management subsystem, and a data Analysis subsystem and data visualization subsystem, the four subsystems are in charge of different functions.

[0032] The above-mentioned natural language information extraction subsystem includes a corpus preprocessing module and a knowledge extraction module, and the corpus preprocessing module is used...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a text data statistical analysis system based on natural language processing. The system comprises a natural language information extraction subsystem, a knowledge graph management subsystem, a data analysis subsystem and a chart visualization subsystem. The natural language information extraction subsystem comprises a corpus preprocessing module and a knowledge extractionmodule. The knowledge graph management subsystem comprises an ontology construction module and a data configuration module; the data analysis subsystem comprises a problem understanding module, a graph query calculation module and a semantic matching module, and the data visualization subsystem is used for generating and visually displaying a data graph. The invention further provides a data statistical analysis method based on natural language processing. According to the method, a knowledge base is established by utilizing the natural language processing technology and the knowledge graph technology, then text data statistical analysis is realized through the intelligent data analysis and visualization technology, and a text document data analysis result is automatically generated into achart to be visually displayed, so enterprise production and operation are guided.

Description

technical field [0001] The invention relates to a text data statistical analysis system and method based on natural language processing, belonging to the technical field of artificial intelligence. Background technique [0002] In the process of production and operation of an enterprise, there are a large number of text documents carried in the form of natural language, such as technical reports, inspection records, operation logs, etc. The text documents contain a wealth of key data information. Taking the maintenance reports of power plants over the years as an example, "How many types of equipment are mentioned in the report?" How does the time change”, etc., but most of the above-mentioned text data information is not directly counted into digital form, but exists in text form, such as “This maintenance equipment includes steam turbines, boilers, generators”, so in this example The statistical result of the total number of overhauled equipment is 3. Mining similar text ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/289G06F40/30G06F16/26G06F16/36G06N3/04G06N3/08
CPCG06F40/289G06F40/30G06F16/26G06F16/367G06N3/08G06N3/045
Inventor 华志刚郭荣程卫国汪勇沈铭科
Owner SHANGHAI POWER EQUIP RES INST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products