Unstructured text processing method and device, computer equipment and storage medium

An unstructured text and unstructured technology, applied in the field of artificial intelligence, can solve the problem that unstructured data cannot be organized and understood, and achieve the effect of rapid processing

Pending Publication Date: 2020-04-21
智器云南京信息科技有限公司
View PDF3 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The purpose of the embodiments of the present invention is to provide a method, device, and computer equipment for processing unstructured text data, so as to solve the problem that unstructured data cannot be organized and understood in the prior art, so as to provide users with better information Acquisition and identification of technical solutions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Unstructured text processing method and device, computer equipment and storage medium
  • Unstructured text processing method and device, computer equipment and storage medium
  • Unstructured text processing method and device, computer equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In order to more clearly illustrate the embodiment of the present invention or the technical solution in the prior art, the specific implementation manner of the present invention will be described below with reference to the accompanying drawings. Obviously, the accompanying drawings in the following description are only some embodiments of the present invention, and those skilled in the art can also obtain other accompanying drawings based on these drawings and obtain other implementations.

[0038] In order to make the drawing concise, each drawing only schematically shows the parts related to the present invention, and they do not represent its actual structure as a product. In addition, to make the drawings concise and easy to understand, in some drawings, only one of the parts with the same structure or function is schematically shown or only one of them is marked. Herein, "a" not only means "only one", but also means "more than one".

[0039] Such as figure 1 ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides an unstructured text data processing method which comprises the steps that format and code conversion is conducted on a text file to be processed, and the text file to be processed comprises unstructured text data; preprocessing the text content of the file subjected to format and code conversion, wherein preprocessing comprises word segmentation, part-of-voice tagging, stopword removal and/or ambiguity elimination of polysemy; performing corresponding knowledge extraction on the preprocessed text content through a knowledge extractor; the method comprises the steps that knowledge obtained through knowledge extraction is subjected to structured conversion, a structured data structure capable of being displayed in a graphical mode is generated, the data structure isrepresented in a predefined serialization format, and the serialization format comprises a file number corresponding to the structured knowledge. According to the embodiment of the invention, knowledge can be extracted from the unstructured text and displayed in a mapping manner, so that key intelligence elements in the file can be extracted, and the unstructured text file can be quickly processed.

Description

technical field [0001] The present invention relates to the technical field of artificial intelligence, in particular to a method, device, computer equipment, and storage medium for processing unstructured text data. Background technique [0002] Structured data is identifiable data that can be organized into row and column structures, that is, data that exists in a fixed format in a record file. Structured data usually includes data content and data model. Typical examples of structured data are various relational databases. [0003] Unstructured data refers to data information that does not have a predefined data model or is not organized in a predefined way, generally refers to text data, and unstructured data may have a lot of information such as time and numbers. Compared with traditional structured data files in databases or marked files, due to the non-characteristic and ambiguity of unstructured data, unstructured data will be more difficult to understand and ident...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/33G06F16/332G06F16/36
CPCG06F16/3329G06F16/3335G06F16/3344G06F16/367
Inventor 王海波李志保
Owner 智器云南京信息科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products