XML information extraction method based on human-computer interaction, storage medium and electronic equipment

An information extraction and human-computer interaction technology, applied in digital data information retrieval, electronic digital data processing, input/output processes of data processing, and related fields, with the effect of improving production efficiency.

Pending Publication Date: 2022-05-24
上海森亿医疗科技有限公司

AI Technical Summary

Problems solved by technology

[0005] In view of the above shortcomings of the prior art, the purpose of the present invention is to provide an XML information extraction method based on human-computer interaction, a storage medium and electronic equipment. These address two problems of the prior art: it cannot avoid the learning cost of the related technologies, and it cannot minimize the manpower and material resources spent on extracting information from XML data.

Examples

Embodiment Construction

[0030] The embodiments of the present invention are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the contents disclosed in this specification. The present invention can also be implemented or applied through other specific embodiments, and the details in this specification can be modified or changed based on different viewpoints and applications without departing from the spirit of the present invention. It should be noted that the following embodiments, and the features in those embodiments, may be combined with each other provided they do not conflict.

[0031] It should be noted that the diagrams provided in the following embodiments only illustrate the basic concept of the present invention schematically, so the diagrams show only the components related to the present invention rather than the number, shape and dimension ...

Abstract

The invention provides an XML (Extensible Markup Language) information extraction method based on human-computer interaction, a storage medium and electronic equipment. The method comprises the following steps: acquiring field information in an XML file; generating information extraction rules according to the field information; deduplicating the information extraction rules according to the path of the field information; establishing, based on a field labeling operation performed by a user, a mapping relation from the information extraction rules to key fields; and iteratively updating the information extraction rules by using the mapping relation. According to the abstract, the method minimizes the cost of extracting information from XML data and improves production efficiency.
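The steps listed in the abstract can be pictured with a minimal Python sketch. It is not the patented implementation: the sample XML, the function names (`collect_fields`, `generate_rules`, `apply_labels`, `extract`) and the rule layout are all assumptions made for illustration, and the user's labeling operation is simulated with a plain dictionary.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample document; not taken from the patent.
SAMPLE_XML = """
<record>
  <patient>
    <name>Alice</name>
    <visit><date>2021-01-02</date></visit>
    <visit><date>2021-03-04</date></visit>
  </patient>
</record>
"""


def collect_fields(root):
    """Step 1: return (path, text) for every leaf element that carries text."""
    fields = []

    def walk(node, path):
        children = list(node)
        text = (node.text or "").strip()
        if not children and text:
            fields.append((path, text))
        for child in children:
            walk(child, f"{path}/{child.tag}")

    walk(root, f"/{root.tag}")
    return fields


def generate_rules(fields):
    """Steps 2-3: build one rule per field, deduplicated by field path."""
    rules = {}
    for path, text in fields:
        # setdefault keeps the first rule seen for a path and drops repeats.
        rules.setdefault(path, {"path": path, "sample": text, "key_field": None})
    return rules


def apply_labels(rules, labels):
    """Steps 4-5: store the user's path -> key field labels in the rules."""
    for path, key_field in labels.items():
        if path in rules:
            rules[path]["key_field"] = key_field
    return rules


def extract(root, rules):
    """Run every labeled rule against the document."""
    result = {}
    prefix_len = len("/" + root.tag)
    for rule in rules.values():
        if rule["key_field"] is None:
            continue  # unlabeled rules are ignored
        relative = "." + rule["path"][prefix_len:]  # ElementTree paths are root-relative
        result[rule["key_field"]] = [
            e.text.strip() for e in root.findall(relative) if e.text
        ]
    return result


if __name__ == "__main__":
    root = ET.fromstring(SAMPLE_XML.strip())
    rules = generate_rules(collect_fields(root))
    # Simulated user labeling: the user marks the visit date as a key field.
    rules = apply_labels(rules, {"/record/patient/visit/date": "visit_date"})
    print(extract(root, rules))
```

In this sketch the two `date` elements share one path, so deduplication leaves a single rule for them. Calling `apply_labels` again with further labels stands in for the iterative update described in the abstract.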

Description

Technical field

[0001] The invention belongs to the technical field of information extraction, and relates to an information extraction method, in particular to a human-computer interaction-based XML information extraction method, a storage medium and an electronic device.

Background art

[0002] At present, in the process of data governance, extracting information from data in XML (eXtensible Markup Language) format requires manually setting rules such as XPath expressions and regular expressions. However, there are often a large number of information fields to be extracted, and the specific rules for extracting different fields also differ. Furthermore, rule systems such as XPath and regular expressions have a certain learning cost. How to quickly familiarize operators with rule systems such as XPath and regular expressions, and then effectively use them in the data governance field, has become a difficult point in the data...
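To make the manual approach described in the background concrete, here is a small Python sketch of a single hand-written rule that combines an XPath-style path with a regular expression. The document, the `WBC` field and the function name `extract_wbc` are hypothetical and serve only as an illustration; under the manual approach an operator would have to write one such rule per field.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical input document; not taken from the patent.
DOC = "<report><exam><result>WBC: 6.2 10^9/L</result></exam></report>"


def extract_wbc(xml_text):
    """Hand-written rule: locate the node by path, then parse its text by regex."""
    root = ET.fromstring(xml_text)
    node = root.find("./exam/result")  # XPath-style path written by the operator
    if node is None or node.text is None:
        return None
    match = re.search(r"WBC:\s*(\d+(?:\.\d+)?)", node.text)
    return float(match.group(1)) if match else None


if __name__ == "__main__":
    print(extract_wbc(DOC))
```

Both the path and the pattern must be written and maintained by hand for each field, which illustrates the learning and labor cost the background section refers to.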

Application Information

IPC(8): G06F16/838, G06F16/84, G06F40/205, G06F3/0481
CPC: G06F16/84, G06F16/838, G06F40/205, G06F3/0481
Inventors: 张少典, 马汉东, 沈子浩, 朱珉, 薛颜波
Owner: 上海森亿医疗科技有限公司