Unstructured data processing method and device

A technology for converting unstructured data into structured data, applied in electrical digital data processing, special data processing applications, instruments, etc. It addresses the problems of inconvenient query statistics, difficult modification, and storage difficulties, with the effect of facilitating query statistics and saving query time and computing space.

Inactive Publication Date: 2014-12-24
BEIJING YOUTEJIE INFORMATION TECH
Cites: 3. Cited by: 23.

AI Technical Summary

Problems solved by technology

Semi-structured data can be organized and saved in XML format, with different types of information stored in different XML nodes, but query efficiency is relatively low, and XPath (XML Path Language) is needed to complete query statistics.
In addition, storing unstructured data in a database requires the schema (that is, the format of the database tables) to be defined in advance. Once defined, the schema is difficult to modify, which results in poor flexibility and an inability to adapt to the variety of current unstructured data.
[0005] The unstructured data addressed here has these same characteristics, and therefore also has the aforementioned problems of inconvenient query statistics and difficult storage.
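
To illustrate the XML approach described above, the following is a minimal sketch of a query statistic computed over XML nodes with an XPath expression. The document layout, element names, and values are hypothetical and are not taken from the patent; Python's standard `xml.etree.ElementTree` supports only a limited XPath subset, which is enough here.

```python
import xml.etree.ElementTree as ET

# Hypothetical XML layout: each record stores different kinds of
# information in different child nodes.
DOC = """<events>
  <event><app>trade</app><level>ERROR</level><msg>timeout</msg></event>
  <event><app>trade</app><level>INFO</level><msg>ok</msg></event>
  <event><app>web</app><level>ERROR</level><msg>500</msg></event>
</events>"""

root = ET.fromstring(DOC)

# XPath predicate: select <event> elements whose <level> child has text ERROR.
errors = root.findall("./event[level='ERROR']")

# The statistic itself (errors per application) still has to be computed
# by walking the selected nodes.
error_count_by_app = {}
for event in errors:
    app = event.findtext("app")
    error_count_by_app[app] = error_count_by_app.get(app, 0) + 1
```

Every such statistic needs its own path expression and a traversal of the tree, which is the query overhead the text refers to.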


Examples


Embodiment 1

[0071] In the first embodiment, user-defined parsing rules are used to extract the key fields in the unstructured data. As shown in Figure 2, the method includes:

[0072] Step S201: search for user-defined parsing rules according to the information of the application program that generates the unstructured data (this is one implementation of the aforementioned step S101). If a user-defined parsing rule is found, continue with step S202; if none is found, continue with step S203.

[0073] The application program information may be an identifier of the application program, such as the App Name.

[0074] Step S202: use the user-defined parsing rules to extract the key fields in the unstructured data, then continue with step S205.

[0075] In one embodiment, step S201 may be implemented as: searching for user-defined parsing rules pre-configured for unstructured data according to the application ...
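
The visible steps S201 and S202 can be sketched roughly as follows. This is a minimal illustration, not the patent's implementation: it assumes a parsing rule is a regular expression with named groups, and the rule registry, the application name `nginx`, the sample text line, and all function names are hypothetical. Step S203 is not described in this excerpt, so the sketch only marks that branch.

```python
import re
from typing import Optional

# Hypothetical registry: application name -> user-defined parsing rule.
# Here a rule is assumed to be a regex whose named groups are the key fields.
USER_RULES = {
    "nginx": re.compile(
        r'^(?P<client_ip>\S+) .*?"(?P<method>[A-Z]+) (?P<path>\S+)'
    ),
}


def find_user_rule(app_name: str) -> Optional[re.Pattern]:
    """Step S201: look up a user-defined rule by application information."""
    return USER_RULES.get(app_name)


def extract_key_fields(rule: re.Pattern, raw: str) -> dict:
    """Step S202: apply the rule and return the key fields it captures."""
    match = rule.search(raw)
    return match.groupdict() if match else {}


def process(app_name: str, raw: str) -> dict:
    rule = find_user_rule(app_name)
    if rule is None:
        # Step S203 is not shown in this excerpt; the raw text is
        # returned untouched as a placeholder for that branch.
        return {"_unparsed": raw}
    return extract_key_fields(rule, raw)


sample = '10.0.0.1 - - [24/Dec/2014:10:00:00 +0800] "GET /index.html HTTP/1.1" 200 512'
record = process("nginx", sample)
```

Keying the registry on the application name mirrors paragraph [0073], where the App Name identifies which rule applies.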



Abstract

The invention provides an unstructured data processing method and device for converting unstructured data into structured data. The method comprises the following steps: acquiring a parsing rule for extracting the key fields in unstructured data; extracting the key fields in the unstructured data by using the parsing rule; and naming each extracted key field as a preset parameter and assigning the extracted key field to that parameter as its value, thereby generating structured data. With this technical scheme, unstructured data can be converted into structured data, which makes query and statistics more convenient and saves computing space and query time.
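
The naming-and-assignment step in the abstract can be sketched as follows, under stated assumptions: the parsing rule is taken to be a regular expression with one capture group per key field, and the preset parameter names, the sample lines, and the function name `to_structured` are all hypothetical.

```python
import re

# Hypothetical parsing rule: one capture group per key field, in order.
RULE = re.compile(r"(\d{4}-\d{2}-\d{2}) (\w+) order=(\d+) amount=([\d.]+)")

# Hypothetical preset parameter names, one per capture group.
PRESET_PARAMS = ("date", "action", "order_id", "amount")


def to_structured(raw: str) -> dict:
    """Name each extracted key field with its preset parameter."""
    match = RULE.search(raw)
    if match is None:
        return {}
    # Each preset parameter is assigned the corresponding extracted value.
    return dict(zip(PRESET_PARAMS, match.groups()))


records = [
    to_structured(line)
    for line in (
        "2014-12-24 BUY order=1001 amount=35.50",
        "2014-12-24 SELL order=1002 amount=12.00",
    )
]

# Once the fields are named, a query is a plain filter on a parameter.
buys = [r for r in records if r["action"] == "BUY"]
```

Because every record exposes the same parameter names, queries and statistics no longer need to re-parse the original text.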

Description

Technical field

[0001] The invention relates to the technical field of unstructured data processing, in particular to an unstructured data processing method and device.

Background

[0002] Today, with the rapid development of information technology, people generate a large amount of digital information in various social and economic activities, the scale of enterprise information technology infrastructure continues to expand, and IT monitoring and operation and maintenance systems are widely used. The data generated by home appliances and by various trading systems (securities trading systems, e-commerce trading systems) is huge in quantity and varied in format, which makes it difficult to utilize.

[0003] Unstructured data is text information generated by computers or humans. Its data does not necessarily follow a standard data structure (such as rows and columns defined by schemas), and is not easy to be dir...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/258
Inventors: 陈军, 梁玫娟
Owner: BEIJING YOUTEJIE INFORMATION TECH