Unlock instant, AI-driven research and patent intelligence for your innovation.

Method and system for processing unstructured data

A technology of unstructured data and data, applied in the field of big data processing, can solve the problems of poor flexibility of unstructured data analysis and processing methods, and achieve the effect of improving query efficiency, utilization efficiency and flexibility

Pending Publication Date: 2019-11-12
深圳百胜扬工业电子商务平台发展有限公司
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to solve the problem that the flexibility of existing unstructured data analysis and processing methods is relatively poor, so as to improve the query efficiency and utilization efficiency of unstructured data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for processing unstructured data
  • Method and system for processing unstructured data
  • Method and system for processing unstructured data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] The technical solutions of the present invention will be clearly and completely described below in conjunction with the accompanying drawings. Apparently, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0052] Unstructured data refers to data that is inconvenient to be represented by two-dimensional logical tables of the database. Unstructured data is text information generated by computers or people. and columns), which are not easy to be directly understood and utilized by computer programs. Unstructured data includes text in all formats, pictures, various reports, images, audio and video information, etc. Unstructured data may be obtained from various industries in various ways.

[0053]In the embodiment ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and system for processing unstructured data. The method comprises the steps of obtaining unstructured data; extracting keywords from the unstructured data according toa preset analysis rule; judging whether the keywords exist in a keyword library or not, and if the keywords do not exist in the keyword library, adding the keywords into the keyword library; carryingout normalization processing on the unstructured data to obtain normalized data with a uniform format, and storing the normalized data in a normalized numerical library corresponding to the keyword library; and performing format conversion on the normalized data according to user requirements to obtain format-converted data, and outputting the format-converted data. According to the method, the flexibility of analyzing and processing the unstructured data is improved by continuously supplementing and perfecting the keyword library in the database, and the query efficiency and the utilizationefficiency of the unstructured data are improved by normalizing the unstructured data.

Description

technical field [0001] The invention relates to the field of big data processing, in particular to a method and system for unstructured data processing. Background technique [0002] Big data generally has the characteristics of large data volume, high discreteness, high data noise, complex types, and multiple data sources. The pre-processing of big data is very important. If there are problems in the early processing of big data, it will directly lead to problems in the utilization efficiency and data value of the later data; if some data is not structured, it will not be possible to effectively store and analyze data, and it will not be able to fully improve Data availability and utilization. [0003] Specifically, the data types of big data are generally divided into structured data, semi-structured data and unstructured data. Among them, unstructured data has now become the main part of big data, and with the rapid development of big data technology , the amount of uns...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/31G06F16/35
CPCG06F16/31G06F16/35
Inventor 陈万林
Owner 深圳百胜扬工业电子商务平台发展有限公司