Unlock instant, AI-driven research and patent intelligence for your innovation.

A data extraction method, device and medium

A data extraction and data technology, applied in other database retrieval, special data processing applications, other database clustering/classification, etc. The ability to quickly locate and reduce the effect of missing lift ratio

Active Publication Date: 2022-05-17
XIAMEN MEIYA PICO INFORMATION
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The way of data matching can only be matched through a single template or regular expression, which cannot fully improve the valuable data in complex feature scenarios
[0005] The range of data matching can only be matched for each row or raw data read in a specified range, and cannot effectively extract valuable data distributed in different ranges of raw data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A data extraction method, device and medium
  • A data extraction method, device and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, rather than to limit the invention. It should also be noted that, for the convenience of description, only the parts related to the related invention are shown in the drawings.

[0045] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0046] figure 1 A data extraction method of the present invention is shown, the method extracts data based on a complex feature adaptive matching method and a variable sliding window method, and the method includes:

[0047] Clustering step S101, clustering after obtaining ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a data extraction method, device and storage medium, and extracts data based on a complex feature adaptive matching method and a variable sliding window method. The method first obtains massive data from multiple data sources and then performs clustering , and set a unique data label for each category of data after clustering; then judge whether the category data is a known type of data according to the data label, and if so, use the complex feature adaptive matching method for the category data Perform feature extraction, if not, use the variable sliding window method to perform feature extraction on the category data, and save the result in the database. The present invention accurately and quickly extracts valuable information in known types of data through a complex feature adaptive matching algorithm, and uses a variable sliding window algorithm to perform feature cycle comparisons on unknown types of original data, reducing the proportion of missed extraction of valuable data , to improve the ability of big data analysis and rapid positioning of core clues.

Description

technical field [0001] The invention relates to the technical field of computer data processing, in particular to a data extraction method, device and storage medium. Background technique [0002] With the rapid development of mobile terminals, society has entered the era of big data. Massive data grows geometrically. Each piece of data may contain a lot of valuable information, such as virtual identity account number, ID number, mobile phone number, bank card number, etc. Information, how to quickly and comprehensively extract valuable information when massive data is stored in the database, has become the core point of improving the value of big data utilization. [0003] Due to the large data volume and complex structure of the original data, the existing data extraction methods currently on the market mainly use templates or regular expressions to extract valuable data. These technologies have the following shortcomings: [0004] The way of data matching can only be mat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/906G06F16/90
CPCG06F16/906G06F16/90
Inventor 王海滨王存远周成祖林文楷鄢小征朱海勇
Owner XIAMEN MEIYA PICO INFORMATION