Data extraction method and apparatus

A data extraction technology, applied in the field of data processing, that addresses the low efficiency of manual data extraction and thereby reduces processing time and improves efficiency.

Inactive Publication Date: 2015-11-11
SHIHEZI UNIVERSITY
4 Cites, 22 Cited by

AI Technical Summary

Problems solved by technology

[0005] The invention provides a data extraction method and device, wh



Examples


Embodiment Construction

[0025] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0026] To address the low efficiency of manually extracting summary information in the prior art, an embodiment of the present invention provides a data extraction method. As shown in Figure 1, the method includes:

[0027] 101. Preprocess the data to obtain first data at an earlier position and second data at a later position.

[0028] In this embodiment, the data to be preprocessed includes data...
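
Step 101 can be illustrated with a minimal sketch. The text available here does not specify what the preprocessing does, so everything below is an assumption: the records are treated as plain strings, `normalize` is a hypothetical cleanup step, and `ordered_pairs` is a hypothetical helper that pairs each record with every record that follows it.

```python
from itertools import combinations


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace (an illustrative normalization only)."""
    return " ".join(text.lower().split())


def ordered_pairs(records: list[str]) -> list[tuple[str, str]]:
    """Return (first, second) pairs where first precedes second in the input."""
    cleaned = [normalize(r) for r in records]
    # combinations() preserves input order, so the first element of each
    # pair always comes from an earlier position than the second.
    return list(combinations(cleaned, 2))


pairs = ordered_pairs(["  Record A ", "record  B", "RECORD c"])
```

Each tuple holds the earlier item first, which is the ordering the later steps rely on when deciding which of two similar items to keep.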


Abstract

The present invention discloses a data extraction method and apparatus, relates to the field of data processing, and addresses the low efficiency of manual data extraction. The method comprises: preprocessing the data to obtain first data at an earlier position and second data at a later position; calculating the similarity between the first data and the second data; and, if the similarity meets a preset similarity threshold, deleting the later second data and keeping the earlier first data to obtain summary information. The method and apparatus are suited to extracting large amounts of festival data in different formats.
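
The steps in the abstract can be sketched roughly as follows. This is not the patented implementation: the abstract does not name a similarity measure, so `difflib.SequenceMatcher` is used here as a stand-in, the threshold value of 0.8 is an arbitrary assumption, "meets the threshold" is read as "greater than or equal to", and the sample records are invented for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a ratio in [0, 1]; difflib is a stand-in similarity measure."""
    return SequenceMatcher(None, a, b).ratio()


def extract_summary(records: list[str], threshold: float = 0.8) -> list[str]:
    """Keep a record only if no earlier kept record is too similar to it."""
    kept: list[str] = []
    for candidate in records:  # records are visited in positional order
        if any(similarity(first, candidate) >= threshold for first in kept):
            continue  # the later ("second") record is dropped
        kept.append(candidate)  # the earlier ("first") record survives
    return kept


# Hypothetical sample records; the second is a near-duplicate of the first.
records = [
    "Spring Festival falls on the first day of the lunar year.",
    "Spring Festival falls on the first day of the lunar year!",
    "Mooncakes are eaten in autumn.",
]
summary = extract_summary(records)
```

In this sketch the near-duplicate second record is discarded and the first and third records remain as the summary. Comparing each candidate only against records already kept is one possible design choice; the source may compare pairs differently.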

Description

Technical field

[0001] The invention relates to the field of data processing, in particular to a data extraction method and device.

Background

[0002] Nowadays, there are many types of data, and many of them can be collected in the course of digital processing. A given type of event or phenomenon can be recorded and collected in many ways, such as text descriptions, photography, filming, telephone recordings, SMS, WeChat, blogs, email, TV stations, and radio stations. Such data is characterized by diverse data types, diverse data sources, complex data structures, and huge data volume.

[0003] Faced with such diverse and voluminous data, when some kind of data extraction or data statistics is required, the current mainstream approach is to compare data similarity, retain one item among those that are too similar or even identical, and eliminate the duplicates, to achieve the purpose of data deduplica...


Application Information

IPC(8): G06F17/30
CPC: G06F16/9535
Inventor: 郑亮, 陈根方
Owner: SHIHEZI UNIVERSITY