Feature extraction method suitable for time series data in the field of cultural science and technology fusion

A time series feature extraction technology, applied in text database querying, text database clustering/classification, and unstructured text data retrieval. It addresses problems such as incomplete extraction of data value, small sample sizes, and important information that differs across time granularities.

Pending Publication Date: 2021-04-09
江苏思远集成电路与智能技术研究院有限公司 +1

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide a feature extraction method suitable for time series data in the field of cultural and technological integration. It addresses two problems in the prior art: time series data at different time granularities carries different important information while sample data is scarce, and common time series feature mining algorithms mine time series information from a single perspective only and so cannot fully obtain the value of the data.

Examples

Embodiment 1

[0022] A feature extraction method suitable for time series data in the field of cultural and technological integration, including the following steps:

[0023] Step 1: Obtain time series data from the target database and classify the series by data type to obtain text data and numerical data. Specifically: obtain time series data from the enterprise database or from the public database of the corresponding government; classify the data by data type into numerical time series data and text time series data; for new data, use the sliding window principle to form a new classified data source;

[0024] Step 2: Classify the numerical data by time granularity to obtain macro time series data and micro time series data. After the macro data is standardized, the similarity between the sample and the industry standard data is calculated, and the normalized similarity is used as the basis of the D-S evidence theory input evi...
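The visible parts of Steps 1 and 2 can be sketched as follows. This is a minimal illustration and not the patented implementation: the `Record` type, its `granularity` label, the z-score standardization, and the cosine-based similarity are all assumptions, since the source text does not name the standardization or similarity measure, and the D-S evidence fusion is truncated in the source and therefore left out.

```python
from dataclasses import dataclass
from math import sqrt
from statistics import mean, pstdev
from typing import Union


@dataclass
class Record:
    timestamp: int              # hypothetical integer time index
    value: Union[float, str]    # numeric reading or free text
    granularity: str = "micro"  # hypothetical label: "macro" or "micro"


def split_by_type(records):
    """Step 1: separate numerical records from text records."""
    numeric = [r for r in records if isinstance(r.value, (int, float))]
    text = [r for r in records if isinstance(r.value, str)]
    return numeric, text


def sliding_window(records, size):
    """Keep only the most recent `size` records as the current data source."""
    return sorted(records, key=lambda r: r.timestamp)[-size:]


def zscore(xs):
    """Standardize a series to zero mean and unit variance (assumed method)."""
    mu, sigma = mean(xs), pstdev(xs)
    return [0.0 if sigma == 0 else (x - mu) / sigma for x in xs]


def normalized_similarity(sample, reference):
    """Cosine similarity of the standardized series, rescaled to [0, 1]."""
    a, b = zscore(sample), zscore(reference)
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    cosine = 0.0 if norm == 0 else dot / norm
    return (cosine + 1.0) / 2.0


records = [
    Record(1, 10.0, "macro"),
    Record(2, "new exhibition opens"),
    Record(3, 12.0, "macro"),
    Record(4, 15.0, "macro"),
]
numeric, text = split_by_type(sliding_window(records, size=4))
macro = [r.value for r in numeric if r.granularity == "macro"]
industry_standard = [20.0, 24.0, 30.0]  # made-up reference series
score = normalized_similarity(macro, industry_standard)
```

Here `score` would serve as one piece of normalized evidence; how it enters the D-S combination is not recoverable from the truncated text.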

Abstract

The invention discloses a feature extraction method suitable for time series data in the field of cultural science and technology fusion. The method comprises the following steps: 1, obtaining time series data from a target database and classifying the series by data type to obtain text data and numerical data; 2, classifying the numerical data by time granularity to obtain macroscopic time series data and microscopic time series data. For the text data, a high-frequency vocabulary set is first obtained with a bag-of-words model, and the vocabulary set is then filtered a second time with an improved TF-IDF to obtain hot vocabulary. If new data exists, steps 1-4 are executed again using a sliding window; if no new data exists, the method stops. The method can quickly process and analyze time series data and helps enterprises make strategic decisions.
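The two-stage text filtering described in the abstract can be illustrated roughly as below. The abstract does not say how the TF-IDF is "improved", so this sketch substitutes plain TF-IDF; the whitespace tokenizer, the function names, and the thresholds `min_count` and `min_score` are likewise assumptions made for illustration.

```python
from collections import Counter
from math import log


def high_frequency_vocab(docs, min_count):
    """Bag-of-words pass: keep words whose total count reaches `min_count`."""
    counts = Counter(word for doc in docs for word in doc.lower().split())
    return {w for w, c in counts.items() if c >= min_count}


def tfidf_scores(docs, vocab):
    """Best per-document score for each candidate word, using plain TF-IDF."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    best = {}
    for word in vocab:
        doc_freq = sum(1 for tokens in tokenized if word in tokens)
        idf = log(n_docs / doc_freq)
        for tokens in tokenized:
            if tokens:
                tf = tokens.count(word) / len(tokens)
                best[word] = max(best.get(word, 0.0), tf * idf)
    return best


def hot_vocab(docs, min_count=2, min_score=0.05):
    """Second pass: drop frequent words that carry little TF-IDF weight."""
    candidates = high_frequency_vocab(docs, min_count)
    scores = tfidf_scores(docs, candidates)
    return {w for w, s in scores.items() if s >= min_score}


docs = [
    "the museum opens a digital exhibition",
    "the digital exhibition uses the new sensor",
    "the city funds the museum",
]
hot = hot_vocab(docs)
```

In this toy corpus "the" passes the frequency pass but appears in every document, so its inverse document frequency is zero and the second pass removes it.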

Description

Technical field

[0001] The invention relates to a feature extraction method suitable for time series data in the field of cultural and technological integration.

Background

[0002] With the progress of our country's science and technology and the adjustment of its economic development strategy, the development of emerging industries and enterprises with culture as the core and technology as the means has become a major trend promoted across regions. On the one hand, general enterprises mainly use financial data or user behavior modeling when predicting their own development hotspots, and do not pay special attention to multi-angle mining of time series data. On the other hand, general time series data mining does not account for the characteristics of cultural and technological fusion data, so it does not classify time series data by time granularity before mining. For an industry or enterprise, the amount of information ca...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/33, G06F16/35, G06F40/216, G06K9/62
CPC: G06F16/3344, G06F16/35, G06F40/216, G06F18/24, G06F18/25, G06F18/257
Inventor: 郑培清, 孙莉, 刘瀚炜
Owner: 江苏思远集成电路与智能技术研究院有限公司