Multi-source data aggregation sampling strategy based on big data environment
A multi-source data aggregation sampling strategy, applied in the field of big data technology, addresses problems such as increased sample noise, a higher proportion of redundant or missing samples, and the lack of fusion and cross-validation across multi-source and multi-form data, with the effect of reducing interference.
Embodiment Construction
[0037] In order to make the objects and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.
[0038] The embodiment of the present invention provides a multi-source data aggregation sampling strategy based on a big data environment, including the following steps:
[0039] Preparation stage: Input the initial data sets from multiple sources, uniformly set the encoding of these data sets to GBK, and use the ID attribute in the first column of each file to identify and distinguish the rows, so as to avoid the problem of repeated reading in the experiment. The initial data set includes at least social media, news platforms, special websites, patent websites, and talent recruitment data resources about business objecti...
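The preparation stage can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes each source is a CSV file without a header row, that sources not yet in GBK are UTF-8, and that "avoiding repeated reading" means skipping rows whose first-column ID has already been seen. The function names `reencode_to_gbk` and `read_unique_rows` are hypothetical.

```python
import csv
from pathlib import Path


def reencode_to_gbk(src: Path, dst: Path, src_encoding: str = "utf-8") -> None:
    """Rewrite a text file as GBK (assumed source encoding: UTF-8).

    Characters that GBK cannot represent are replaced with '?'.
    """
    text = src.read_text(encoding=src_encoding)
    dst.write_text(text, encoding="gbk", errors="replace")


def read_unique_rows(paths):
    """Read GBK-encoded CSV files, keeping the first row seen for each ID.

    The ID is taken from the first column of every row. This is one
    possible reading of the de-duplication described in the text.
    """
    seen_ids = set()
    rows = []
    for path in paths:
        with open(path, newline="", encoding="gbk") as handle:
            for row in csv.reader(handle):
                if not row:
                    continue  # skip blank lines
                row_id = row[0]
                if row_id in seen_ids:
                    continue  # this ID was already read from an earlier row
                seen_ids.add(row_id)
                rows.append(row)
    return rows
```

Keeping the first occurrence of each ID is a design choice made for this sketch; a real pipeline might instead merge or cross-check duplicate rows coming from different sources.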