Big Data Formation, Reduction, and Its Impact on Sampling: A survey
Keywords:
Big data, Sampling, Big data reduction, Impact of sampling
Abstract
The rapid growth of data in recent years, characterized by the "6Vs" (Volume, Velocity, Variety, Veracity, Value, and Variability), has ushered in the era of big data. While this data holds great potential for uncovering valuable insights and knowledge, its sheer size presents significant challenges for analysis. This paper surveys two key big data reduction techniques: feature selection and sampling. Feature selection identifies and eliminates irrelevant or redundant features, reducing data dimensionality. Sampling, by contrast, selects a representative subset of data points for analysis. We compare and contrast these techniques, highlighting their strengths and weaknesses, discuss when each approach is most suitable, and suggest the potential benefits of combining them for more efficient big data analysis.
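To make the two reduction techniques concrete, the following is a minimal illustrative sketch, not the method proposed in the paper. It assumes a small in-memory table of numeric rows, uses a simple variance filter as a stand-in for feature selection (many other criteria exist), and uses simple random sampling without replacement as the sampling step. The function names `drop_low_variance_features` and `simple_random_sample`, the threshold, and the toy data are all hypothetical.

```python
import random
from statistics import pvariance


def drop_low_variance_features(rows, threshold=0.0):
    """Feature selection (assumed criterion): keep only the columns whose
    population variance is strictly greater than `threshold`."""
    n_cols = len(rows[0])
    keep = [j for j in range(n_cols)
            if pvariance([row[j] for row in rows]) > threshold]
    return [[row[j] for j in keep] for row in rows], keep


def simple_random_sample(rows, fraction, seed=0):
    """Sampling: draw a simple random sample of the rows, without replacement.
    A fixed seed makes the draw reproducible."""
    k = max(1, round(len(rows) * fraction))
    return random.Random(seed).sample(rows, k)


# Toy data (hypothetical): column 1 is constant, so it carries no information.
data = [[float(i), 1.0, float(i % 3)] for i in range(100)]

# Combine both reductions: fewer columns first, then fewer rows.
reduced, kept = drop_low_variance_features(data)
subset = simple_random_sample(reduced, fraction=0.1)
```

Here feature selection shrinks the table column-wise (the constant column is dropped) and sampling shrinks it row-wise, which is one way the two approaches might be combined.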
Copyright (c) 2025 Mohammed Zayed, Fadl Mutaher Ba-Alwi

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.