Big Data Formation, Reduction, and Its Impact on Sampling: A survey

https://doi.org/10.59628/jast.v3i1.1516

Authors

  • Mohammed Zayed Department of Computer Science, Faculty of Information Technology and computer , University of Sana’a, Sana’a, Yemen,
  • Fadl Mutaher Ba-Alwi Department of Information System, Faculty of Information Technology and computer , University of Sana’a, Sana’a, Yemen

Keywords:

Big data, Sampling, Big data reduction, Impact of sampling

Abstract

The emergence of data in recent years, characterized by the "6Vs" (Volume, Velocity, Variety, Veracity, Value, and Variability), has started the era of big data. While this data holds great potential for uncovering valuable insights and knowledge, its size presents significant challenges for analysis. This paper explores two critical big data reduction techniques: feature selection and sampling. Feature selection focuses on identifying and eliminating irrelevant or redundant features, reducing data dimensionality. Sampling, on the other hand, selects a representative subset of data points for analysis. We compare and contrast these techniques, highlighting their strengths and weaknesses. The paper explores when each approach is most suitable and suggest the potential benefits of combining them for even more efficient big data analysis.

Downloads

Download data is not yet available.

Published

2025-02-28

How to Cite

Zayed, M., & Mutaher Ba-Alwi, F. (2025). Big Data Formation, Reduction, and Its Impact on Sampling: A survey. Sana’a University Journal of Applied Sciences and Technology , 3(1), 597–603. https://doi.org/10.59628/jast.v3i1.1516

Similar Articles

<< < 1 2 3 4 > >> 

You may also start an advanced similarity search for this article.