Big Data Formation, Reduction, and Its Impact on Sampling: A survey

https://doi.org/10.59628/jast.v3i1.1516

المؤلفون

  • Mohammed Zayed Sana'a University
  • Fadl Mutaher Ba-Alwi Department of Information System, Faculty of Information Technology and computer , University of Sana’a, Sana’a, Yemen

الكلمات المفتاحية:

Big data، Sampling، Big data reduction، Impact of sampling

الملخص

The emergence of data in recent years, characterized by the "6Vs" (Volume, Velocity, Variety, Veracity, Value, and Variability), has started the era of big data. While this data holds great potential for uncovering valuable insights and knowledge, its size presents significant challenges for analysis. This paper explores two critical big data reduction techniques: feature selection and sampling. Feature selection focuses on identifying and eliminating irrelevant or redundant features, reducing data dimensionality. Sampling, on the other hand, selects a representative subset of data points for analysis. We compare and contrast these techniques, highlighting their strengths and weaknesses. The paper explores when each approach is most suitable and suggest the potential benefits of combining them for even more efficient big data analysis.

التنزيلات

بيانات التنزيل غير متوفرة بعد.

منشور

2025-02-28

كيفية الاقتباس

Zayed, M., & Mutaher Ba-Alwi, F. (2025). Big Data Formation, Reduction, and Its Impact on Sampling: A survey. مجلة جامعة صنعاء للعلوم التطبيقية والتكنولوجيا, 3(1), 597–603. https://doi.org/10.59628/jast.v3i1.1516

المؤلفات المشابهة

1 2 3 4 > >> 

يمكنك أيضاً إبدأ بحثاً متقدماً عن المشابهات لهذا المؤلَّف.