Article

Big Data Formation, Reduction, and Its Impact on Sampling: A survey

Cover Image

PDF

Published 2025-02-28

DOI 10.59628/jast.v3i1.1516

Issue Vol. 3 No. 1 (2025): Sana'a University Journal of Applied Sciences and Technology

Section Article

Big data Sampling Big data reduction Impact of sampling

The emergence of data in recent years, characterized by the "6Vs" (Volume, Velocity, Variety, Veracity, Value, and Variability), has started the era of big data. While this data holds great potential for uncovering valuable insights and knowledge, its size presents significant challenges for analysis. This paper explores two critical big data reduction techniques: feature selection and sampling. Feature selection focuses on identifying and eliminating irrelevant or redundant features, reducing data dimensionality. Sampling, on the other hand, selects a representative subset of data points for analysis. We compare and contrast these techniques, highlighting their strengths and weaknesses. The paper explores when each approach is most suitable and suggest the potential benefits of combining them for even more efficient big data analysis.

...

Mohammed Zayed

Sana'a University

...

Fadl Mutaher Ba-Alwi

Department of Information System, Faculty of Information Technology and computer , University of Sana’a, Sana’a, Yemen

Download data is not yet available.

Metrics

0

Views

0

Downloads

0

Citations

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Similar Articles

Nagi Ali Abdullah Al-shaibany, A Hybrid Deep Learning Ensemble for Multi-Class Malicious URL Detection in Arabic and English , Sana'a University Journal of Applied Sciences and Technology: Vol. 4 No. 6 (2026): Sana'a University Journal of Applied Sciences and Technology
Asma’a Ahmed AL-Adhreai, A.M. Abdulwahab, A. H. Al-Hammadi, Structural, Optical and Cytotoxic Analysis of Sr-Doped CuS Nanoparticles for Lung Cancer Applications , Sana'a University Journal of Applied Sciences and Technology: Vol. 3 No. 4 (2025): Sana'a University Journal of Applied Sciences and Technology
Ahmed M. Al-Anweh, Ibrahim A. Al-Akhaly, Evaluation of Crushed Basaltic Rocks as Coarse Aggregate, at Selected Site South Sana’a, Yemen: Properties and Concrete Performance , Sana'a University Journal of Applied Sciences and Technology: Vol. 4 No. 5 (2026): Sana'a University Journal of Applied Sciences and Technology
Abdulkarem Yahya Abohatem , Fadl M.M. Ba-Alwi, Abdualmajed Ahmed Al-Khulaidi, Suggestion Cybersecurity Framework (CSF) for Reducing Cyber-Attacks on Information Systems , Sana'a University Journal of Applied Sciences and Technology: Vol. 1 No. 3 (2023): Sana'a University Journal of Applied Sciences and Technology
Mahfoudh M. AL-Hamadi, Anass A. Alnedhary, Hadi Ali Quria’a, Ghadeer M. al-mutawakel, Adsorption Isotherm, Kinetic and Thermodynamic Studies for Removal of Fluoride Ions from Drinking Water Using Modified Natural Pumice , Sana'a University Journal of Applied Sciences and Technology: Vol. 2 No. 5 (2024): Sana'a University Journal of Applied Sciences and Technology
Ibrahim Ahmed Al-Baltah, Sultan Yahya Al-Sultan, Marwa Abdulrahman Al-hadi, Ammar Thabit Zahary, Factors Influencing the Adoption of Mobile Banking Applications in Yemen Using an Extended Technology Acceptance Model , Sana'a University Journal of Applied Sciences and Technology: Vol. 2 No. 2 (2024): Sana'a University Journal of Applied Sciences and Technology
Ghaleb H. Aljafary , Abdulrahman Hussian, Deep Convolutional Neural Networks for Fingerprint Classification , Sana'a University Journal of Applied Sciences and Technology: Vol. 3 No. 5 (2025): Sana'a University Journal of Applied Sciences and Technology
Eman H. Ba-Othman, Hisham M. Nagi, Ghazi A. Al-Rashidi, Khaled M. Khanbari, Seasonal Evaluation of Physicochemical Parameters of Marine Sediments on the Coasts of Al-Mukalla and Broom Districts, Hadhramout, Yemen , Sana'a University Journal of Applied Sciences and Technology: Vol. 4 No. 3 (2026): Sana'a University Journal of Applied Sciences and Technology
Shakib Alsowidy, Localized Plastic Deformation and Microstructure Properties of Al/Si Alloy Improved with Al3Ni Compound , Sana'a University Journal of Applied Sciences and Technology: Vol. 1 No. 3 (2023): Sana'a University Journal of Applied Sciences and Technology
Salahadeen .N. Al-Emad , Abbas Mohammed Al-Azab, Naif Ahmed Al-Gabri, Brucellosis among exported livestock from the Horn of Africa to Yemen: Seroprevalence study associated with risk factors , Sana'a University Journal of Applied Sciences and Technology: Vol. 3 No. 5 (2025): Sana'a University Journal of Applied Sciences and Technology

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)

Mohammed Taj, Mohammed Zayed, Abdulwahid Alhetar, Mohammed Rajeh, Mohammed Abbas Al-Sharafi, Basem Abdulrhman Munassar, Enabling Arabic Database Querying via Parameter-Efficient Fine-Tuning of Large Language Models , Sana'a University Journal of Applied Sciences and Technology: Vol. 4 No. 1 (2026): Sana'a University Journal of Applied Sciences and Technology

About The Journal

Journal Policies

Editorial guidelines, ethics, and publication standards

About the Journal

A

Journal scope, aims, editorial board, and history

Publication Ethics

E

Ethical guidelines and malpractice statement for all parties

Open Access Policy

O

Open access, archiving, and self-archiving policies

Peer Review Process

P

Review workflow, criteria, and timeline for submissions

Licensing Policy

L

Copyright, licensing, and reuse permissions for published content

Digital Archiving

D

View Digital Archiving

Long-term preservation and digital archiving strategy

Publication Frequency

F

Issuance schedule, volumes, and publication timeline

Language Policy

L

View Language Policy

Submission language, translation, and language services

©

Copyright Policy

C

Author rights, copyright transfer, and permissions

Editorial Independence

I

Editorial autonomy, conflict of interest, and decision-making

AI Ethics and Responsible Use

AI

Guidelines for ethical and transparent AI use in scholarly writing

Journal Meta Data

Journal Metrics

Key indicators of journal quality and impact

Crossref DOI

C

Digital Object Identifier for persistent citation and linking

Google Scholar

G

Search Citations

Comprehensive citation metrics and academic search engine

ISSN Number

I

International Standard Serial Number for journal identification

SJIIF Impact

S

Scientific Journal Impact Factor and quality assessment

H-Index Score

H

Measures journal productivity and citation impact

info block

Platform Information

Dedicated guides · Readers, Authors, Libraries

R

For Readers

reader

Abstracts · downloads · open access

A

For Authors

author

Submission Guidelines

Peer review · rights · citations

L

For Librarians

library

ISSN · aggregation · subscriptions

History Workflow

Journal Timeline & Fees

Fast-Track

Efficient processing with transparent publication costs

Time to first decision 7 days

Rapid initial review response

Review time 45 days

Comprehensive peer-review process

Submission to acceptance 65 days

Complete manuscript processing timeline

Make a Submission

Make a Submission

Categories

Keywords