Article

A Hybrid Deep Learning Ensemble for Multi-Class Malicious URL Detection in Arabic and English

Malicious URLs serve as primary vectors for cyber-attacks, facilitating phishing, malware, and defacement. While conventional systems focus on binary classification, security operations require granular threat identification. Furthermore, literature exhibits a significant bias toward English content, leaving Arabic speaking populations disproportionately exposed to localized threats. This study proposes a hybrid stacked ensemble architecture integrating CNN-BiLSTM with an Attention mechanism, Random Forest, and XGBoost. The methodology incorporates 27 lexical features for English URLs and 23 specialized features tailored to Arabic linguistic structures and Punycode-encoded domains. The model was evaluated on 651,191 English and 20,329 Arabic URLs. The architecture achieved a peak accuracy of 99.43% on English data and 89.07% on the Arabic dataset, outper-forming baseline configurations. Feature correlation analysis demonstrated that class imbalance inflates feature importance, with average correlation coefficients decreasing by 0.214 post-balancing. Comparative experiments utilizing a unified cross-language model yielded inferior results (91.2% English, 82.5% Arabic), confirming that language-specific optimization is essential. This research establishes the first multi-class baseline for Arabic URL detection, providing a robust, scalable framework for regional threat intelligence.

...
Nagi Ali Abdullah Al-shaibany
Department of Information Technology, Faculty of Computer and Information Technology, Sana'a University, Sana'a, Yemen
Download data is not yet available.

Metrics

0
Views
0
Downloads
0
Citations

How to Cite

A Hybrid Deep Learning Ensemble for Multi-Class Malicious URL Detection in Arabic and English. (2026). Sana’a University Journal of Applied Sciences and Technology, 4(6), 2230-2243. https://doi.org/10.59628/jast.v4i6.2961

Similar Articles

You may also start an advanced similarity search for this article.