International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 187 - Issue 44 |
Published: September 2025 |
Authors: Jinsu Ann Mathew, Ninan Sajeeth Philip, Joe Jacob |
![]() |
Jinsu Ann Mathew, Ninan Sajeeth Philip, Joe Jacob . Detecting Algorithmically Generated Domains Using Entropy and Lexical Features. International Journal of Computer Applications. 187, 44 (September 2025), 37-44. DOI=10.5120/ijca2025925758
@article{ 10.5120/ijca2025925758, author = { Jinsu Ann Mathew,Ninan Sajeeth Philip,Joe Jacob }, title = { Detecting Algorithmically Generated Domains Using Entropy and Lexical Features }, journal = { International Journal of Computer Applications }, year = { 2025 }, volume = { 187 }, number = { 44 }, pages = { 37-44 }, doi = { 10.5120/ijca2025925758 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2025 %A Jinsu Ann Mathew %A Ninan Sajeeth Philip %A Joe Jacob %T Detecting Algorithmically Generated Domains Using Entropy and Lexical Features%T %J International Journal of Computer Applications %V 187 %N 44 %P 37-44 %R 10.5120/ijca2025925758 %I Foundation of Computer Science (FCS), NY, USA
Detecting domain names generated by Domain Generation Algorithms (DGAs) is a key challenge in cybersecurity, as these domains are designed to appear unpredictable and evade standard filtering methods. This work proposes a lightweight and interpretable detection method that relies on lexical properties and entropy-based features derived from domain names. By analyzing character patterns and measuring randomness through Shannon entropy and relative entropy across bigrams, trigrams, and fourgrams, the method captures both structural and statistical differences between legitimate and algorithmic domains. Multiple machine learning classifiers were trained and evaluated, with the best results achieved using XGBoost and Random Forest. Entropy-based features were found to be highly influential in the classification process, highlighting their effectiveness in distinguishing algorithmically generated domains. The findings support the use of entropy as a practical and theoretically grounded feature for DGA detection.