International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
|
Volume 187 - Issue 37 |
Published: September 2025 |
Authors: Godha Naravara, Ahmad Waseem Ghauri, Chad Mourning |
![]() |
Godha Naravara, Ahmad Waseem Ghauri, Chad Mourning . Machine Learning for Turbulence Risk Identification in U.S. Airspace Using Open Flight and Weather Data. International Journal of Computer Applications. 187, 37 (September 2025), 1-9. DOI=10.5120/ijca2025925621
@article{ 10.5120/ijca2025925621, author = { Godha Naravara,Ahmad Waseem Ghauri,Chad Mourning }, title = { Machine Learning for Turbulence Risk Identification in U.S. Airspace Using Open Flight and Weather Data }, journal = { International Journal of Computer Applications }, year = { 2025 }, volume = { 187 }, number = { 37 }, pages = { 1-9 }, doi = { 10.5120/ijca2025925621 }, publisher = { Foundation of Computer Science (FCS), NY, USA } }
%0 Journal Article %D 2025 %A Godha Naravara %A Ahmad Waseem Ghauri %A Chad Mourning %T Machine Learning for Turbulence Risk Identification in U.S. Airspace Using Open Flight and Weather Data%T %J International Journal of Computer Applications %V 187 %N 37 %P 1-9 %R 10.5120/ijca2025925621 %I Foundation of Computer Science (FCS), NY, USA
Turbulence is a persistent and often unpredictable hazard in aviation, frequently occurring without visual or radar cues. This study presents a machine learning framework for identifying severe and extreme turbulence risk across U.S. airspace using only publicly available flight and weather data. The framework combines over 550,000 pilot reports with ERA5 reanalysis data to construct a large labeled dataset. It integrates anomaly-aware downsampling, synthetic oversampling, dimensionality reduction, and both unsupervised (K-Means) and supervised (XGBoost) modeling. In 10-fold cross-validation, the model achieved strong performance (recall = 0.91, F1 = 0.88) in detecting high-risk events. A real-world case study from February 2025 further illustrates the system’s predictive capability. This work demonstrates the feasibility of operational turbulence identification using open-source data and interpretable learning techniques.