Airline-Specific Flight Delay Prediction with Tree-Based Models and Network Metrics

Research output: Contribution to book or proceedingConference articlepeer-review

Abstract

Flight delays present a significant challenge in modern air traffic management and affect airlines, passengers, and the economy. This study proposes a comprehensive approach to predicting flight delays using tree-based machine learning models, integrating flight and weather data with advanced feature engineering techniques. New features, including historical delay metrics and network centrality measures, are derived to enhance predictive accuracy. The dataset is grouped by airlines to account for variations in flight delay patterns across different airlines. Tree-based ensemble models, including random forest, XGBoost, CatBoost, lightGBM, and extra trees, are employed. Results show that prediction metrics improve when models are trained on airline-specific data compared to using the entire dataset with airlines as a feature. For airline-specific analysis, the random forest model achieves the highest average accuracy (92.6%) and precision (97.0%), while the extra trees model achieves the highest average recall (88.5%) and AUC-ROC (97.5%), and both models achieve the highest F1-score (92.2%). These findings emphasize the importance of analyzing airline-specific dynamics and provide actionable insights for mitigating delays. This study advances flight delay prediction by integrating domain-specific features with robust machine learning models.

Original languageEnglish
Title of host publication2025 6th International Conference on Artificial Intelligence, Robotics, and Control, AIRC 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages535-540
Number of pages6
ISBN (Electronic)9798331543488
ISBN (Print)9798331543488
DOIs
StatePublished - May 7 2025
Event6th International Conference on Artificial Intelligence, Robotics, and Control, AIRC 2025 - Savannah, United States
Duration: May 7 2025May 9 2025

Publication series

Name2025 6th International Conference on Artificial Intelligence, Robotics and Control (AIRC)

Conference

Conference6th International Conference on Artificial Intelligence, Robotics, and Control, AIRC 2025
Country/TerritoryUnited States
CitySavannah
Period05/7/2505/9/25

Scopus Subject Areas

  • Mechanical Engineering
  • Control and Optimization
  • Artificial Intelligence
  • Computer Science Applications

Keywords

  • extra trees
  • flight delay prediction
  • machine learning
  • network centrality
  • random forest

Fingerprint

Dive into the research topics of 'Airline-Specific Flight Delay Prediction with Tree-Based Models and Network Metrics'. Together they form a unique fingerprint.

Cite this