
BIG DATA FRAMEWORK FOR PREDICTING AND MANAGING URBAN TRAFFIC FLOW IN SMART CITIES | IJET β Volume 12 Issue 2 | IJET-V12I2P46

Table of Contents
ToggleInternational Journal of Engineering and Techniques (IJET)
Open Access β’ Peer Reviewed β’ High Citation & Impact Factor β’ ISSN: 2395-1303
Volume 12, Issue 2 | Published: March 2026
Author: CH. Divyasri, B. Rupa, B. Archana, B. Saicharan, Dr A. Sathish Kumar, Dr B.VenkataRamana
DOI: https://doi.org/{{doi}} β’ PDF: Download
Abstract
The fast expansion of cities along with the growing number of automobiles has created a severe problem with traffic congestion in today’s urban areas. The current traffic system operates poorly because it results in extended travel durations while drivers consume excessive fuel and produce environmental damage which leads to financial losses. Traditional traffic control systems operate on fixed signal schedules which use past information to operate but they fail to manage the changing traffic situations that occur in real time. The research paper provides a Big Data framework which helps smart cities predict their urban traffic patterns and control their citywide traffic flow.
The system collects live traffic information from multiple different data sources which include IOT traffic sensors and GPS-equipped vehicles and traffic monitoring cameras and mobile phone applications. The system uses Apache Kafka to receive live data streams which it stores in Hadoop Distributed File System (HDFS) for its storage needs and processes using Apache Spark. The research employs three machine learning and deep learning models which include Random Forest and Support Vector Regression (SVR) and Long Short-Term Memory (LSTM) networks to achieve precise traffic flow and congestion predictions. The system uses prediction results to modify traffic signals and find better routes which enhances city traffic management while building sustainable transportation networks for smart cities.
Keywords
Smart Cities, Traffic Congestion Management, Big Data Analytics, Real-Time Traffic Prediction, Machine Learning, Deep Learning, Intelligent Transportation Systems.
Conclusion
This project proposed an integrated Big Dataβbased framework for urban traffic flow prediction and management within smart cities. Rapid urbanization and an immense increase in the number of vehicles necessitate a more effective solution for prevailing traffic management systems, which are highly dependent on fixed signal timings and historical data. This system overcomes these issues by collecting real-time traffic data from various heterogeneous sources such as IoT traffic sensors, GPS-enabled vehicles, traffic surveillance cameras, and mobile apps. Apache Kafka ensured fault-tolerant, high-throughput, reliable real-time data ingestion. HDFS allowed scalable, fault-tolerant storage of large volumes of traffic data. Apache Spark allowed the efficient real-time and batch processing of streams of data, along with data cleaning, aggregation, and feature extraction. The study proposed the incorporation of machine learning and deep learning techniques- Random Forest, Support Vector Regression (SVR), and Long Short-Term Memory-LSTM networks-to forecast the state of traffic flow or congestion. Among those, LSTM emerged as the best, due to its non-linearity and ability to grasp deep complex temporal dependencies that exist in traffic data. The result of prediction outcomes was holistically leveraged for enabling intelligent traffic light control systems as well as route optimization, thus ensuring decreased congestion, average vehicle speed, as well as lowered travel time. Validation of the experimental results has established the scalability, accuracy, as well as real-time capability of the proposed model for dealing with real-time traffic data. In conclusion, the project not only suggests the beneficial usage of Big Data technologies but also their integration with prediction systems, thus ensuring enhanced mobility, reduced environmental effects, as well as intelligent transportation systems for smarter cities in the future.
References
[1]Y. Zheng, L. Capra, O. Wolfson, and H. Yang, βUrban computing: Concepts, methodologies, and applications,β ACM Transactions on Intelligent Systems and Technology, vol. 5, no. 3, pp. 1β55, Sept. 2014. [2]B. M. Williams and L. A. Hoel, βModeling and forecasting vehicular traffic flow as a seasonal ARIMA process: Theoretical basis and empirical results,β Journal of Transportation Engineering, vol. 129, no. 6, pp. 664β 672, Nov. 2003. [3]W. Min and L. Wynter, βReal-time road traffic prediction with spatio-temporal correlations,β Transportation Research Part C: Emerging Technologies, vol. 19, no. 4, pp. 606β616, Aug. 2011. [4]Y. Lv, Y. Duan, W. Kang, Z. Li, and F.-Y. Wang, βTraffic flow prediction with big data: A deep learning approach,β IEEE Transactions on Intelligent Transportation Systems, vol. 16, no. 2, pp. 865β873, Apr. 2015. [5]N. G. Polson and V. O. Sokolov, βDeep learning for short-term traffic flow prediction,β Transportation Research Part C: Emerging Technologies, vol. 79, pp. 1β17, June 2017. [6]Y. Li, R. Yu, C. Shahabi, and Y. Liu, βDiffusion convolutional recurrent neural network: Data-driven traffic forecasting,β in Proc. IEEE International Conference on Learning Representations (ICLR), Vancouver, Canada, 2018, pp. 1β16. [7]M. Abdel-Aty, A. Dhindsa, and S. Pande, βReal-time traffic signal control using integrated real-time traffic data,β Transportation Research Record, no. 2259, pp. 29β39, 2011. [8]T. White, Hadoop: The Definitive Guide, 4th ed. Sebastopol, CA, USA: OβReilly Media, 2015. [9]M. Zaharia et al., βApache Spark: A unified engine for big data processing,β Communications of the ACM, vol. 59, no. 11, pp. 56β65, Nov. 2016. [10]B. L. Smith, B. M. Williams, and R. K. Oswald, βComparison of parametric and nonparametric models for traffic flow forecasting,β Transportation Research Part C: Emerging Technologies, vol. 10, no. 4, pp. 303β321, 2002. [11]J. Medina-Salgado, E. SΓ‘nchez-DΓaz, and J. L. Llorca, βShort-term traffic flow prediction using support vector regression,β Expert Systems with Applications, vol. 39, no. 10, pp. 10416β10425, 2012. [12]L. Breiman, βRandom forests,β Machine Learning, vol. 45, no. 1, pp. 5β32, 2001. H. Karau, A. Konwinski, P. Wendell, and M. Zaharia, Learning Spark: Lightning-Fast Big Data Analysis. Sebastopol, CA, USA: OβReilly Media, 2015.
Cite this article
APA
CH. Divyasri, B. Rupa, B. Archana, B. Saicharan, Dr A. Sathish Kumar, Dr B.VenkataRamana (March 2026). BIG DATA FRAMEWORK FOR PREDICTING AND MANAGING URBAN TRAFFIC FLOW IN SMART CITIES. International Journal of Engineering and Techniques (IJET), 12(2). https://doi.org/{{doi}}
CH. Divyasri, B. Rupa, B. Archana, B. Saicharan, Dr A. Sathish Kumar, Dr B.VenkataRamana, βBIG DATA FRAMEWORK FOR PREDICTING AND MANAGING URBAN TRAFFIC FLOW IN SMART CITIES,β International Journal of Engineering and Techniques (IJET), vol. 12, no. 2, March 2026, doi: {{doi}}.
