Hello there! My name is Onur, I’m a Computer Engineer with a specialization in Artificial Intelligence. I am from a lovely city called Eskişehir in Türkiye. My journey in this field began at Yıldız Technical University in Istanbul, where I earned my Bachelor’s degree in Computer Engineering. For my thesis, I worked on an exciting medical AI project that leveraged computer vision. After completing my Bachelor’s degree, I spent two years in Italy, pursuing a master’s degree in Computer Engineering from the University of Padova.
During my time at UNIPD, I specialized in AI & Robotics and focused my research on Natural Language Processing. My thesis involved proposing a novel static word embedding model. Outside of work, I am passionate about traveling and learning about different culinary practices. Currently, I am thrilled to be a member of the COeXISTENCE team here at Jagiellonian University in the beautiful city of Kraków. Here I continue my learning journey and contribute to building a deeper understanding of the interaction between humans and autonomous agents in future traffic environments.
In today’s fast-changing world of AI, the interactions of various subfields are not just beneficial, but essential. Consequently, I believe that every AI enthusiast is responsible for staying up-to-date with every other subfield, regardless of the specific professional focus. With this mindset, my previous research interests and projects span deep learning, classifiers and regressors, computer vision, clustering algorithms, reinforcement learning, and natural language processing. Also, I have experience with mobile development and The Robotic Operating System. Currently, I am continuing my studies on reinforcement learning. I am exploring innovative methods to model learning in simulations that mirror natural learning processes, aiming for accuracy and efficiency.
List of main publications and preprints
-
URB - Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles
Akman, Ahmet Onur,
Psarou, Anastasia,
Hoffmann, Michał,
Gorczyca, Łukasz,
Kowalski, Łukasz,
Gora, Paweł,
Jamróz, Grzegorz,
and Kucharski, Rafał
arXiv preprint arXiv:2505.17734
2025
Connected Autonomous Vehicles (CAVs) promise to reduce congestion in future urban networks, potentially by optimizing their routing decisions. Unlike for human drivers, these decisions can be made with collective, data-driven policies, developed by machine learning algorithms. Reinforcement learning (RL) can facilitate the development of such collective routing strategies, yet standardized and realistic benchmarks are missing. To that end, we present \our: Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles. \our is a comprehensive benchmarking environment that unifies evaluation across 29 real-world traffic networks paired with realistic demand patterns. \our comes with a catalog of predefined tasks, four state-of-the-art multi-agent RL (MARL) algorithm implementations, three baseline methods, domain-specific performance metrics, and a modular configuration scheme. Our results suggest that, despite the lengthy and costly training, state-of-the-art MARL algorithms rarely outperformed humans. Experimental results reported in this paper initiate the first leaderboard for MARL in large-scale urban routing optimization and reveal that current approaches struggle to scale, emphasizing the urgent need for advancements in this domain.
-
RouteRL: Multi-agent reinforcement learning framework for urban route choice with autonomous vehicles
Akman, Ahmet Onur,
Psarou, Anastasia,
Gorczyca, Łukasz,
Varga, Zoltán György,
Jamróz, Grzegorz,
and Kucharski, Rafał
SoftwareX
2025
RouteRL is a novel framework that integrates multi-agent reinforcement learning (MARL) with a microscopic traffic simulation for the development of efficient collective route choice strategies for autonomous vehicles (AVs). The proposed framework models the daily urban route choices of driver agents of two types: human drivers, emulated using behavioral route choice models, and AVs, modeled as MARL agents optimizing their policies for a predefined objective. RouteRL aims to advance research in MARL, transport modeling, and human–AI interaction for transportation applications. This study presents a technical report on RouteRL, outlines its potential research contributions, and showcases its impact via illustrative examples. Initial findings show that human travel times could increase following the introduction of AVs in urban environments, highlighting the importance of future studies to support efficient urban mobility.
-
Social implications of coexistence of CAVs and human drivers in the context of route choice
Jamróz, Grzegorz,
Akman, Ahmet Onur,
Psarou, Anastasia,
Varga, Zoltán György,
and Kucharski, Rafał
Scientific Reports
2025
Suppose in a stable urban traffic system populated only by human driven vehicles (HDVs), a given proportion (e.g. 10 %) is replaced by a fleet of Connected and Autonomous Vehicles (CAVs), which share information and pursue a collective goal. Suppose these vehicles are centrally coordinated and differ from HDVs only by their collective capacities allowing them to make more efficient routing decisions before the travel on a given day begins. Suppose there is a choice between two routes and every day each driver makes a decision which route to take. Human drivers maximize their utility. CAVs might optimize different goals, such as the total travel time of the fleet. We show that in this plausible futuristic setting, the strategy CAVs are allowed to adopt may result in human drivers either benefitting or being systematically disadvantaged and urban networks becoming more or less optimal. Consequently, some regulatory measures might become indispensable.
-
Impact of collective behaviors of autonomous vehicles on urban traffic dynamics: A multi-agent reinforcement learning approach
Akman, Ahmet Onur,
Psarou, Anastasia,
Varga, Zoltán György,
Jamróz, Grzegorz,
and Kucharski, Rafał
arXiv preprint arXiv:2509.22216
2025
This study examines the potential impact of reinforcement learning (RL)-enabled autonomous vehicles (AV) on urban traffic flow in a mixed traffic environment. We focus on a simplified day-to-day route choice problem in a multi-agent setting. We consider a city network where human drivers travel through their chosen routes to reach their destinations in minimum travel time. Then, we convert one-third of the population into AVs, which are RL agents employing Deep Q-learning algorithm. We define a set of optimization targets, or as we call them behaviors, namely selfish, collaborative, competitive, social, altruistic, and malicious. We impose a selected behavior on AVs through their rewards. We run our simulations using our in-house developed RL framework PARCOUR. Our simulations reveal that AVs optimize their travel times by up to 5%, with varying impacts on human drivers’ travel times depending on the AV behavior. In all cases where AVs adopt a self-serving behavior, they achieve shorter travel times than human drivers. Our findings highlight the complexity differences in learning tasks of each target behavior. We demonstrate that the multi-agent RL setting is applicable for collective routing on traffic networks, though their impact on coexisting parties greatly varies with the behaviors adopted.
-
Autonomous Vehicles Using Multi-Agent Reinforcement Learning for Routing Decisions Can Harm Urban Traffic
Psarou, Anastasia,
Akman, Ahmet Onur,
Gorczyca, Łukasz,
Hoffmann, Michał,
Varga, Zoltán György,
Jamróz, Grzegorz,
and Kucharski, Rafał
arXiv preprint arXiv:2502.13188
2025
Autonomous vehicles (AVs) using Multi-Agent Reinforcement Learning (MARL) for simultaneous route optimization may destabilize traffic environments, with human drivers possibly experiencing longer travel times. We study this interaction by simulating human drivers and AVs. Our experiments with standard MARL algorithms reveal that, even in trivial cases, policies often fail to converge to an optimal solution or require long training periods. The problem is amplified by the fact that we cannot rely entirely on simulated training, as there are no accurate models of human routing behavior. At the same time, real-world training in cities risks destabilizing urban traffic systems, increasing externalities, such as CO2 emissions, and introducing non-stationarity as human drivers adapt unpredictably to AV behaviors. Centralization can improve convergence in some cases, however, it raises privacy concerns for the travelers’ destination data. In this position paper, we argue that future research must prioritize realistic benchmarks, cautious deployment strategies, and tools for monitoring and regulating AV routing behaviors to ensure sustainable and equitable urban mobility systems.