Resilient multi-agent reinforcement learning for centralised tactical conflict resolution under uncertain perturbations and non-cooperative traffic in urban air mobility

Rodolphe Fremond et al.

Transportation Research Part C: Emerging Technologies2026https://doi.org/10.1016/j.trc.2026.105542article
ABDC A*
Weight
0.50

Abstract

• A centralised Tactical Conflict Resolution Solver using MARL with transformers. • A perturbation module modelling GNSS noise, comms loss, and sensor defects. • Evaluation under degraded data and mixed cooperative and non-cooperative traffic. • Resilient solver reducing NMAC risks compared with a baseline MARL model. • Limitations appear under dynamic-intent intruders with mixed airspace operations. This research investigates tactical conflict resolution for Unmanned Aircraft Systems (UAS) and Urban Air Mobility (UAM) operations under degraded conditions and in the presence of non-cooperative UAS/UAM and manned Commercial Air Transportation and General Aviation (CAT/GA) intruders. The study adopts a centralised safety-net approach within UAS Traffic Management (UTM) architectures, envisioning ground-based conflict resolution services. We propose a set of Tactical Conflict Resolution Solvers (TCRS), each built upon a Multi-Agent Reinforcement Learning (MARL) core using a shared-policy transformer architecture and executed in a decentralised manner. To assess resilience of TCRS variants, we introduce domain-specific perturbations, including positioning noise, communication loss, and sensor-related defects. The TCRS operates with partial decision-making ability in non-cooperative traffic environments, while the perturbation model increases realism by simulating varying degrees of information availability. Results show that the perturbation-trained models achieve substantial safety gains compared with the baseline TCRS trained in ideal conditions. The most resilient variant; trained under multi-perturbation exposure and evaluated in non-cooperative environments, achieves a threefold reduction in critical safety violations compared with the baseline and remains robust under mixed cooperative/non-cooperative traffic with static intent. It exhibits a modest vulnerability under fully homogeneous non-cooperative scenarios with dynamic intent. Simulations involving concurrent CAT/GA and UAS operations further indicate that integrating UAS operations within the existing airspace classification remains hazardous for ground-based tactical conflict resolution when constrained by short look-ahead horizons and insufficient time to react.

Open via your library →

Cite this paper

https://doi.org/https://doi.org/10.1016/j.trc.2026.105542

Or copy a formatted citation

@article{rodolphe2026,
  title        = {{Resilient multi-agent reinforcement learning for centralised tactical conflict resolution under uncertain perturbations and non-cooperative traffic in urban air mobility}},
  author       = {Rodolphe Fremond et al.},
  journal      = {Transportation Research Part C: Emerging Technologies},
  year         = {2026},
  doi          = {https://doi.org/https://doi.org/10.1016/j.trc.2026.105542},
}

Paste directly into BibTeX, Zotero, or your reference manager.

Flag this paper

Resilient multi-agent reinforcement learning for centralised tactical conflict resolution under uncertain perturbations and non-cooperative traffic in urban air mobility

Flags are reviewed by the Arbiter methodology team within 5 business days.


Evidence weight

0.50

Balanced mode · F 0.40 / M 0.15 / V 0.05 / R 0.40

F · citation impact0.50 × 0.4 = 0.20
M · momentum0.50 × 0.15 = 0.07
V · venue signal0.50 × 0.05 = 0.03
R · text relevance †0.50 × 0.4 = 0.20

† Text relevance is estimated at 0.50 on the detail page — for your query’s actual relevance score, open this paper from a search result.