Abstract: We investigate the Combined Target-Assignment and Path-Finding (TAPF) problem that computes both task assignments and collision-free paths for multiple agents, that is, each agent is ...
I was reviewing the reward calculation logic and believe I've found a bug in how rewards are assigned to the reward_tensor.