Abstract: Selecting targets to attack and assigning weapons are among the most critical decisions on the battlefield. The decision problem is represented as a dynamic weapon-target assignment (DWTA) ...
In 2016, an AI program he developed at Google DeepMind, AlphaGo, taught itself to play the famously difficult game of Go with ...
Abstract: In this paper, we propose practical model-based policy optimization (PMBPO) to address the time efficiency issue caused by overly frequent model updates in recent probabilistic model-based ...