I wonder what would be the impact of using Counterfactual Regret Minimization in...

Tenoke · on July 12, 2019

It's not necessarily better. With CFR you can learn beyond what humans have learned; on the other hand, you don't learn their usual mistakes, which would let you exploit them more easily. Also, in this approach you need CFR, since at every point you are checking what would have happened if you had picked something else, which is just impossible with a fixed dataset.
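
To make the "what would've happened if you picked something else" part concrete, here is a rough sketch of regret matching, the per-decision update that CFR is built on. It is not full CFR (no game tree, no information sets), just self-play on rock-paper-scissors, and all names here (`PAYOFF`, `strategy_from_regrets`, `counterfactual_values`, `train`) are made up for illustration. The point is that every update needs the value of each action you did not take, which you can compute when you can query the game but cannot read off a fixed log of human play.

```python
# Regret matching in self-play on rock-paper-scissors.
# Illustrative sketch only: a one-shot game, not a full CFR implementation.

# PAYOFF[a][b] is the payoff for playing action a against action b
# (0 = rock, 1 = paper, 2 = scissors). The game is symmetric and zero-sum.
PAYOFF = [
    [0, -1, 1],
    [1, 0, -1],
    [-1, 1, 0],
]
NUM_ACTIONS = 3


def strategy_from_regrets(regrets):
    """Play each action in proportion to its positive cumulative regret."""
    positive = [max(r, 0.0) for r in regrets]
    total = sum(positive)
    if total > 0:
        return [p / total for p in positive]
    # No positive regret yet: fall back to uniform play.
    return [1.0 / len(regrets)] * len(regrets)


def counterfactual_values(opponent_strategy):
    """Expected payoff of each action, including the ones we did not pick."""
    return [
        sum(PAYOFF[a][b] * opponent_strategy[b] for b in range(NUM_ACTIONS))
        for a in range(NUM_ACTIONS)
    ]


def train(iterations, initial_regrets):
    """Run self-play and return each player's average strategy."""
    regrets = [list(initial_regrets[0]), list(initial_regrets[1])]
    strategy_sums = [[0.0] * NUM_ACTIONS, [0.0] * NUM_ACTIONS]
    for _ in range(iterations):
        strategies = [strategy_from_regrets(r) for r in regrets]
        for player in range(2):
            opponent = strategies[1 - player]
            values = counterfactual_values(opponent)
            expected = sum(
                strategies[player][a] * values[a] for a in range(NUM_ACTIONS)
            )
            for a in range(NUM_ACTIONS):
                # Regret: how much better action a would have done
                # than the strategy that was actually played.
                regrets[player][a] += values[a] - expected
                strategy_sums[player][a] += strategies[player][a]
    return [[s / iterations for s in sums] for sums in strategy_sums]


if __name__ == "__main__":
    # Start player 0 with a bias toward rock so the dynamics are non-trivial.
    averages = train(20000, [[1.0, 0.0, 0.0], [0.0, 0.0, 0.0]])
    print(averages)
```

The average strategies should drift toward the uniform mix, which is the equilibrium for this game. Note that `counterfactual_values` is exactly the step a fixed dataset can't give you: it asks the game for the outcome of actions nobody actually played.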