winner:
training count:
Q-table count:
player1(o)
player2(x)
State
RandomThreeState
ThreeState