Later models were very crispy, but some of the earlier checkpoints were pretty nice. At least I know I don’t have to train it for multiple hours to get a good result. The sweetspot seems to be around 1000-3000 steps.
I’ll adjust the dataset some more and test again later today



