Working on a new AI/ML project partly to learn but also to see if something interesting will arise from "scratch."
A missing piece this morning is a reward system. I don't know how to tell this thing what's good and what's bad programmatically. Seems like the perfect time to "do things that don't scale" and execute the reward function myself. I figure I'll let it ask me after each step for 👍 or 👎. An app would be cool so it could just be taps, but I'm a cli-guy so maybe I'll go with vim bindings.
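A minimal sketch of what that manual reward loop could look like, assuming a Python project. Everything named here is hypothetical: `human_reward`, the `KEY_TO_REWARD` map, and the choice of `k` (vim "up") for good and `j` (vim "down") for bad with rewards of +1 and -1.

```python
from typing import Callable

# Hypothetical key map: vim's "k" (up) means good, "j" (down) means bad.
KEY_TO_REWARD = {"k": 1.0, "j": -1.0}


def human_reward(step_summary: str, read_key: Callable[[str], str] = input) -> float:
    """Ask the human to score one step; re-prompt until a known key is given."""
    while True:
        key = read_key(f"{step_summary}\n[k] good  [j] bad > ").strip().lower()
        if key in KEY_TO_REWARD:
            return KEY_TO_REWARD[key]
        # Unknown key: say so and ask again rather than guessing a reward.
        print("Press k or j.")
```

With the default `input`, each answer still needs Enter; true single-keypress handling would need something like the standard library's `curses` or `termios`, which I've left out to keep the sketch small. Passing `read_key` in also makes it easy to swap the human for a scripted stand-in later.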