Subnostr

An experimental project applying large-scale Reinforcement Learning techniques to computer usage scenarios, utilizing neural reward models to validate agent actions. The system implements a three-step cycle extending ReACT into reinforcement learning, with multiple training stages focused on developing reasoning skills for computer interaction.