I've heard people say this. They also say the opposite. Then they say Goose and Dork are better at different things. I think this is like how people say whisky and gin have different effects on you. It's literally the same chemical, at the same ABV, with the same interaction with the brain; it just tastes a little different. So I think it's a bias. Two sessions of Goose, or two sessions of Dork, would also have this problem. Both tools are just wrappers around the same AI model, and their system prompts aren't fundamentally different.
Discussion
Sure, I guess you're right that two sessions in either agent could have very different results too.