Lately, I always ask 4o to check its work, and see if it can improve its answer, because of how often its first answer is wrong. It wouldn’t surprise me if it’s a threaded set of intelligent challenges to the first response an it offers the strongest candidate back as its user response. Even though it likely is, it wouldn’t even need to be a separate LLM.

Reply to this note

Please Login to reply.

Discussion

No replies yet.