Now that we have an open source approximation of Claude Opus 3.0 (1T), the next step is using this to train a Claude Sonnet 3.5 (175B) using the Constitutional AI paper (https://arxiv.org/abs/2212.08073). Not for safety per-se, but to build stronger reasoning skills by rewarding philosophically better responses
Discussion
No replies yet.