The difference in the training of this model is that is uses a smaller but higher quality testing data set. For example, they include chatgpt conversations in the training data. You can use a more powerful model to train a smaller imitating model.

Reply to this note

Please Login to reply.

Discussion

No replies yet.