Isn’t it better to use an uncensored base model for the training? Will you open-source the dataset?
I started with the Llama 3.1 Base!
The dataset is on relays; most relays should allow downloading it.
Oh, I see. By dataset I was thinking of the [WoT filtered] raw data after cleaning/curation and post-processing.