Thanks!

RL over nostr will be fun!

I thought about using reactions for when determining the pretraining dataset. But right now I don't use them. For RL they can be useful, reactions to answers can be another signal.

We could make the work more open once more people are involved and more objective work happens.

Reply to this note

Please Login to reply.

Discussion

No replies yet.