They have a reinforcement learning phase after they’ve been pretrained

For pretraining they just learn patterns from all the data they are seeing

Reply to this note

Please Login to reply.

Discussion

No replies yet.