Nice writeup on what the Transformer in a GPT is and how it works:
https://jalammar.github.io/illustrated-transformer/
Also high quality:
http://nlp.seas.harvard.edu//2018/04/03/attention.html