应该是说单纯的通过堆叠参数数量来提升GPT性能暂时到头了 剩下的就要靠优化算法来实现了

Reply to this note

Please Login to reply.

Discussion

No replies yet.