Link to the code - https://github.com/SJTU-IPADS/PowerInfer
This is huge! Now watch the LLM API costs dropping even further.
These papers almost feel like cheat codes, & why closed companies like OpenAI don’t publish their important works anymore.
Mind-boggling that it is even possible.
Full research paper: https://t.co/Kvj3lRTONE?s=09
Abstract:

Discussion
No replies yet.