also, my hand-made json marshaler is like 3x faster than the encoding/json library, 2x faster than easyjson, and barely slower than the fastest codec in Go, fiatjaf's binary encoder
and i just found some good tweaks to contain memory usage and it made it generally run faster, probably because of more disk cache available