Try Ollama or llama.cpp, see: https://redlib.catsarch.com/r/OrangePI/s/TAEvKJAK4d
I don't know whether either will use the NPUs out of the box; possibly not. That thread might give you some leads to follow up on. I have no experience with that hardware myself, but I'm curious, so keep us posted.