also, i'm just gonna say that a group of people with 100mbit+ internet connections distributed across the land can run a well designed distributed compute system and that the benefits of the decentralization of nostr's architecture, combined with some cheap high bandwidth VPSs acting as rendezvous points could turn out to be a lot more cost effective than depending on cloud GPU LLM services.
i've got half a mind to even start running a relay on my own hardware and getting a couple of cheap VPS services for doing rendezvous and small relays precisely to do something like this. my network is pretty reliable, uptime probably over 95% so with two or three sites like this you have got something that can actually do quite a lot of work.
it's probably a little bit more expensive in total than depending on third parties but you can guarantee stronger privacy with such a setup. especially if you have small VPS caches that hold you over while your network has a burp.