Yet another thing a robust tagging system would be good for.
Npubs could tag the languages of their own profiles and posts (and each other's, if needed), so the language filter could work by analyzing these tags with a simple web-of-trust algorithm instead of a linguistic AI.
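
To make the idea concrete, here is a rough sketch of what such a filter might look like. Everything in it is an assumption for illustration: the function names, the `(tagger, target, language)` tuple shape, the two-hop trust rule, and the weights and threshold are made up, and none of it is an existing Nostr client or relay API. Tags from npubs outside the viewer's web of trust simply count for nothing.

```python
from collections import defaultdict

def trust_weights(viewer, follows, hop_decay=0.5):
    """Weight the viewer and direct follows at 1.0, follows-of-follows at hop_decay.

    `follows` maps an npub to the set of npubs it follows (hypothetical shape).
    """
    weights = {viewer: 1.0}
    direct = follows.get(viewer, ())
    for friend in direct:
        weights.setdefault(friend, 1.0)
    for friend in direct:
        for friend_of_friend in follows.get(friend, ()):
            weights.setdefault(friend_of_friend, hop_decay)
    return weights

def trusted_language(target, tags, weights, min_score=1.0):
    """Return the language with the most trusted support for `target`, or None."""
    scores = defaultdict(float)
    seen = set()  # count each (tagger, language) pair once, so spamming tags does not help
    for tagger, tagged, lang in tags:
        if tagged != target or (tagger, lang) in seen:
            continue
        seen.add((tagger, lang))
        scores[lang] += weights.get(tagger, 0.0)  # unknown taggers add nothing
    if not scores:
        return None
    lang, score = max(scores.items(), key=lambda item: item[1])
    return lang if score >= min_score else None

# Hypothetical data: alice follows bob and carol, bob follows dave.
follows = {"alice": {"bob", "carol"}, "bob": {"dave"}}
tags = [
    ("bob", "post1", "es"),
    ("dave", "post1", "es"),
    ("mallory", "post1", "en"),  # mallory is outside alice's web of trust
    ("mallory", "post1", "en"),
    ("mallory", "post2", "en"),
]
weights = trust_weights("alice", follows)
post1_language = trusted_language("post1", tags, weights)
post2_language = trusted_language("post2", tags, weights)
```

Here `post1` ends up labeled `es` from alice's point of view, because bob and dave are in her web of trust and mallory is not, while `post2` stays unlabeled since only an untrusted npub tagged it. A client could then show or hide posts by comparing that label to the languages the viewer picked.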