Offering many very specific labels: a vector store like Weaviate or FAISS coupled with an LLM can do it, and do it well, but man... we're talking datacenter-level resources here.
Not many organisations can offer that, and I'm not sure I want them curating my feed.
Offering a couple of dozen general categories, though? That we could run in the client or on a modest VPS, maybe with a BM25 full-text search for specific terms (which will be slow).
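
To give a feel for the BM25 part, here is a rough pure-Python sketch of Okapi BM25 scoring over a handful of posts. Everything in it is an assumption made for illustration: the `BM25Index` class, the `tokenize` helper, the `k1=1.5` and `b=0.75` defaults, and the sample posts are all made up, not taken from any particular project. It scans every post on each query, which is exactly why it gets slow as the corpus grows.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase and split on non-alphanumerics (hypothetical, deliberately naive)."""
    return re.findall(r"[a-z0-9]+", text.lower())


class BM25Index:
    """Illustrative Okapi BM25 index; k1 and b are common defaults, assumed here."""

    def __init__(self, docs, k1=1.5, b=0.75):
        self.k1 = k1
        self.b = b
        self.n = len(docs)
        # Per-document term frequencies and lengths.
        self.doc_tf = [Counter(tokenize(d)) for d in docs]
        self.doc_len = [sum(tf.values()) for tf in self.doc_tf]
        self.avgdl = sum(self.doc_len) / self.n if self.n else 0.0
        # Document frequency: number of documents containing each term.
        self.df = Counter()
        for tf in self.doc_tf:
            self.df.update(tf.keys())

    def idf(self, term):
        # The "+ 1" inside the log keeps the IDF non-negative for common terms.
        df = self.df.get(term, 0)
        return math.log((self.n - df + 0.5) / (df + 0.5) + 1.0)

    def score(self, query, i):
        tf = self.doc_tf[i]
        # Length normalisation; fall back to no normalisation if all docs are empty.
        ratio = self.doc_len[i] / self.avgdl if self.avgdl else 1.0
        norm = self.k1 * (1.0 - self.b + self.b * ratio)
        total = 0.0
        for term in tokenize(query):
            f = tf.get(term, 0)
            if f:
                total += self.idf(term) * f * (self.k1 + 1.0) / (f + norm)
        return total

    def search(self, query, top_k=5):
        # Brute-force scan over every document: simple, but O(corpus) per query.
        hits = [(self.score(query, i), i) for i in range(self.n)]
        hits = [(s, i) for s, i in hits if s > 0]
        hits.sort(key=lambda pair: (-pair[0], pair[1]))
        return hits[:top_k]


if __name__ == "__main__":
    # Made-up sample posts, purely for demonstration.
    posts = [
        "Rust async runtime release notes",
        "Sourdough bread baking tips",
        "New Rust compiler release speeds up builds",
    ]
    index = BM25Index(posts)
    for score, i in index.search("rust release"):
        print(round(score, 3), posts[i])
```

An inverted index (term to list of posts) would avoid scoring posts that share no terms with the query, but on a client or a small VPS the plain scan is probably fine for a few thousand posts.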