Nostr Web Client

I tested RAG but I'm getting poor results. I experimented a little with chunk size and chunk overlap, but it doesn't seem to help. The only decent results (though no better than the standard query) came from enabling "Full Context Mode" in Open WebUI, which feeds the whole document to the model, and replies then took 30% longer than in standard mode.

Any suggestions?

semisol 5mo ago

Since your text may not be self-descriptive and is probably not the kind of content embedding models are trained on, consider feeding it to an LLM first to summarize it and turn it into more explanatory content.

Then feed that to the embedding model.
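
A minimal sketch of that idea, assuming you already have some way to call an LLM and an embedding model. The names `index_with_summaries`, `summarize`, and `embed` are hypothetical, and the two `fake_*` functions are toy stand-ins so the example runs without any model; in a real setup you would swap in your own calls.

```python
from typing import Callable, List, Tuple


def index_with_summaries(
    chunks: List[str],
    summarize: Callable[[str], str],
    embed: Callable[[str], List[float]],
) -> List[Tuple[str, str, List[float]]]:
    """Embed an LLM-written description of each chunk instead of the raw chunk.

    Each entry keeps the original chunk so it can still be returned at
    retrieval time, even though the vector was computed from the description.
    """
    index = []
    for chunk in chunks:
        description = summarize(chunk)  # an LLM call in a real setup
        vector = embed(description)     # an embedding model call in a real setup
        index.append((chunk, description, vector))
    return index


# Toy stand-ins (assumptions for illustration only, not real models).
def fake_summarize(chunk: str) -> str:
    return "This passage is about: " + chunk.strip()


def fake_embed(text: str) -> List[float]:
    return [float(len(text)), float(text.count(" "))]


index = index_with_summaries(["cfg.max_conn=40", "retry=3"], fake_summarize, fake_embed)
```

The query side stays unchanged: you embed the user's question as usual and compare it against the description vectors, then hand the matching original chunks to the model.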


Discussion

daniele 5mo ago

I will try that, thanks.

semisol 5mo ago

You can also build sequential embeddings this way:

The summary of the last segment was as follows:

The current segment is:

Please return a summary for the current segment, using the previous segment for context, and also return the current context.
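
Here is a rough sketch of how that prompt could be chained over a document, assuming the LLM is available as a plain function from prompt text to reply text. `sequential_summaries`, `llm`, and the placeholder used for the first segment are hypothetical names and choices, and `fake_llm` is only a stand-in so the example runs. As a simplification, the whole reply is carried forward as the context for the next segment.

```python
from typing import Callable, List

# Prompt wording taken from the post above; the {previous} and {segment}
# slots are where the earlier summary and the current text get filled in.
PROMPT = (
    "The summary of the last segment was as follows:\n{previous}\n\n"
    "The current segment is:\n{segment}\n\n"
    "Please return a summary for the current segment, using the previous "
    "segment for context, and also return the current context."
)


def sequential_summaries(segments: List[str], llm: Callable[[str], str]) -> List[str]:
    """Summarize segments in order, feeding each summary into the next prompt."""
    summaries = []
    previous = "(none, this is the first segment)"  # assumed placeholder
    for segment in segments:
        summary = llm(PROMPT.format(previous=previous, segment=segment))
        summaries.append(summary)
        previous = summary  # carried forward as context for the next segment
    return summaries


# Stand-in LLM that records the prompts it receives (illustration only).
seen_prompts: List[str] = []


def fake_llm(prompt: str) -> str:
    seen_prompts.append(prompt)
    return f"summary {len(seen_prompts)}"


summaries = sequential_summaries(["first part", "second part"], fake_llm)
```

Each resulting summary can then be passed to the embedding model in place of the raw segment.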

daniele 5mo ago

Uhm, this is hardcore. I need to understand all this pipeline stuff.