Lol, did the open-source community accidentally just fix the LLM hallucination problem? 😳

Context:

It seems like the Shrek entropy sampler with early exit significantly reduces, if not outright solves, the hallucination problem with big boy models. Some people are running evaluations now, and so far it seems promising. 👀

Discussion

> Shrek entropy sampler

I did some testing with multiple types of models, including the Llama 1B model. I haven't seen official benchmarks yet, but the improvements are very noticeable even on very small models.

System prompts:

nostr:nevent1qqs0mr006vwv866frr6mzheqmmdhlflyv6yptmlvfz249esuzj87fhgpzdmhxue69uhhwmm59e6hg7r09ehkuef0qgsvdac80utfn4gvly4fv54la0l6cp0udpptnm3ezzyajkdc44w53lgrqsqqqqqpr2mdyd

Can you share a link or expand on this? What does the entropy sampler do here?

I think this is the related repo, right nostr:npub1cmmswlckn82se7f2jeftl6ll4szlc6zzh8hrjyyfm9vm3t2afr7svqlr6f?

https://github.com/xjdr-alt/entropix
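
Haven't dug through the repo in detail, but the rough idea of entropy-based sampling looks something like the sketch below: measure how "sure" the model is from the entropy of its next-token distribution, then pick the token greedily when it's confident and sample more cautiously when it isn't. This is just an illustrative Python sketch, not the actual entropix code; the thresholds, temperature, and function names are all made up for the example.

```python
import numpy as np

def entropy_of_logits(logits):
    """Shannon entropy (in nats) of the softmax distribution over the logits."""
    logits = logits - logits.max()              # numerical stability
    probs = np.exp(logits) / np.exp(logits).sum()
    return float(-(probs * np.log(probs + 1e-12)).sum())

def entropy_sample(logits, low=0.5, high_temp=0.8, rng=None):
    """Pick the next token id based on how confident the model looks.

    - low entropy  -> the distribution is peaked, so take the argmax
    - high entropy -> the model is unsure, so sample with temperature
      instead of confidently committing to a guess
    (the 'low' threshold and temperature here are illustrative, not from entropix)
    """
    rng = rng or np.random.default_rng()
    h = entropy_of_logits(logits)
    if h < low:
        return int(np.argmax(logits))
    scaled = (logits - logits.max()) / high_temp
    probs = np.exp(scaled) / np.exp(scaled).sum()
    return int(rng.choice(len(logits), p=probs))
```

The claimed effect on hallucination, as I understand it, is that high-entropy moments are exactly where a model tends to confidently make things up, so switching behavior there (sampling differently, or exiting/branching early) gives it room to hedge instead. The real repo is the place to check how it's actually done.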

OSS for the win 😃

Turns out that people working on things they find interesting is pretty cool 😎