Look at LocalAI and Open WebUI, especially the latter's integrations. The former is basically YAML-based model specs, with presets and control over what gets loaded when. RAG is annoying as fuck to set up, but it's totally doable. ^^
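For a taste, a model spec in LocalAI is just a YAML file dropped into the models dir - roughly like this sketch (model name and weights file are placeholders, and the exact keys can differ between LocalAI versions, so check the docs):

```yaml
# models/mistral.yaml - one file per model LocalAI should know about
name: mistral                 # the name you request via the OpenAI-style API
backend: llama-cpp            # inference backend for this model
parameters:
  model: mistral-7b-instruct.Q4_K_M.gguf  # weights file in the models dir (placeholder)
  temperature: 0.7
context_size: 4096
template:
  chat: mistral-chat          # prompt template preset to apply
```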
Working on the exact same thing, but based on the Milk-V Oasis. Currently deciding whether to get a Tenstorrent Wormhole, or to tinker with amdgpu to get ROCm working so an RX 7000 card can be the basis...
Alternatively, Ampere + NVIDIA works, because NVIDIA has ARM drivers - partial ones, at least, but CUDA is included. Why Ampere? Look at the TDP; pairing that with lots of RAM lets you configure LocalAI to use both CPU and GPU, and you can specify exactly which model goes where and how many layers get offloaded.
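The split looks something like this in the model YAML (the layer and thread counts are made up - tune them to your model size and VRAM):

```yaml
# Offload the first N transformer layers to the GPU, run the rest on the CPU.
# Raise gpu_layers until VRAM is full, then give the leftover layers threads.
name: big-model
backend: llama-cpp
parameters:
  model: some-70b.Q4_K_M.gguf   # placeholder filename
gpu_layers: 40                  # layers offloaded to the GPU
threads: 32                     # CPU threads for the remaining layers
f16: true
```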
This way, you can allocate several models with some kind of priority: keep the embeddings model, Whisper and other tiny things loaded all the time, while bigger models get swapped in and out depending on which Pipeline you end up running. :)
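The swapping part is basically LocalAI's watchdog unloading idle backends - small models reload fast enough that it works out in practice. A rough docker-compose sketch (the env var names are from memory, so double-check them against the LocalAI docs before relying on them):

```yaml
# docker-compose.yaml (sketch) - models dir holds one YAML per model;
# the watchdog frees memory by unloading backends that sit idle too long,
# so big models make room while tiny ones come back quickly on demand.
services:
  localai:
    image: quay.io/go-skynet/local-ai:latest
    environment:
      - LOCALAI_WATCHDOG_IDLE=true         # enable idle-unloading (name assumed)
      - LOCALAI_WATCHDOG_IDLE_TIMEOUT=15m  # unload after 15 min without requests
    volumes:
      - ./models:/models                   # the YAML specs from above live here
    ports:
      - "8080:8080"
```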