What's up, selfhosters? It's selfhosting Sunday again!

tofu@lemmy.nocturnal.garden · 1 day ago

What's up, selfhosters? It's selfhosting Sunday again!

danhab99@programming.dev · 16 hours ago

I’m working on self-hosting my own LLMs.

I realized there are things I wanna talk about and research but I don’t want to send it to open AI. Frankly I feel gross about how much I’ve sent to open AI. My desktop is a beefy gaming rig that I don’t use for gaming much. I have a 20thread core, 64gb ram, an Nvidia gtx 3060 and 5 spare TB so why not.

I keep a few ollama models downloaded and I’m slowly getting to know them and what they can do. Gemma seems to answer the fastest so I’ve been using that. Deepseek is like the reasoning button on chatgpt.
I use openai-whisper to transcribe meetings I record using OBS. It’s really slow so I have a cronjob transcribe all my meetings for that day overnight.
Open Web UI is a fantastic LLM frontend. It provides tools, rags, web searching, and model ranking all as a simple to use UI.
My desktop has a Wireguard server which makes it easy to use my OpenWebUI on my phone.

Now I want to work on giving the LLM access to my Google calendar so it can create reminders for me. I’m sick of forgetting to think about remembering to do things so I hope if I can just ramble at the LLM about what I’m doing or what’s on my mind it can organize my thoughts. What else are these LLM actually for?