What’s up, what’s down and what are you not sure about?

Let us know what you set up lately, what kind of problems you’re currently thinking about or running into, what new device you added to your homelab, or what interesting service or article you found.

I finally finished the first iteration of my Minilab, including a very smooth migration from the old server yesterday, so I can get back to the service side of things. I plan to set up some kind of self-hosted VPN for external access to stuff that’s not exposed to the internet; I’ll have to investigate which one.

  • danhab99@programming.dev
    16 hours ago

    I’m working on self-hosting my own LLMs.

    I realized there are things I want to talk about and research, but I don’t want to send them to OpenAI. Frankly, I feel gross about how much I’ve already sent to OpenAI. My desktop is a beefy gaming rig that I don’t use for gaming much. It has a 20-thread CPU, 64 GB of RAM, an Nvidia RTX 3060 and 5 TB of spare storage, so why not.

    • I keep a few Ollama models downloaded and I’m slowly getting to know them and what they can do. Gemma seems to answer the fastest, so I’ve been using that. DeepSeek is like the reasoning button on ChatGPT.
    • I use openai-whisper to transcribe meetings I record using OBS. It’s really slow, so I have a cron job transcribe all my meetings for that day overnight.
    • Open WebUI is a fantastic LLM frontend. It provides tools, RAG, web search, and model ranking, all in a simple-to-use UI.
    • My desktop runs a WireGuard server, which makes it easy to use Open WebUI on my phone.
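
For anyone wanting to copy the overnight transcription idea, here is a minimal sketch of what such a job could look like. This is not the actual script from the comment. The function names, the `.mkv`/`.mp4` extensions, the `.txt` transcript files and the `base` model are all assumptions for illustration. Only `whisper.load_model` and `model.transcribe` come from the openai-whisper package.

```python
from datetime import date, datetime
from pathlib import Path

# Assumption: OBS is set to write .mkv or .mp4 files; adjust as needed.
VIDEO_SUFFIXES = {".mkv", ".mp4"}


def pending_recordings(directory: Path, day: date) -> list[Path]:
    """Return recordings last modified on `day` that have no transcript yet."""
    pending = []
    for path in sorted(directory.iterdir()):
        if path.suffix.lower() not in VIDEO_SUFFIXES:
            continue
        modified = datetime.fromtimestamp(path.stat().st_mtime).date()
        # A sibling .txt file marks a recording as already transcribed.
        if modified == day and not path.with_suffix(".txt").exists():
            pending.append(path)
    return pending


def transcribe_pending(directory: Path, day: date, model_name: str = "base") -> None:
    """Transcribe each pending recording and write the text next to it."""
    todo = pending_recordings(directory, day)
    if not todo:
        return
    # Imported lazily: openai-whisper is heavy and only needed when there is work.
    import whisper

    model = whisper.load_model(model_name)
    for path in todo:
        result = model.transcribe(str(path))
        path.with_suffix(".txt").write_text(result["text"], encoding="utf-8")
```

A cron entry would then run a small script that calls `transcribe_pending(Path("/path/to/recordings"), date.today())`. If the job runs after midnight, pass the previous day instead, since "today" has already rolled over by then.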

    Now I want to work on giving the LLM access to my Google Calendar so it can create reminders for me. I’m sick of forgetting to think about remembering to do things, so I hope that if I can just ramble at the LLM about what I’m doing or what’s on my mind, it can organize my thoughts. What else are these LLMs actually for?
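
One possible starting point for the calendar idea, as a rough sketch only: a helper that turns a title and start time into an event body, plus a thin wrapper around the Google Calendar API v3 `events().insert` call from google-api-python-client. The helper names, the 15-minute default length, the 10-minute popup reminder and the `UTC` default are assumptions. How the LLM ends up calling this (for example as an Open WebUI tool) and how the OAuth credentials are obtained are left out.

```python
from datetime import datetime, timedelta


def reminder_event(title: str, start: datetime, minutes: int = 15,
                   timezone: str = "UTC") -> dict:
    """Build an event resource in the shape the Calendar API v3 expects."""
    end = start + timedelta(minutes=minutes)
    return {
        "summary": title,
        "start": {"dateTime": start.isoformat(), "timeZone": timezone},
        "end": {"dateTime": end.isoformat(), "timeZone": timezone},
        # Assumption: one popup reminder 10 minutes before the event.
        "reminders": {
            "useDefault": False,
            "overrides": [{"method": "popup", "minutes": 10}],
        },
    }


def create_reminder(creds, event: dict) -> dict:
    """Insert the event into the primary calendar; `creds` are OAuth credentials."""
    # Imported lazily so the helper above works without the Google client installed.
    from googleapiclient.discovery import build

    service = build("calendar", "v3", credentials=creds)
    return service.events().insert(calendarId="primary", body=event).execute()
```

Keeping the event-building step separate from the API call means the LLM's output can be checked, or shown for confirmation, before anything is written to the calendar.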