“888 KiB Assistant” but the assistant itself is a multi terabyte rental-only mod...

seertaak · 2026-03-02T23:53:11 1772495591

The whole point is that this fits on an ESP32, which has Wi-Fi. We're not quite at the point where it makes sense to run the whole thing locally: if you do try it, it will need a fan, be loud, etc.

For my part, I installed Nanoclaw on my Arch-derived OS (I love Arch!), and it worked fine until the next day, when some update reverted the power management settings, and now my glorious assistant is dead.

There's something to be said for a barebones OS. No bullshit, no updates.

Also, playing with hardware watchdog timers and GPIOs and DACs can be so much fun.

amelius · 2026-03-02T17:22:18 1772472138

I'm getting "serverless" flashbacks.

pgt · 2026-03-03T12:39:33 1772541573

modelless

stuaxo · 2026-03-03T12:52:42 1772542362

This thread reminds me of how Java's heavy GUI, written in Java itself, was called "lightweight" when in fact it did not feel lightweight at all on the hardware of the time.

kristianpaul · 2026-03-02T18:42:56 1772476976

My model is at home... just 16GB, which is still a lot, but just FYI.

Rebelgecko · 2026-03-02T18:28:40 1772476120

It seems to support connecting to your own LLM on the same LAN

croes · 2026-03-02T20:14:33 1772482473

The point is the agent is still the LLM. No LLM, no agent.

otterley · 2026-03-03T03:58:06 1772510286

LLMs are not agents. LLMs are language models that simply respond to a text prompt with a textual response. Agents are middleware that take input from the user and then use LLMs to drive tools.
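
To make the distinction concrete, here is a minimal sketch in Python. Everything in it is made up for illustration: `fake_llm` is a canned stand-in for a real model call, and the `TOOL:` / `FINAL:` reply convention, the `add` tool, and `run_agent` are hypothetical names, not any real agent's protocol. The model only maps text to text; the loop around it is what actually runs tools.

```python
from typing import Callable, Dict, List


def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for a model: text in, text out, nothing else."""
    if "RESULT:" in prompt:
        # A tool result is already in the prompt, so produce a final answer.
        return "FINAL: the sum is " + prompt.rsplit("RESULT:", 1)[1].strip()
    # Otherwise ask the middleware to run a tool on the model's behalf.
    return "TOOL: add 2 3"


def add(args: List[str]) -> str:
    """Hypothetical tool: add integer arguments."""
    return str(sum(int(a) for a in args))


TOOLS: Dict[str, Callable[[List[str]], str]] = {"add": add}


def run_agent(user_input: str, llm: Callable[[str], str], max_steps: int = 5) -> str:
    """The 'agent': middleware that loops between the LLM and the tools."""
    prompt = user_input
    for _ in range(max_steps):
        reply = llm(prompt)
        if reply.startswith("FINAL:"):
            return reply[len("FINAL:"):].strip()
        if reply.startswith("TOOL:"):
            name, *args = reply[len("TOOL:"):].split()
            result = TOOLS[name](args)  # the loop, not the model, executes this
            prompt = f"{prompt}\nRESULT: {result}"
    return "gave up"


print(run_agent("What is 2 + 3?", fake_llm))
```

Swap `fake_llm` for a real model call and the loop stays the same; take the loop away and the model's "TOOL: add 2 3" is just a string nobody acts on.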

croes · 2026-03-03T06:13:47 1772518427

They are just a to-do list. The real work is done by the LLMs.

otterley · 2026-03-03T06:26:40 1772519200

An LLM has no motive power, like a script without a cast, or a program without a computer to execute it.

dheera · 2026-03-02T19:58:05 1772481485

I tried connecting OpenClaw to ollama with a V100 running qwen3.5:35b but it was really, really, really slow (despite ollama itself feeling fairly fast).

These "claw" agents really multiply the tokens used by an obscenely huge factor for the same request.
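
A rough back-of-the-envelope sketch of how that can happen, assuming the agent resends a fixed context (system prompt plus tool definitions) and the growing step history on every call. That mechanism and all the numbers below are illustrative assumptions, not measurements of OpenClaw.

```python
def total_prompt_tokens(base_context: int, user_tokens: int,
                        step_tokens: int, steps: int) -> int:
    """Sum the prompt tokens across all LLM calls in one agent run.

    Assumes each call resends the fixed context, the user request,
    and the history accumulated by earlier steps.
    """
    total = 0
    history = 0
    for _ in range(steps):
        total += base_context + user_tokens + history
        history += step_tokens  # each step adds tool output to the history
    return total


# Hypothetical numbers: 4000-token fixed context, 50-token request,
# 500 tokens of tool output per step, 10 steps.
agent_tokens = total_prompt_tokens(4000, 50, 500, 10)
direct_tokens = 50  # the same request sent straight to the model
print(agent_tokens, agent_tokens // direct_tokens)
```

Under those made-up numbers the agent pushes 63000 prompt tokens through the model for a 50-token request, which on a slow prompt-processing setup would feel a lot worse than chatting with the model directly.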

jcgrillo · 2026-03-02T22:49:23 1772491763

i recently decided to get into this ocean boiling game too, the 32GB V100 seems like a pretty good VRAM/$. if i may ask, do you make any special accommodations for cooling? i've never dealt with a passively cooled card before and i'm curious whether my workstation fans (HP Z840) will be sufficient. i'm going to try 2 cards at first but i think i might be able to squeeze a third in there

dheera · 2026-03-02T23:21:56 1772493716

No. I have a Titan V CEO edition, which is basically a 32GB V100 but has full active fan cooling which I'm finding works just fine.

jcgrillo · 2026-03-02T23:42:37 1772494957

Oh very cool. Some folks are printing shrouds for dual 40mm fans so I'll probably try that if the stock case fans don't do it