Exploit a memory safety issue in the tokenizer or other parts of your LLM infra ...

moralestapia · 2024-04-10T11:57:46 1712750266

??? With weights?

fzzzy · 2024-04-10T23:55:19 1712793319

There was a buffer overflow or some similar memory-safety bug in llama.cpp's handling of the GGUF format. It has been fixed now, but that kind of exploit is definitely possible. Also, weights distributed as Python pickles can run arbitrary code when they are loaded.
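To make the pickle point concrete, here is a minimal sketch using only the standard library. The class names `FakeWeights` and `NoGlobalsUnpickler` are made up for illustration, and the payload is deliberately harmless (it evaluates `6 * 7`); a real attack would put something nastier in that string.

```python
import io
import pickle


class FakeWeights:
    """Hypothetical stand-in for a 'weights' object inside a pickle file."""

    def __reduce__(self):
        # pickle stores "call this callable with these args" and replays it
        # at load time. Here the callable is the builtin eval().
        return (eval, ("6 * 7",))


blob = pickle.dumps(FakeWeights())

# Loading the bytes calls eval("6 * 7"); no FakeWeights instance comes back.
loaded = pickle.loads(blob)


class NoGlobalsUnpickler(pickle.Unpickler):
    """Refuses to resolve any global, so no callable can be looked up."""

    def find_class(self, module, name):
        raise pickle.UnpicklingError(f"blocked global: {module}.{name}")


try:
    NoGlobalsUnpickler(io.BytesIO(blob)).load()
    blocked = False
except pickle.UnpicklingError:
    blocked = True
```

Overriding `find_class` is the approach the Python docs describe for restricting globals. Refusing every global like this is too strict for real model checkpoints, so treat it as an illustration of why a data-only format is the safer default, not as a drop-in fix.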

bevekspldnw · 2024-04-11T02:33:47 1712802827

Distributing anything as python pickles seems utterly batshit to me.

fzzzy · 2024-04-11T10:46:23 1712832383

Completely agree.

abound · 2024-04-10T13:10:41 1712754641

There are plenty of exploits where the payload is just "data" read by some vulnerable program (PDF readers, image viewers, browsers, compression tools, messaging apps, etc.).

sp332 · 2024-04-10T22:18:12 1712787492

Yes, there's a reason weights are now distributed as "safetensors" files. Malicious weights files in the old formats are possible, and while I haven't seen evidence of the new format being exploitable, I wouldn't be surprised if someone figures out how to do it eventually.
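For a rough idea of why safetensors is harder to abuse: the file is an 8-byte little-endian header length, a JSON header, and then raw tensor bytes, so loading it means parsing data rather than running code. Below is a simplified sketch of a header check using only the standard library. The function name `parse_header` and the `max_header_bytes` limit are my own assumptions for illustration, not the real library's API, and the real parser does more validation than this.

```python
import json
import struct


def parse_header(blob: bytes, max_header_bytes: int = 1_000_000) -> dict:
    """Parse and sanity-check a safetensors-style header (illustrative only)."""
    if len(blob) < 8:
        raise ValueError("file too short for the length prefix")
    # First 8 bytes: unsigned little-endian 64-bit size of the JSON header.
    (header_len,) = struct.unpack("<Q", blob[:8])
    if header_len > max_header_bytes or 8 + header_len > len(blob):
        raise ValueError("header length is out of bounds")
    header = json.loads(blob[8 : 8 + header_len].decode("utf-8"))
    data_len = len(blob) - 8 - header_len
    for name, info in header.items():
        if name == "__metadata__":  # optional free-form string metadata
            continue
        start, end = info["data_offsets"]  # relative to the data section
        if not (0 <= start <= end <= data_len):
            raise ValueError(f"tensor {name!r} points outside the file")
    return header


# Build a tiny in-memory file: one float32 tensor with two elements.
header_json = json.dumps(
    {"w": {"dtype": "F32", "shape": [2], "data_offsets": [0, 8]}}
).encode("utf-8")
good = struct.pack("<Q", len(header_json)) + header_json + bytes(8)
parsed = parse_header(good)

# A file that lies about its header size is rejected instead of trusted.
bad = struct.pack("<Q", 2**40) + header_json
try:
    parse_header(bad)
    rejected = False
except ValueError:
    rejected = True
```

The bounds checks are the part that matters here. A parser that trusted the length prefix or the offsets would be exactly the kind of memory-safety bug discussed upthread, which is why a data-only format reduces the attack surface but does not remove it.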