More

mt_ · 2026-05-27T16:53:15 1779900795

As someone who has been driving DDG for the past 6 years, i have switched to Google back due to the new AI mode,, its such a nice quick way to check information and validate ideas.. no friction included.

mt_ · 2026-05-12T10:46:18 1778582778

It is

mt_ · 2026-05-11T19:26:06 1778527566

5 million for a llama-2 finetune, how is that impressive?

mt_ · 2026-05-10T15:34:58 1778427298

The well architected frameworks tells you to have separate accounts, your fault that you "tested" in a production environment. https://imgur.com/a/Smal9fL

mt_ · 2026-04-15T17:34:15 1776274455

Is there a open source mini robot kit that allows me to play-around with agentic robots?

fennecfoxy · 2026-04-16T10:04:08 1776333848

Waveshare has some pretty rad ones. I've been tempted to use one of their rovers for something.

Be careful, because you can easily overpay out the ass for "robot kits" online.

moffkalast · 2026-04-15T17:41:46 1776274906

SO-ARM101 I guess? Or more likely the Lekivi variant.

colinator · 2026-04-15T18:39:55 1776278395

I was also just in the market for a small experiment robot. I got the hiwonder armpi-fpv. Avoid it, the actuators are pretty bad - they're very 'grindy', the robot jitters like crazy when it moves. Any such problems with the lekivi?

moffkalast · 2026-04-16T08:43:08 1776328988

Hmm I've never used the hiwonder servos so I'm not sure how they compare with the feetech/waveshare STS type, but these have been surprisignly good overall. There is still considerable backlash which results in a cumulative 1-2cm of gripper translational error when accumulated along the arm, but the control is really stable. I don't think there's any jitter at all. They are a bit loud when moving at max speed, but there is also a STS3250 brushless variant that's stronger and really quiet. Expensive though.

I haven't tested the Lekivi specifically, but lots of SO-ARMs and a custom built lekivi-like robot. I think some people have had some issues with the rear omni wheel when moving forward but I haven't seen that myself.

mt_ · 2026-04-09T12:41:12 1775738472

Exactly like human input to output.

WarmWash · 2026-04-09T14:00:38 1775743238

We just need to figure out the qualia of pain and suffering so we can properly bound desired and undesired behaviors.

ACCount37 · 2026-04-09T15:15:19 1775747719

Ah, the Torment Nexus approach to AI development.

marcta · 2026-04-10T07:59:07 1775807947

This is Mr Meeseeks.

BoneShard · 2026-04-09T14:51:48 1775746308

this is probably the shortest way to AGI.

codebje · 2026-04-09T12:54:53 1775739293

Well no, nothing like that, because customers and bosses are clearly different forms of interaction.

vidarh · 2026-04-09T13:25:41 1775741141

Just like that, in that that separation is internally enforced, by peoples interpretation and understanding, rather than externally enforced in ways that makes it impossible for you to, e.g. believe the e-mail from an unknown address that claims to be from your boss, or be talked into bypassing rules for a customer that is very convincing.

codebje · 2026-04-09T13:30:10 1775741410

Being fooled into thinking data is instruction isn't the same as being unable to distinguish them in the first place, and being coerced or convinced to bypass rules that are still known to be rules I think remains uniquely human.

TeMPOraL · 2026-04-09T13:37:47 1775741867

> and being coerced or convinced to bypass rules that are still known to be rules I think remains uniquely human.

This is literally what "prompt injection" is. The sooner people understand this, the sooner they'll stop wasting time trying to fix a "bug" that's actually the flip side of the very reason they're using LLMs in the first place.

codebje · 2026-04-10T02:01:02 1775786462

Prompt injection is just setting rules in the same place and way other rules are set. The LLM doesn't know the rules being given are wrong, because they come through the same channel. One set of rules exhorts the LLM to ignore the other set - and vice versa. It's more akin to having two bosses than having customers and a boss.

This is not because LLMs make the same mistakes humans do, which (AFAICT anyway) was the gist of the argument to which I replied. LLMs are not humans. They are not sentient. They are not out-smarted by prompt injection attacks, or tricked, or intimidated, or bribed. One shouldn't excuse this vulnerability by claiming humans make the same mistakes.

TeMPOraL · 2026-04-10T14:09:00 1775830140

The same place you're looking for exists deep inside the neural network, where everything mixes together to influence everything else, and no such separation is possible, or desired. Prompt injection isn't about where, it's about what. I stand by what I said: it's the same failure mode as humans have, and happens for the same reasons. Those reasons are fundamental to a general purpose system and have nothing to do with sentience, they're just what happens when you want your system to handle unbounded complexity of the real world.

vidarh · 2026-04-09T13:57:39 1775743059

This makes no sense to me. Being fooled into thinking data is instruction is exactly evidence of an inability to reliably distinguish them.

And being coerced or convinced to bypass rules is exactly what prompt injection is, and very much not uniquely human any more.

kg · 2026-04-09T14:12:05 1775743925

The email from your boss and the email from a sender masquerading as your boss are both coming through the same channel in the same format with the same presentation, which is why the attack works. Unless you were both faceblind and bad at recognizing voices, the same attack wouldn't work in-person, you'd know the attacker wasn't your boss. Many defense mechanisms used in corporate email environments are built around making sure the email from your boss looks meaningfully different in order to establish that data vs instruction separation. (There are social engineering attacks that would work in-person though, but I don't think it's right to equate those to LLM attacks.)

Prompt injection is just exploiting the lack of separation, it's not 'coercion' or 'convincing'. Though you could argue that things like jailbreaking are closer to coercion, I'm not convinced that a statistical token predictor can be coerced to do anything.

vidarh · 2026-04-09T14:23:07 1775744587

> The email from your boss and the email from a sender masquerading as your boss are both coming through the same channel in the same format with the same presentation, which is why the attack works.

Yes, that is exactly the point.

> Unless you were both faceblind and bad at recognizing voices, the same attack wouldn't work in-person, you'd know the attacker wasn't your boss.

Irrelevant, as other attacks works then. E.g. it is never a given that your bosses instructions are consistent with the terms of your employment, for example.

> Prompt injection is just exploiting the lack of separation, it's not 'coercion' or 'convincing'. Though you could argue that things like jailbreaking are closer to coercion, I'm not convinced that a statistical token predictor can be coerced to do anything.

It is very much "convincing", yes. The ability to convince an LLM is what creates the effective lack of separation. Without that, just using "magic" values and a system prompt telling it to ignore everything inside would create separation. But because text anywhere in context can convince the LLM to disregard previous rules, there is no separation.

PunchyHamster · 2026-04-09T14:39:46 1775745586

the second leads to first, in case you still don't realize

jodrellblank · 2026-04-09T13:48:28 1775742508

If they were 'clearly different' we would not have the concept of the CEO fraud attack:

https://www.barclayscorporate.com/insights/fraud-protection/...

That's an attack because trusted and untrusted input goes through the same human brain input pathways, which can't always tell them apart.

runarberg · 2026-04-09T14:05:12 1775743512

Your parent made no claim about all swans being white. So finding a black swan has no effect on their argument.

jodrellblank · 2026-04-09T15:52:02 1775749922

My parent made a claim that humans have separate pathways for data and instructions and cannot mix them up like LLMs do. Showing that we don't has every effect on refuting their argument.

>>> The principal security problem of LLMs is that there is no architectural boundary between data and control paths.

>> Exactly like human input to output.

> no nothing like that

but actually yes, exactly like that.

orbital-decay · 2026-04-09T14:28:13 1775744893

These are different "agents" in LLM terms, they have separate contexts and separate training

j45 · 2026-04-09T13:07:11 1775740031

There can be outliers, maybe not as frequent :)

mt_ · 2026-04-06T13:45:10 1775483110

https://github.com/manueltarouca/BlueHammer/blob/claude%2Fex... As there is no info on the blogpost.

mt_ · 2026-03-03T20:43:43 1772570623

I call them, entropy reducers.

mt_ · 2026-01-22T16:17:20 1769098640

It would be ironic if the very detection of hallucinations contained hallucinations of its own.

mt_ · 2026-01-02T20:43:00 1767386580

Discipline yourself before buying a new device.

bicx · 2026-01-02T21:06:07 1767387967

How many lashes must I give myself before I buy this phone?