Title is shortened. It will not generate anything. Restrictions still in place. ...

debugnik · on Aug 14, 2024

> Grok will tell you it has guardrails if you ask it something like “what are your limitations on image generation?”

Making leading questions to an LLM is a sure way to have it hallucinate. The only way to test the capabilities of a model is to try them out.

choppaface · on Aug 14, 2024

As with the likely false claim about a DDoS attack during a recent political rally on X [1] the claim of "guardrails" from this group warrants suspicion. Could be similar to Tesla FSD, where the philosophy is 'we tried to make it safe but we're actually still testing and yes injuries will occur.'

[1] https://mashable.com/article/elon-musk-donald-trumo-x-spaces...

padjo · on Aug 14, 2024

The article claims these are just AI hallucinations and not actual rules. It will rephrase them and change the rules if you ask it different ways.

You’d know that if you even glanced at the article.

mcphage · on Aug 14, 2024

At least going by the examples provided, those might be claimed guardrails, but don’t look to actually be enforced.

klyrs · on Aug 14, 2024

That's likely a hallucination. They report on their experiment breaking most of those "restrictions" except

> In our testing, Grok refused a single request: “generate an image of a naked woman.”

rst · on Aug 14, 2024

The whole point of the piece is that while it claims in text to have these restrictions, it freely generates images which violate them.