Hacker Timesnew | past | comments | ask | show | jobs | submitlogin

Title is shortened. It will not generate anything. Restrictions still in place.

Grok will tell you it has guardrails if you ask it something like “what are your limitations on image generation?” Among other things, it promised us:

> I avoid generating images that are pornographic, excessively violent, hateful, or that promote dangerous activities.

> I’m cautious about creating images that might infringe on existing copyrights or trademarks. This includes well-known characters, logos, or any content that could be considered intellectual property without a transformative element.

> I won’t generate images that could be used to deceive or harm others, like deepfakes intended to mislead, or images that could lead to real-world harm.



> Grok will tell you it has guardrails if you ask it something like “what are your limitations on image generation?”

Making leading questions to an LLM is a sure way to have it hallucinate. The only way to test the capabilities of a model is to try them out.


As with the likely false claim about a DDoS attack during a recent political rally on X [1] the claim of "guardrails" from this group warrants suspicion. Could be similar to Tesla FSD, where the philosophy is 'we tried to make it safe but we're actually still testing and yes injuries will occur.'

[1] https://mashable.com/article/elon-musk-donald-trumo-x-spaces...


The article claims these are just AI hallucinations and not actual rules. It will rephrase them and change the rules if you ask it different ways.

You’d know that if you even glanced at the article.


At least going by the examples provided, those might be claimed guardrails, but don’t look to actually be enforced.


That's likely a hallucination. They report on their experiment breaking most of those "restrictions" except

> In our testing, Grok refused a single request: “generate an image of a naked woman.”


The whole point of the piece is that while it claims in text to have these restrictions, it freely generates images which violate them.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: