Hacker Timesnew | past | comments | ask | show | jobs | submit | orphea's commentslogin

No, they don't mean "you should log off everywhere" literally; rather, "don't open Teams/Slack/${our_corporate_chat_software}".

You're a good person.

My manager doesn't stop overworking. When told on peer performance review that we have people who are consistently overwork because they are swamped, he played it down.

But hey, at least he doesn't encourage overworking either.


And they don't need castrated Fable for that.

  Why are people still upvoting obvious AI slop garbage?
Because it looks good enough to "pass". People are consuming low quality content all the time.

  It’s literally impossible to catch Pac-Man?
Not true.

  You do not move fast enough
Not true.

  the Pac-Man AI is programmed for perfection
So is it slop or programmed for perfection, which one is it? :)

> So is it slop or programmed for perfection, which one is it?

I’m not going to comment on the code (I abandoned the page early due to the terrible controls) but there’s no contradiction in that part you’re commenting on. “Slop” doesn’t just mean “doesn’t work”; “programmed for perfection so it does not make deliberate mistakes” doesn’t mean “the code is perfect”.

To output text in a terminal you could use `echo`, but you could also write a 500 line function which does a bunch of unrelated garbage then finally outputs the text perfectly (i.e. no mistakes). That doesn’t mean the code is good or even acceptable or desirable, even if the outcome is technically correct.


Fair enough!

> So is it slop or programmed for perfection, which one is it? :)

I don’t think you understand. In games it’s not good to program an AI to be a perfect actor, because the difficulty becomes too insane for a human player. You want an AI that deliberately makes mistakes or suboptimal choices sometimes, and where its difficulty can be scaled. Being programmed for perfection is not a compliment.


  Anthropic was founded by people who we know were worried about AI safety
Lol. It was founded by people who were saying they were worried. I'm sorry you fell for it.

Anthropic is just another company of, in my opinion, money-hungry sociopaths; they are not that different from the OpenAI bros.

So yeah, play stupid games - win stupid prizes.


Executive staff seems money-hungry for sure (note the lack of non profit that OpenAI has)

I would say they have researchers with self-important god complexes that makes them think they know better than everyone else.


Read about the LTBT/PBC structure. Anthropic is not accountable to its investors.

If they were money hungry they wouldn't have fought the DOW. Everyone knows that's a retarded thing for a business to do.


Not necessarily. You know what your competitors are likely to do and you want to position yourself differently

> If they were money hungry they wouldn't have fought the DOW

I think it could be reputation management exercises. Especially how it was aligned with airstrike on Iranian girls elementary school and statements that Claude were picking targets.


The actual guardrail should be getting materials being difficult. The information is already out there in the internet. If an LLM knows how to make a bomb or whatever, why do you think it knows?

The material for doing harm is just a computer with access to an LLM and the Internet.

Okay why don't we restrict access to LLMs and internet, then?

We already do, in the form of guardrails, as this article touches on.

https://venturebeat.com/technology/anthropic-ceo-calls-for-f...


If that’s true, then where is it? Post a link, or YouTube video.

https://archive.org/details/ExplosivesEngineeringPaulW.Coope...

(30 seconds of googling.)

Or perhaps you meant Q clearance nuke stuff? That would be QUITE a bit harder to find and illegal to share. But it’s lack of availability is hardly a counterpoint to the comment you were replying to.


Ignoring comments is not a solution because the texts can be put in random strings among the actual code.

And really all it takes is one keyword such as “nuke”.

Nuke is probably too generic but I wouldn't put it past an LLM to get thrown away by that. A safer showstopper probably would be to export symbols like uf6_enrichment_loop and refer to your C&C server as a nuclear reactor controller.

https://www.youtube.com/watch?v=Gbgk8d3Y1Q4

On a second thought, probably better to act like it is a tool for "frontier LLM research". Export symbols like "mythos_distillation_subroutine".


Haha now I’m picturing obfuscation where instead of 0x everything is a scary word.

I'm not a native speaker but I unironically use "nuke" as "delete the whole repo/huge chunk of a project".

Cambridge dictionary seem to agree:

nuke - to destroy or get rid of something completely


This triggered Opus 4.8 the other day for me. Said “nuke that folder” and it said I was violating TOS.

How do you know if this something is done?

If you do the thing yourself, you know your knowledge limits, you know where the thing lacks. With LLMs, you don't. Maybe it works, maybe it doesn't. You have no idea.


That is a good question.

In structural engineering, there probably is no risk tolerance.

In the OP's network or port scan? Perhaps you can get away with verifying a few of the results to get an idea about whether it worked as expected.

I use AI mostly on mobile app side projects, and there QA testing on phone and tablet tells me whether a feature works or not.


So, security (safety) through obscurity?

The phrase "security through obscurity" isn't an argument against all information restriction.

It doesn't imply we should, for example, publish step-by-step instructions for making widespread death easier.


Another „great filter“: How to handle dagerous information?

The argument against security through obscurity isn't that it doesn't work at all. It does to a degree, only it is not as strong as people think.

An example from the meat world: not publishing your vacation dates well in advance for the world to see somewhat reduces your chance of being burglarized. That is security by obscurity; not reliable, but not completely inefficient either.

But if you live in a fortress (security by key material), you can well declare your vacation dates without running the risk.


maybe this is the way forward. Imagine how many tokens one could burn given C suite salaries.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: