Hacker Timesnew | past | comments | ask | show | jobs | submit | epa's commentslogin

Exploiters easily get around this. its a small group of people doing all of the abuse.

Amazing


$10,000 is nothing. Should be $200,000+.


Grok fast..


?


Please wait, the ai is typing a response...


Maybe its the secret recession?


What about promoting renewable energy, space exploration, frontier physics and advanced engineering makes you concerned?


Donating to orphanages after committing a genocide resets your karma only in videogames.


What genocide did Musk commit?


I think they were making a (poor) analogy, not literally accusing Musk of committing genocide.


Well, i can't think of a better analogy to say that you can't offset doing bad things by doing good things. The karma system some games use (e.g. Fallout 3 where you can nuke an entire city that puts your karma in negatives and then give fresh water to beggars to reset your karma) was what i was reminded of.


Musk didn't commit any genocide (that i'm aware of) but that wasn't what i wrote. The point of my comment is that you can't offset doing -what some people perceive as- bad things by doing -what some people perceive as- good things later.


So Grok 4 scores 130 but they put Grok midway in the pack at 110. Bias much?


There are two tests and by default it ranks by the score in the "offline test"


Unfortunately we are still in the prompt optimization stage, garbage in garbage out


I hear this repeated so many times I feel like its a narrative pushed by the sellers. Year ago you could ask for glass of wine filled to the brim and you just wouldnt get it. It wasnt garbage in, garbage out, it was sensibility in, garbage out.

The line where chatbots stop being sensible and start outputting garbage is in movement, but slower than avg joe would guess. You only notice it when you get an intuition of the answer before you see it, which requires a lot of experience on range of complexity. Persisten newbies are the best spotters, because they ask obvious basic questions while asking for stuff beyond what geniuses could solve, and only by getting garbage answer and enduring a process of realizing its actually garbage they truly make wider picture of AI than even most powerusers, who tend to have more balanced querries.


Maybe. That could be true.

But doesn’t happen the same with other tools. I’ll give the same exact prompt to all of LLMs I have access to and look at the responses for the best one. Grok is consistently the worst. So if it’s garbage in, garbage out, why are the other ones so much better at dealing with my garbage?


I think it meant in the training stage, not inference.


There are many users in India training these models. There is also a lot more content out there the models are consuming.


And not to forget, many (most?) Indians are bilingual. Multilingual speakers tend to skip languages within conversation if both parties are fluent -> training material includes those switches.


One of the main issues with the margin business model (Profit of 5% of a payment for example) is that fraud is leveraged. This means that when you lose 100% of a transaction due to a chargeback or fraud loss, it takes you 20 non-fraud loss transactions to make up for it. The fraud leverage is a huge issue for platforms like this, and in certain countries half the transactions can be fraudsters.


How is this relevant?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: