For background about my site http://www.classhunt.com, read this: https://qht.co/item?id=600478
Since I posted that, I had a meeting with the Vice Chancellor of IT. At the end, he basically said there was nothing he could do to keep my site functioning. I told him I would keep finding ways around their blocks, even though I don't enjoy playing cat and mouse.
Also, the site has become a huge hit among incoming freshmen. Back when I posted about it here, we had a few hundred users. Today we have a few thousand students relying on it. This has furthered my motivation to keep the site alive.
Here's the deal: the Python bot that scrapes the latest scheduling data from the uni's server keeps getting blocked by the uni's IDS. I've added random delays and intervals but haven't had much luck. After a day or so, the IDS removes the block, only to put it back later.
The IDS blocks my personal uni account, not the IP. When I first made the app, the data was public. But then the uni, in an attempt to block the site, made the data non-public. So I had to modify the POST string and log in with my uni username/password to retrieve the data.
What I am doing right now:
- randomizing the number of threads between 2 and 4
- randomizing the pause between new requests between 1 and 3 seconds
- randomizing how often my bot runs, between every 3 and every 7 minutes
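For reference, those settings boil down to something like the sketch below. This is not the actual bot code: the constant and function names are made up for illustration, I'm assuming the values are drawn uniformly, and nothing here touches the network.

```python
import random

# Ranges taken from the list above; all names are hypothetical.
THREADS_RANGE = (2, 4)            # worker threads per run (inclusive)
PAUSE_RANGE_S = (1.0, 3.0)        # seconds a thread waits between requests
RUN_INTERVAL_MIN = (3.0, 7.0)     # minutes between bot runs


def pick_schedule(rng=random):
    """Draw one set of randomized settings for the next run."""
    return {
        "threads": rng.randint(*THREADS_RANGE),               # 2, 3 or 4
        "pause_s": rng.uniform(*PAUSE_RANGE_S),                # 1.0 to 3.0 s
        "next_run_s": rng.uniform(*RUN_INTERVAL_MIN) * 60.0,   # 180 to 420 s
    }


if __name__ == "__main__":
    print(pick_schedule())
```

Ignoring request latency, that works out to somewhere between roughly 0.7 requests per second (2 threads pausing 3 seconds each) and 4 requests per second (4 threads pausing 1 second each) while a run is in progress.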
This is what I get when the IDS blocks it:
To protect the univeristy from a denial of service attack, script blockers have been enabled.
Your transaction has been blocked besause you have tried to login too many times in a short period of time.
Please wait and try again soon.
How do I fool this sucker?