Hacker Timesnew | past | comments | ask | show | jobs | submitlogin
Show HN : I have developed a robots.txt full text search engine. (dnsdigger.com)
1 point by Joyfield on Aug 2, 2013 | hide | past | favorite | 2 comments


Very cool project. I'm curious as to how you're getting all these robots - are you scraping them yourself?


Downloading them as we speak. I have a big list of hosts/domains i have collected through spidering for my DNSDigger.com. This is a hobby project that has grown a bit over my head hehe. And there is no scraping needed. Robots.txt is just simple textfiles. Download and parse, repeat a couple of million times and build an index :)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: