Skip to main content

Robots.txt File

Hi, I have come across a robots.txt file where:

The company is disallowing the Nutch bot.

They are implementing a crawl delay of 10 seconds for AhrefsSiteAudit.

There is a crawl delay of 10 seconds for MJ12bot.

A crawl delay of 1 second is set for the Pinterest bot.

In my understanding, this suggests that the company is aware that these specific bots may be causing server loading issues. If not by Following their logic, it raises the question of why they are not implementing similar crawl delays for other bots, such as Semrush, etc… considering they have chosen to delay the crawl for Ahrefs bot.

And generally what do you think about this kind of robots.txt file. all other things are done correctly, I have not copied the whole file

submitted by /u/Idnemato
[link] [comments]

from Search Engine Optimization: The Latest SEO News https://ift.tt/kTKnlGs

Comments

Popular posts from this blog

Local seo vs. natiowide seo?

I've done SEO for local businesses but I recently got my first client that sells an item nation wide. ​ Any suggestions for doing nationwide SEO? ​ I am used to making geopages for local towns. I was going to do the same with some input from the client about what cities or towns he would like to show up in? submitted by /u/Letmeinterviewyou [link] [comments] from Search Engine Optimization: The Latest SEO News http://bit.ly/2JHy0k0

Clients site has a weird issue with 302 redirects that I haven't seen before.

Site is in Drupal, hosted on Amazon CDN & Cloudflare. So here's a quick breakdown: The site itself works normally. It's a bit dated, but you can click on links and navigate around as you'd expect. Seeing no obvious issues, I run a Screaming Frog crawl to begin my audit. Only 5 pages were picked up by the crawl which was super weird, since all internal links are regular html and there shouldn't be any issues. So I go through the site and manually collect a bunch of URLs, which I submit to SF again as a list. Every single link bar the 5 originally crawled return a 302, with the 'redirect' pointing back to the home page. Except as I said, those pages don't browser redirect. Browser side, they work fine. I guess they redirect the crawl bot though, since the rest of the site is functionally invisible. Other tools I've looked at say that the pages return simultaneous 302 and 200s, which doesn't make too much sense. These 302s are also old enough ...