

Is it necessary to use Robots.txt file?

The only reason to have a robots.txt file is to guide web crawlers that actually use it. For example, if you don't want part of your site to be crawled (& by extension indexed) by Google, say a subdirectory of /images, then this is the place to do it. Also, if a specific spider is choking your server, you can request that the bot throttle its number of requests per second.

& while that's all well & good, this again is only for bots that use it. Malicious bots are not bound by any technology to use it, & can choose to ignore it.

So while you don't have to use it, it's a simple thing to set up that can only help you.
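A rough sketch of the two uses mentioned above (blocking a subdirectory and asking one bot to slow down), checked with Python's standard `urllib.robotparser`. The paths, the `example.com` domain and the bot name `HungryBot` are made up for illustration, and not every crawler honors `Crawl-delay`, so treat it as a request rather than a guarantee.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules: the paths and the bot name are illustrative only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /images/private/

User-agent: HungryBot
Crawl-delay: 10
Disallow: /images/private/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# A compliant crawler asks before fetching each URL.
private_ok = parser.can_fetch("Googlebot", "https://example.com/images/private/a.png")
public_ok = parser.can_fetch("Googlebot", "https://example.com/images/logo.png")

# Seconds the named bot is asked to wait between requests.
delay = parser.crawl_delay("HungryBot")

print(private_ok, public_ok, delay)
```

Nothing here stops a bot that never runs a check like this, which is the point made above about malicious crawlers.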
 
First you have to understand what robots.txt is. Robots.txt is a file that states crawling permissions.
In it you choose which pages or posts search engines may access and which are off limits. If you don't use robots.txt, search engines crawl your whole site by default and collect everything they find for indexing, which is not ideal. Try using robots.txt for better optimization.
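To illustrate the "crawl everything by default" behavior, here is a minimal sketch using Python's standard `urllib.robotparser`. It only shows how that one parser treats an empty rule set; real search engines run their own implementations. The helper name `allowed`, the bot name `ExampleBot` and the URLs are assumptions for the example.

```python
from urllib.robotparser import RobotFileParser

def allowed(robots_lines, agent, url):
    """Return True if `agent` may fetch `url` under the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(agent, url)

# No rules at all: this parser treats every URL as crawlable.
no_rules = allowed([], "ExampleBot", "https://example.com/drafts/post.html")

# One Disallow rule: the same URL is now off limits.
with_rules = allowed(
    ["User-agent: *", "Disallow: /drafts/"],
    "ExampleBot",
    "https://example.com/drafts/post.html",
)

print(no_rules, with_rules)
```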
 
Yes! Strictly speaking, the major search engines treat a missing robots.txt (a plain 404) as permission to crawl everything, but an explicit file removes any doubt.
Googlebot may also hold off crawling if the request for robots.txt returns a server error.


On the other hand, many tools and scrapers posing as search engines disregard the robots.txt directions; only Googlebot and Bingbot seem to respect the robots.txt directives. Other bots pretend they do, then come in and attempt to index with unmarked or faked user-agent signatures. Baidu is famous for this.


Takes 2 min ...

Graybeard, Feb 20, 2018
 
Yes, the robots.txt file matters for SEO. It tells bots which parts of your website they may crawl. If the file is missing, bots can still fetch your website; they simply assume everything is allowed.

The robots.txt syntax is really very simple and works well for most websites. See the format of the file below:

User-agent: *
Disallow: /filename/
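A quick way to see what that two-line file blocks is to feed it to Python's standard `urllib.robotparser`. This is only a sketch of how that particular parser reads the rule; the `example.com` URLs and the bot name `AnyBot` are placeholders.

```python
from urllib.robotparser import RobotFileParser

parser = RobotFileParser()
parser.parse(["User-agent: *", "Disallow: /filename/"])

# Hypothetical paths to test against the rule.
paths = ["/filename/", "/filename/page.html", "/filename", "/other/page.html"]

# Map each path to whether a compliant bot may fetch it.
results = {
    path: parser.can_fetch("AnyBot", "https://example.com" + path)
    for path in paths
}

for path, ok in results.items():
    print(path, "allowed" if ok else "blocked")
```

With this parser the rule is a plain prefix match, so everything under /filename/ is blocked while /filename without the trailing slash is still allowed.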


 
A robots.txt file is not necessary for every website. If you want to block specific pages or a specific directory, then you should put a robots.txt file in the root of your server.
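Since crawlers only look for the file at the root of each host, a small sketch like the following shows where it has to live for any given page. The function name `robots_url` and the URLs are assumptions for illustration.

```python
from urllib.parse import urlsplit, urlunsplit

def robots_url(page_url):
    """Build the robots.txt location for the host that serves `page_url`."""
    parts = urlsplit(page_url)
    # Keep the scheme and host, replace the path, drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))

main_site = robots_url("https://example.com/blog/post?id=1")
shop_site = robots_url("https://shop.example.com/cart")

print(main_site)
print(shop_site)
```

Each subdomain gets its own file, so rules placed on example.com do not cover shop.example.com.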
 