Robots.txt

This thread is closed to new posts. You must sign in to post in the forums.
1/29/2009 10:50:55 AM
Total Posts 41

Robots.txt

Joe,

Is there a way to edit the robots.txt file that gets generated for each site? Windows Live Search is giving me an error about a sitemap tag it wants in the robots file.

Thanks
Rob

 

1/29/2009 11:01:08 AM
Total Posts 18439

Re: Robots.txt

Hi Rob,

Robots.txt is just a file to tell crawlers about folders or paths you don't want them to crawl. Not all crawlers are good citizens though and may ignore it.

It's not generated. It exists in the root and applies to all sites within an installation. It can be edited with a text editor, but if you do customize it you should keep a backup somewhere, because the file is included in the release and will be overwritten on upgrades. If you point me to documentation about something that needs to be there for Live Search, then maybe I will include it.

Or maybe you are thinking about siteroot/SiteMap.ashx, which is the generated site map for submitting to Google and other search engines.

Hope it helps,

Joe

1/29/2009 11:03:48 AM
Total Posts 18439

Re: Robots.txt

Just googled and found this:

http://searchengineland.com/live-search-now-supporting-sitemaps-autodiscovery-via-robotstxt-file-11779

Interesting, I'll see what we can do to take advantage of that.

Best,

Joe

 

1/29/2009 11:06:05 AM
Total Posts 41

Re: Robots.txt

No, I'm not confusing it with the sitemap file. I copied the robots.txt file that was in the root into the Live Search robots.txt validator in the webmaster section of their site, and it gave me the error "Warning: 'sitemap' - tag isn't specified." So I googled it and came up with this:

"You're right, the way to avoid seeing this message is to add the string:"

 

Sitemap: http://yourwebsite.com/sitemap.xml

So I thought I'd try to add it in there since Live doesn't seem to be crawling my site.

Rob

 

1/29/2009 11:06:16 AM
Total Posts 18439

Re: Robots.txt

Actually, now after reading it, we can't really do that because it wants an absolute URL, which won't work for multi-site installations. If we could put a relative URL it would work. In a single-site installation you are free to add it, but again you would need to keep a backup of the file.

But auto-discovery of the site map is not needed if you submit the sitemap for each site.
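
For a single-site installation, the manual edit described above could be scripted. The following is only a sketch in Python, not part of mojoPortal: the function names, the example.com URL, and the backup location are all assumptions chosen for illustration. It appends a Sitemap line if that URL is not already listed, then copies the customized file to a backup so it can be restored after an upgrade overwrites it.

```python
import shutil
from pathlib import Path


def add_sitemap_line(robots_text: str, sitemap_url: str) -> str:
    """Return robots_text with a 'Sitemap:' line for sitemap_url appended.

    The text is returned unchanged if the same URL is already listed.
    """
    wanted = sitemap_url.strip()
    for line in robots_text.splitlines():
        field, _, value = line.partition(":")
        if field.strip().lower() == "sitemap" and value.strip() == wanted:
            return robots_text
    body = robots_text.rstrip("\n")
    prefix = body + "\n\n" if body else ""
    return f"{prefix}Sitemap: {wanted}\n"


def update_robots_file(robots_path: Path, sitemap_url: str, backup_path: Path) -> None:
    """Rewrite robots.txt in place, then copy the customized file to backup_path."""
    updated = add_sitemap_line(robots_path.read_text(), sitemap_url)
    robots_path.write_text(updated)
    # Keep a copy of the customized file outside the site folder (hypothetical path).
    shutil.copy2(robots_path, backup_path)
```

A call might look like update_robots_file(Path("robots.txt"), "http://example.com/SiteMap.ashx", Path("robots.custom.bak")), where the URL has to be the absolute address of that one site.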

Best,

Joe

 

1/29/2009 11:09:16 AM
Total Posts 41

Re: Robots.txt

Is the sitemap file not dynamically generated?

1/29/2009 11:11:34 AM
Total Posts 18439

Re: Robots.txt

Yes, it is, but the robots.txt file is not and can't be.

The URL for the site map is always yoursiteroot/SiteMap.ashx

I'm looking into how to submit it to Live Search now. It looks like you have to sign in here:

http://webmaster.live.com/

Best,

Joe

1/29/2009 11:13:06 AM
Total Posts 18439

Re: Robots.txt

Yes, now that I'm signed in I can submit my site, both the root url and the site map url. Doing this precludes any need for autodiscovery via robots.txt.

Best,

Joe

1/29/2009 11:15:34 AM
Total Posts 41

Re: Robots.txt

Ya I was signed in. Wasn't sure if I still needed the robots file as they are still not crawling my site after a month. I'll google some more.

1/29/2009 11:20:18 AM
Total Posts 18439

Re: Robots.txt

They apparently are new to supporting the sitemap protocol. There is a disadvantage with Live Search, because with Google you can submit multiple site maps, not just one. So, for example, the blog feature has its own site map at siteroot/Blog/BlogSiteMap.ashx. If you are using your site primarily as a blog, it may be better to submit that one instead of the main one to Live Search.

Event Calendar Pro also has its own site map for events at /siteroot/Events/EventSiteMap.ashx

WebStore does not yet have its own site map, but I created a plain XML file for my few products and submitted that site map to Google as well.
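
A hand-made site map like that is just a urlset document in the sitemaps.org format. As a rough sketch (Python, standard library only; the build_sitemap name and the product URLs are made up for illustration and are not from WebStore), one could generate it like this:

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemaps.org urlset document (as a string) listing urls."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        # Each <url> entry needs at least a <loc> with the absolute page URL.
        ET.SubElement(entry, "loc").text = url
    declaration = '<?xml version="1.0" encoding="UTF-8"?>\n'
    return declaration + ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical product pages.
    print(build_sitemap([
        "http://example.com/product-a.aspx",
        "http://example.com/product-b.aspx",
    ]))
```

The output can be saved as a static .xml file in the site root and submitted the same way as the generated site maps.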

Hopefully Live Search will get on the ball.

Best,

Joe

1/29/2009 11:26:56 AM
Total Posts 18439

Re: Robots.txt

It seems it is possible to list multiple site maps in the robots.txt file. They need to make it possible to submit multiple site maps through their webmaster tools as well.
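
To illustrate what listing several site maps looks like, here is a small Python sketch that reads the Sitemap lines back out of a robots.txt text with the standard library's urllib.robotparser (the site_maps() method needs Python 3.8 or later). The example.com host and the Disallow path are placeholders, not the contents of the robots.txt that ships with mojoPortal; the .ashx paths are the ones mentioned earlier in this thread.

```python
from urllib.robotparser import RobotFileParser

# Illustrative robots.txt content with more than one Sitemap line.
ROBOTS_TXT = """\
User-agent: *
Disallow: /Admin/

Sitemap: http://example.com/SiteMap.ashx
Sitemap: http://example.com/Blog/BlogSiteMap.ashx
"""


def list_sitemaps(robots_text):
    """Return the sitemap URLs declared in robots_text, in file order."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    # site_maps() returns None when no Sitemap line is present.
    return parser.site_maps() or []


if __name__ == "__main__":
    for url in list_sitemaps(ROBOTS_TXT):
        print(url)
```

Each Sitemap line still has to be an absolute URL, so this only helps for a single-site installation.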

Best,

Joe
