J!Extensions Store™
TOPIC: Re:Sitemap generates all url's
#408
John Dagelmore
Admin
Posts: 3716
User Online Now
Re:Sitemap generates all url's Karma: 79  
Hello Raoul,

In your case, try unpublishing the data source of type 'content'.
For some reason, sh404sef seems to generate two different URLs for the same article when it is linked to a menu.

The issue is not about grabbing rewritten URLs from sh404sef. JSitemap is essentially an aggregator of URLs; it does not generate URLs by itself.

The problem arises with sh404sef because it manages URLs differently from Joomla's native SEF and associates raw links with the article alias. Joomla uses different raw links in the native content component and in menu items, which is why sh404sef can record different, duplicate links to the same article. It usually works fine and chooses the same alias for both. You could also look at the URL list in sh404sef to check whether both links are present. In your case you should have both:

brands-en/avaya

and

sells-new-used-and-refurbished-avaya

Regards

John
 
Logged Logged  
  The administrator has disabled public write access.
#409
totoweb
Junior Boarder
Posts: 20
User Offline
Re:Sitemap generates all url's Karma: 0  
Hello John,

"in your case try to unpublish the data source of type 'content'."

Where do I do that? I can't find the button/switch.

Regards, Raoul
 
#410
John Dagelmore
See the screenshot below: the main data source list has a standard publish button for each data source.
 
#413
totoweb
Hello John,

That did the trick, but it now excludes all non-rewritten URLs.

I also see a lot of duplicate URLs in the images sitemap:
http://www.hollandhardware.com/en/?option=com_jmap&view=sitemap&format=images&lang=en

The URLs generated in JMap do not correspond to the final URL.

Jmap: http://www.hollandhardware.com/index.php?option=com_jmap&view=sitemap&format=images&lang=en

Website: http://www.hollandhardware.com/en/?option=com_jmap&view=sitemap&format=xml&lang=en

The JMap URL is added to robots.txt, but can the spiders find it, given that it differs from the URL on the website?

Regards,
Raoul
 
#414
John Dagelmore
Hello Raoul,

If you unpublish the content data source, all article URLs are excluded. If you want to keep some URLs, leave the data source published and exclude only specific categories or single articles.

In the images sitemap there are no duplicates. The same image is found on different pages, like this one: http://www.hollandhardware.com/images/content/pijl.png

The image is present on more than one page and should probably be excluded because it is not interesting for indexing. This one is probably more suitable: http://www.hollandhardware.com/en/cisco-en/cisco-catalyst-3750-series
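
To see which images are shared across pages, a small script can group the sitemap entries by image URL. The following is only a sketch, assuming the images sitemap uses the standard sitemaps.org `urlset` with the Google image extension namespace. The sample XML is hypothetical: the Avaya page URL is assembled from the alias mentioned earlier in the thread and `example-product.png` is made up for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Namespaces assumed for a standard images sitemap.
NS = {
    "sm": "http://www.sitemaps.org/schemas/sitemap/0.9",
    "image": "http://www.google.com/schemas/sitemap-image/1.1",
}

# Hypothetical sample, not the real sitemap output of the site.
SAMPLE_SITEMAP = """<urlset
    xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
    xmlns:image="http://www.google.com/schemas/sitemap-image/1.1">
  <url>
    <loc>http://www.hollandhardware.com/en/cisco-en/cisco-catalyst-3750-series</loc>
    <image:image>
      <image:loc>http://www.hollandhardware.com/images/content/pijl.png</image:loc>
    </image:image>
    <image:image>
      <image:loc>http://www.hollandhardware.com/images/content/example-product.png</image:loc>
    </image:image>
  </url>
  <url>
    <loc>http://www.hollandhardware.com/en/brands-en/avaya</loc>
    <image:image>
      <image:loc>http://www.hollandhardware.com/images/content/pijl.png</image:loc>
    </image:image>
  </url>
</urlset>"""


def shared_images(xml_text):
    """Return {image_url: [page_url, ...]} for images listed under more than one page."""
    root = ET.fromstring(xml_text)
    pages_by_image = defaultdict(set)
    for url in root.findall("sm:url", NS):
        page = (url.findtext("sm:loc", namespaces=NS) or "").strip()
        for loc in url.findall("image:image/image:loc", NS):
            if loc.text:
                pages_by_image[loc.text.strip()].add(page)
    return {
        image: sorted(pages)
        for image, pages in pages_by_image.items()
        if len(pages) > 1
    }


for image, pages in shared_images(SAMPLE_SITEMAP).items():
    print(image, "appears on", len(pages), "pages")
```

Images reported by such a check are not duplicate sitemap entries. They are candidates for exclusion when, like an arrow icon, they add nothing to image indexing.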

The canonical link that generates the sitemap, and that has to be added to robots.txt, is this: http://www.hollandhardware.com/index.php?option=com_jmap&view=sitemap&format=images&lang=en

When you open it in your browser with multilanguage enabled, Joomla redirects with a 301 to this URL: http://www.hollandhardware.com/en/?option=com_jmap&view=sitemap&format=xml&lang=en

This won't affect search engines, thanks to the JSitemap plugin: it ensures that even search engines such as Bing, which doesn't support 301 redirects, are able to fetch the sitemap.
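
For reference, the robots.txt entry can be built from the canonical `index.php` form of the link rather than the language-prefixed address the browser ends up on. This is a minimal sketch: the helper names `jsitemap_url` and `robots_sitemap_line` are hypothetical and not part of JSitemap, and the only robots.txt syntax relied on is the standard `Sitemap:` directive.

```python
from urllib.parse import urlencode


def jsitemap_url(site_root, fmt, lang):
    """Build the canonical (non-redirected) sitemap link for a given format and language."""
    query = urlencode(
        {"option": "com_jmap", "view": "sitemap", "format": fmt, "lang": lang}
    )
    return f"{site_root.rstrip('/')}/index.php?{query}"


def robots_sitemap_line(url):
    """Format a sitemap URL as a robots.txt 'Sitemap:' directive."""
    return f"Sitemap: {url}"


images_url = jsitemap_url("http://www.hollandhardware.com", "images", "en")
print(robots_sitemap_line(images_url))
```

One such line per sitemap format and language can be added to robots.txt.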

Regards

John
 
#415
totoweb
Ok, thank you for the quick response!
 