I'm creating an asset listing of my site so I can generate a sitemap.xml file for submission to search engines. It's all working beautifully, except that I need to work out how to get the asset listing to ignore some assets (like the 404 page asset).
I know that a sitemap page has the option to exclude/include specific assets. How do I accomplish the same thing with an asset listing?
I don't think you can. I think that this is being added to a future version of Matrix.
Thanks. It was what I sort of had concluded.
I changed my asset listing: rather than giving it a single root node for the whole site, I set multiple root nodes, one for each navigation section of the site, plus the extra nodes that contain the PDFs etc. that should be indexed as well.
Good idea. The only frustrating thing about that is that the root node asset itself doesn't actually get included in the asset listing!
I've hardcoded it into my XML design to get around it :)
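For anyone building the same thing outside Matrix, the idea (list the children under the root nodes, add the root URL by hand, and skip anything unwanted) can be sketched in Python. This is only an illustration under assumptions: `build_sitemap` and the example.com URLs are made up for the example, and it is not how the asset listing works internally.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(root_url, child_urls, excluded=()):
    """Return sitemap XML as a string, with the root URL added by hand.

    Hypothetical helper: URLs in `excluded` are skipped and duplicates
    are only written once.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    seen = set()
    # The root goes first because the listing itself would not include it.
    for url in [root_url, *child_urls]:
        if url in excluded or url in seen:
            continue
        seen.add(url)
        ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


if __name__ == "__main__":
    print(build_sitemap(
        "https://example.com/",
        ["https://example.com/about", "https://example.com/404"],
        excluded={"https://example.com/404"},
    ))
```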
Correct. It should be in the version released yesterday.
Hmmm...I get 404s when trying to view the changelogs for 3.16.7 and 3.18.0. I assume the change is only in 3.18.0?
Yeah, it looks like the RSS data pump to the forum is a bit iffy at the moment. You can find the changelogs here: http://matrix.squiz.net/developer/changelogs
I'm not sure when the feature was added, but Greg will know. :)
Any chance of a 3.16 to 3.18 changelog? I don’t want to have to go through all the 3.18RC changelogs to look for what’s changed.
Looks like it’s a 3.18 only feature.
[quote]I'm creating an asset listing of my site so I can generate a sitemap.xml file for submission to search engines. It's all working beautifully except for the fact I need to work out how to get the asset listing to ignore some assets (like the 404 page asset).
I know creating a sitemap page has the option to exclude/include specific assets, how do I accomplish the same thing for an asset listing?[/quote]
You could try using the Link Value Filtering option on the Asset Listing. If you set any pages you want to exclude to "exclude", then leave the Link Value blank, that might work. I haven't tried it before though.
That works bloody beautifully. :)
Huzzah!
My hack:
Make the asset listing only list live assets. Then set the status of the assets you don't want listed to 'Up for Review'. They won't be listed, and it works like a charm.
Nifty, but it's not compatible with workflow or with people who actually want to use the Up for Review status for its intended purpose. :)
Totally. That could probably be worked around by changing or disconnecting the workflow on those assets. I guess any asset you don't want in your sitemap isn't going to be a customer-facing page and wouldn't need a workflow process anyway.
Just don't do what I did and get nailed for listing everything in the sitemap and then using robots.txt to block the pages you don't want in Google. The bot HATES having to do the legwork on all the files referenced by your sitemap only to find out you've robots.txt'd them back out…
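One way to catch that conflict before submitting is to check every sitemap URL against the robots.txt rules. Below is a minimal sketch using Python's standard `urllib.robotparser`; the `blocked_sitemap_urls` helper, the sample rules and the example.com URLs are assumptions for illustration only, and real crawlers may interpret rules differently from this parser.

```python
from urllib.robotparser import RobotFileParser


def blocked_sitemap_urls(robots_txt, sitemap_urls, user_agent="*"):
    """Return the sitemap URLs that the given robots.txt text disallows.

    Hypothetical helper: `robots_txt` is the file content as a string.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [u for u in sitemap_urls if not parser.can_fetch(user_agent, u)]


if __name__ == "__main__":
    robots = "User-agent: *\nDisallow: /404\n"
    urls = ["https://example.com/", "https://example.com/404"]
    # Anything printed here is listed in the sitemap but blocked for crawlers.
    print(blocked_sitemap_urls(robots, urls))
```

If the list comes back non-empty, it is probably cleaner to drop those URLs from the listing than to leave them in and block them.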
It's why I was trying to find a way to remove them from the asset listing itself so I didn't have to robots.txt them out.
You could probably generate the robots.txt with an Asset Listing if you really wanted to :)
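As a rough idea of what such a generated file would contain, here is a small Python sketch that turns a list of excluded paths into robots.txt text. The `build_robots_txt` name and the paths are hypothetical; in Matrix the listing's own type format would produce the lines instead.

```python
def build_robots_txt(excluded_paths, user_agent="*"):
    """Return robots.txt text with one Disallow line per excluded path.

    Hypothetical helper: paths are de-duplicated and sorted so the
    output is stable between runs.
    """
    lines = [f"User-agent: {user_agent}"]
    lines += [f"Disallow: {path}" for path in sorted(set(excluded_paths))]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(build_robots_txt(["/404", "/search"]), end="")
```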