We have a three site setup with three start pages.
This can be accessible from the outside as (same page in epi):
Which means every page is accessable by two url:s which is not a great for google etc.
How can we avoid this?
You can use canonical url feature. that let search engine crawlers force to index by that url (added in canonical meta).
See this link for more info.
Yes we have this already.
But we still do not want the dual url:s to be a accessable.
I think in that case you can use the Partial routing. That may be a good option to omit the home page name in url. Just like we do to omit the containers (folder).
Ok do you have any example code or documentation how to use it?
I suggest you to not override the default functionality specially for the home page.
Instead of this you can exclude the /home from the search engines using the robots.txt file. Define the URL /home in the robots.txt and search engines will not index this page.
If you want to go with partial router then you can refer below blog post-
Hm I doubt this is the most efficiant way to go.
There must be a simpler way about it, but lacking the information.
Will suggest this to the customer. But I'm pretty sure they will not accept it.
Unless your client hands out those specific start page URLs, or tells Google (and others) about them, I don't think anyone would notice. And if you even include canonical links on pages (including the start pages), nothing should happen if search engines came across such link.
Amending Episerver routing logic to removing those extra URLs would probably be too much effort and complexity. You might risk breaking something else in the process.
You could also install some kind of redirect manager, or redirect using IIS URL Rewrite.