Skip to content

Sitemap does not include HTTPS links for custom domains #5445

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
strophy opened this issue Mar 13, 2019 · 11 comments
Closed

Sitemap does not include HTTPS links for custom domains #5445

strophy opened this issue Mar 13, 2019 · 11 comments
Labels
Accepted Accepted issue on our roadmap Bug A bug Good First Issue Good for new contributors Sprintable Small enough to sprint on

Comments

@strophy
Copy link

strophy commented Mar 13, 2019

The generated sitemap.xml gets the URL for each available version by calling get_docs_url. For custom domains, this returns HTTP links, which perform a 301 redirect to the HTTPS version. But sitemaps should only point to actual pages (references: Google, Bing) and not redirects, or the crawler may begin losing trust in the sitemap and put a penalty on the information it contains.

Also, the docstring for this method is probably incorrect, as noted here. This method actually returns HTTPS links for non-custom domains. Related issue: #4641

Example: https://docs.dash.org/sitemap.xml

@stsewd stsewd added the Bug A bug label Mar 13, 2019
@humitos humitos added the Good First Issue Good for new contributors label Mar 25, 2019
@stale
Copy link

stale bot commented May 23, 2019

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the Status: stale Issue will be considered inactive soon label May 23, 2019
@dojutsu-user
Copy link
Member

Still valid bot.

@stale stale bot removed the Status: stale Issue will be considered inactive soon label May 23, 2019
@humitos humitos added the Accepted Accepted issue on our roadmap label May 23, 2019
@humitos
Copy link
Member

humitos commented May 23, 2019

Marking the issue as Accepted since is something we want to fix.

@KartikKapil
Copy link
Contributor

hello @humitos i would like take this issue up what exactly do i need to do for this ???

@humitos
Copy link
Member

humitos commented Oct 28, 2019

Hi @KartikKapil! The requirements for this issue are in the description of the issue. Is there any particular point that you don't understand from there?

@KartikKapil
Copy link
Contributor

KartikKapil commented Oct 28, 2019

@humitos i could not figure out that how to change the function" get_docs_url" to make sure it points to the pages but does not redirect

@ashadhaz
Copy link

Hey @humitos I would like to work on this issue. Is it still open?

@humitos
Copy link
Member

humitos commented Dec 22, 2019

@ashadhaz yes, it's still open

@Parth1811
Copy link
Contributor

@ashadhaz are you still working on this issue? If not I would like to work on this.

@Parth1811
Copy link
Contributor

I am working on this issue as mentioned on IRC.

@stsewd how should this be fixed, there are two approaches:

  1. Simply place https instead of http and expect the custom domain to have SSL.
  2. In the get_docs_url method make a request and check if the domain has SSL and then proceed accordingly.

@humitos humitos added the Sprintable Small enough to sprint on label Jan 25, 2020
@ericholscher
Copy link
Member

This is now fixed 👍

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Accepted Accepted issue on our roadmap Bug A bug Good First Issue Good for new contributors Sprintable Small enough to sprint on
Projects
None yet
Development

No branches or pull requests

8 participants