1535: Publish an official crawler policy r=sgrif a=sgrif
This is a formalization of a policy that we've been informally enforcing
for some time now. The policy basically boils down to:
- Just use the index if you can
- If you can't, contact us to see if we can help in a way that doesn't
require crawling.
- If you do crawl, limit yourself to 1 request per second (1 RPS)
- You also have to provide a user agent, which should actually identify
your crawler and have contact information
- We may still block you if you impact the integrity of the service.
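
For illustration only, here is a minimal sketch of a crawler that follows the rate limit and user agent points above. It is not part of the policy itself. The `USER_AGENT` value, the `RateLimiter` class, and the `build_request` and `crawl` helpers are all hypothetical names made up for this example, and it assumes only the Python standard library.

```python
import time
import urllib.request

# Hypothetical identity: replace with your crawler's real name and a
# working contact address so the operators can reach you.
USER_AGENT = "example-crawler/0.1 (contact: ops@example.com)"


class RateLimiter:
    """Spaces successive calls to wait() at least `min_interval` seconds apart."""

    def __init__(self, min_interval=1.0, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval  # 1.0 second corresponds to 1 RPS
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous permitted request

    def wait(self):
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


def build_request(url):
    """Attach the identifying User-Agent header to every request."""
    return urllib.request.Request(url, headers={"User-Agent": USER_AGENT})


def crawl(urls, limiter=None):
    """Fetch each URL in turn, never starting more than one request per second."""
    limiter = limiter or RateLimiter()
    for url in urls:
        limiter.wait()
        with urllib.request.urlopen(build_request(url), timeout=30) as resp:
            yield url, resp.read()
```

The limiter is applied before every request rather than after, so a slow response never causes the next request to start early. Staying under the limit does not guarantee access: the last point in the list still applies.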
I chose not to explicitly call out inflating a single crate's download
numbers as something that's forbidden, as it felt like doing that would
be an instance of ["Don't stuff beans up your
nose"](https://en.wikipedia.org/wiki/Wikipedia:Don%27t_stuff_beans_up_your_nose).
That falls under the umbrella of "impacting the integrity of the
service" though, so this policy does give us an explicit reason to block
anyone engaging in that behavior.
Co-authored-by: Sean Griffin <[email protected]>