Ref #gh_164 First draft of Cricket analytics using NumPy #174

Merged on Apr 26, 2020 (9 commits)

Conversation

shaloo
Contributor

@shaloo shaloo commented Mar 9, 2020

Finally, managed to correct my fork. It is now synced up with NumPy/numpy.org. Here is the first draft of the cricket sports analytics case study.

Brief description of what is fixed or changed

Deployed URL

http://numpy-174.surge.sh/

@matthew-brett

Very glad to see this - thanks!

Member

@rgommers rgommers left a comment

Thanks @shaloo! This reads well overall. I wonder if it's a little too text-heavy at the start, perhaps a few sentences can be trimmed - I made one concrete suggestion about the money part, there may be a couple of other options.

You identified a good number of usages of NumPy in blog posts. I remember there was a package as well previously, but it was a little light. I wonder if there's some other package, or maybe a professional sports analytics firm that could provide a testimonial or insight in their NumPy usage?

Submitting the few comments I have. Could you break lines to <80 chars? Then it's easier to comment on textual things.

@shaloo
Contributor Author

shaloo commented Mar 15, 2020

> Thanks @shaloo! This reads well overall. I wonder if it's a little too text-heavy at the start, perhaps a few sentences can be trimmed - I made one concrete suggestion about the money part, there may be a couple of other options.
>
> You identified a good number of usages of NumPy in blog posts. I remember there was a package as well previously, but it was a little light. I wonder if there's some other package, or maybe a professional sports analytics firm that could provide a testimonial or insight in their NumPy usage?
>
> Submitting the few comments I have. Could you break lines to <80 chars? Then it's easier to comment on textual things.

Line width updated to 80/100 chars. Clipped out extra text at the start.

@shaloo
Contributor Author

shaloo commented Mar 15, 2020

Ralf, should we add this case study draft for the first website release? That would help us get attention from folks, news media, and sports entertainment enthusiasts, who could contribute more details or point out inaccuracies regarding NumPy usage in cricket analytics. So far, I have approached several people connected to the game but haven't found a NumPy link yet. I sent out some gentle reminders again today.
The data presented in this case study is based on publicly available information, and we need deeper insights from those who have actually used NumPy for cricket. They may not be on the NumPy contributor list as of now, so I think we need to increase visibility.

@rgommers
Member

> Ralf, should we add this case study draft for the first website release? T

Yes, I'd like to include it!

I rebased, did a copy-edit of the whole text, and fixed some figure captions and attributions. I will comment on some issues within a couple of hours.

Member

@rgommers rgommers left a comment

I think the topic and imagery are both quite nice. Main takes on the content:

  • The About Cricket section flows well.
  • Key Data Analytics Objectives could be tweaked slightly to focus more on the objectives, however it's already a good section.
  • In The Challenges there is too much text, and I'm not sure "Data Representation" is really one of the main challenges because if "Data cleaning and preprocessing" is done well then the representation will be okay.
  • In NumPy’s Role in Cricket Analytics there's significant cross-talk between the three bullets. For example, the first bullet discusses "graphing and visualization", while the third bullet has "data visualization" as a heading.
  • Summary text looks good. In the image I'd like to remove "data correlation", because that's not a "key capability" of NumPy.

<blockquote cite="https://www.scoopwhoop.com/sports/ms-dhoni/">
<p>You don't play for the crowd, you play for the country.</p>
<footer align="right">—M S Dhoni, <cite>International Cricket Player, ex-captain, Indian Team, plays for Chennai Super Kings in IPL</cite></footer>
</blockquote>
Member

It could be nice to replace this with a quote related to NumPy. I think of all the links in this case study, the Stats LLC one looks most substantial. @shaloo have you tried contacting http://patricklucey.com/index.html ?

Contributor Author

Initiated email conversation regarding the same.

Contributor Author

Waiting on an email response. We can update it later in a fresh PR once I get more insights from Patrick. Maybe we can close this one and open a new issue once I hear from Patrick.

Contributor Author

Ralf, can we merge the cricket case study updates now? I haven't heard back from most of the people I contacted about this, Patrick included. Once it shows up on the site, I think more people will come forward with feedback or better content.

@rgommers
Member

@shaloo I think I would give the case studies priority over the team page right now, since there'll be more review and changes needed for this PR than the team gallery one.

@shaloo
Contributor Author

shaloo commented Mar 29, 2020

sure, will get this in right away.

@shaloo shaloo requested a review from rgommers March 29, 2020 17:44
Member

@rgommers rgommers left a comment

This is looking pretty good now, one comment left. NumPy key benefits of "speed" and "numerical analysis" are the right ones I think.

@rgommers rgommers merged commit ad8334d into numpy:master Apr 26, 2020
@rgommers
Member

Merged, thanks Shaloo!

Member

@anirudh2290 anirudh2290 left a comment

Thanks @shaloo for this nice writeup. Overall looks good, I have added some small observations.

are used for tactical analysis.

* **Data Visualization:** Data graphing and [visualization](https://towardsdatascience.com/advanced-sports-visualization-with-pandas-matplotlib-and-seaborn-9c16df80a81b)
provides useful insights into relationship between various datasets.
Member

Although these two points make sense in general for how NumPy will be used in cricket analytics, and I am sure NumPy is used in cricket-related sports analysis, the linked articles aren't specific to cricket. Maybe it's the case that we don't have concrete examples today?

Contributor Author

That's correct. I have been chasing a few folks from the cricketing analytics world but haven't been able to get these inputs yet. Would you have any leads / insights into the same?

Comment on lines +126 to +127
and other PyData packages like Scikit-learn, SciPy, Matplotlib, and Jupyter.
in addition to latest machine learning and AI techniques. NumPy has been used
Member

nit: "," after Jupyter instead of "."


## Summary

Sports Analytics have changed the way professional games are played, especially
Member

Should it read "Sport Analytics has changed the way professional games are played" instead?

@shaloo
Contributor Author

shaloo commented May 7, 2020

> Thanks @shaloo for this nice writeup. Overall looks good, I have added some small observations.

Thank you Anirudh, appreciate your inputs!

@rgommers
Member

Thanks for the review @anirudh2290. Fixes were made in gh-226 (merged).

4 participants