You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: content/en/case-studies/cricket-analytics.md
+12-39
Original file line number
Diff line number
Diff line change
@@ -87,36 +87,25 @@ metrics for improving match winning chances:
87
87
88
88
***Data Cleaning and preprocessing**
89
89
90
-
There are several public and proprietary sources of cricket data. The
91
-
latter are mostly owned by broadcasting corporations that hold rights to
92
-
various seasons and matches played. IPL has expanded cricket beyond the
93
-
classic test match format to a much larger scale. The number of matches
94
-
played every season across various formats has increased and so has the data,
95
-
the algorithms, newer technologies and simulation models. Real time video
96
-
analysis requires field mapping, player tracking, ball tracking, player shot
97
-
analysis and several other aspects involved in how the ball is delivered, its
98
-
angle, spin, velocity and trajectory.
99
-
100
-
***Data Representation**
101
-
102
-
One of the most tricky challenges with sports analytics is getting the
103
-
right data representation. What this means is getting the raw data in a form
104
-
such that it can be laid out for comparison and building models. If the
105
-
initial representation itself is incorrect, everything that follows is akin
106
-
to fitting noise to detect a signal. In cricket, just like any other sport,
90
+
IPL has expanded cricket beyond the classic test match format to a much larger scale. The number of matches played every season across various formats has increased and so has the data, the algorithms, newer sports data analysis
91
+
technologies and simulation models. Cricket data analysis requires field mapping, player tracking, ball tracking, player shot analysis and several other aspects involved in how the ball is delivered, its angle, spin, velocity and trajectory. All these factors together have increased the complexity of data cleaning and preprocessing.
92
+
93
+
***Dynamic Modeling**
94
+
95
+
In cricket, just like any other sport,
107
96
there can be a large number of variables related to tracking various numbers
108
97
of players on the field, their attributes, the ball and several possibilities
109
-
of potential actions. The complexity of data analytics and representation is
98
+
of potential actions. The complexity of data analytics and modeling is
110
99
directly proportional to the kind of predictive questions that are put forth
111
100
during analysis and are highly dependent on data representation and the
112
101
model. Things get even more challenging in terms of computation, data
113
102
comparisons when dynamic cricket play predictions are sought such as what
114
103
would have happened if the batsman had hit the ball at a different angle or
115
-
velocity?
104
+
velocity.
116
105
117
106
***Predictive Analytics Complexity**
118
107
119
-
In cricket, some of the decision making is based on questions such as "how
108
+
Much of the decision making in Cricket is based on questions such as "how
120
109
often does a batsman play a certain kind of shot if the ball delivery is of a
121
110
particular type", or "how does a bowler change his line and length if the
122
111
batsman responds to his delivery in a certain way".
@@ -132,39 +121,23 @@ and other PyData packages like Scikit-learn, SciPy, Matplotlib, and Jupyter.
132
121
in addition to latest machine learning and AI techniques. NumPy has been used
133
122
for various kinds of cricket related sporting analytics such as:
134
123
135
-
***Data Correlation:**data graphing and [visualization](https://towardsdatascience.com/advanced-sports-visualization-with-pandas-matplotlib-and-seaborn-9c16df80a81b)
124
+
***Data Visualization:**Data graphing and [visualization](https://towardsdatascience.com/advanced-sports-visualization-with-pandas-matplotlib-and-seaborn-9c16df80a81b)
136
125
provides useful insights into relationship between various datasets;
0 commit comments