|
860 | 860 | "\n",
|
861 | 861 | "- Hierarchical algorithms: We can setup a Bayesian Bandit algorithm on top of smaller bandit algorithms. Suppose we have $N$ Bayesian Bandit models, each varying in some behavior (for example different `rate` parameters, representing varying sensitivity to changing environments). On top of these $N$ models is another Bayesian Bandit learner that will select a sub-Bayesian Bandit. This chosen Bayesian Bandit will then make an internal choice as to which machine to pull. The super-Bayesian Bandit updates itself depending on whether the sub-Bayesian Bandit was correct or not. \n",
|
862 | 862 | "\n",
|
863 |     | - "- Extending the rewards, denoted $y_a$ for bandit $a$, to random variables from a distribution $f_{y_a}(y)$ is straightforward. More generally, this problem can be rephrased as \"Find the bandit with the largest expected value\", as playing the bandit with the largest expected value is optimal. In the case above, $f_{y_a}$ was Bernoulli with probability $p_a$, hence the expected value for a bandit is equal to $p_a$, which is why it looks like we are aiming to maximize the probability of winning. If $f$ is not Bernoulli, and it is non-negative, which can be accomplished apriori by shifting the distribution (we assume we know $f$), then the algorithm behaves as before:\n",
    | 863 | + "- Extending the rewards, denoted $y_a$ for bandit $a$, to random variables from a distribution $f_{y_a}(y)$ is straightforward. More generally, this problem can be rephrased as \"Find the bandit with the largest expected value\", as playing the bandit with the largest expected value is optimal. In the case above, $f_{y_a}$ was Bernoulli with probability $p_a$, hence the expected value for a bandit is equal to $p_a$, which is why it looks like we are aiming to maximize the probability of winning. If $f$ is not Bernoulli, and it is non-negative, which can be accomplished a priori by shifting the distribution (we assume we know $f$), then the algorithm behaves as before:\n",
864 | 864 | "\n",
|
865 | 865 | " For each round, \n",
|
866 | 866 | " \n",
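The hierarchical-bandit extension in the first bullet above (a super-bandit choosing among sub-bandits) can be sketched in a few lines of Python. Nothing below comes from the notebook itself: the class names, the Beta-Bernoulli Thompson-sampling sub-bandits, and the reading of `rate` as a forgetting factor that discounts old evidence are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(42)


class BayesianBandit:
    """Beta-Bernoulli Thompson sampling over n_arms machines."""

    def __init__(self, n_arms, rate=1.0):
        self.rate = rate                  # < 1.0 discounts old evidence (hypothetical reading of `rate`)
        self.wins = np.zeros(n_arms)
        self.trials = np.zeros(n_arms)

    def choose(self):
        # One posterior draw per arm; play the arm with the largest draw.
        samples = rng.beta(1 + self.wins, 1 + self.trials - self.wins)
        return int(np.argmax(samples))

    def update(self, arm, reward):
        self.wins *= self.rate            # forget a little of the past ...
        self.trials *= self.rate
        self.wins[arm] += reward          # ... then record the new pull
        self.trials[arm] += 1


class SuperBandit:
    """A Bayesian Bandit whose arms are themselves Bayesian Bandits."""

    def __init__(self, sub_bandits):
        self.sub_bandits = sub_bandits
        self.selector = BayesianBandit(len(sub_bandits))

    def pull(self, true_probs):
        i = self.selector.choose()            # super-level: pick a sub-bandit
        arm = self.sub_bandits[i].choose()    # sub-level: pick a machine
        reward = float(rng.random() < true_probs[arm])
        self.sub_bandits[i].update(arm, reward)
        self.selector.update(i, reward)       # super-bandit is rewarded iff the pull won
        return reward


true_probs = [0.15, 0.2, 0.3]
subs = [BayesianBandit(len(true_probs), rate=r) for r in (1.0, 0.99, 0.9)]
hierarchy = SuperBandit(subs)
total = sum(hierarchy.pull(true_probs) for _ in range(2000))
print("total reward:", total)
```

The super-bandit treats "sub-bandit $i$ produced a winning pull" as its own Bernoulli reward, so over time it concentrates on whichever sub-bandit's behavior best fits the current environment.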
|
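The second bullet's extension to general reward distributions can be illustrated in the same spirit. Since the per-round steps are cut off by this hunk, the sketch below makes its own modelling assumption, a Normal-Normal conjugate posterior over each bandit's expected reward with the observation variance fixed at 1; this is a deliberate simplification and not necessarily the update the notebook uses.

```python
import numpy as np

rng = np.random.default_rng(0)

# Unknown-to-the-player reward distributions: non-negative, different means.
true_means = np.array([1.0, 1.5, 2.0])
n_arms = len(true_means)

# Normal-Normal conjugate posterior over each arm's expected reward,
# pretending the observation variance is 1 (a simplifying assumption).
prior_mean = np.zeros(n_arms)
prior_prec = np.full(n_arms, 1e-2)     # weak prior precision
sum_rewards = np.zeros(n_arms)
counts = np.zeros(n_arms)

for _ in range(3000):
    # Each round: draw one plausible expected value per arm ...
    post_prec = prior_prec + counts
    post_mean = (prior_prec * prior_mean + sum_rewards) / post_prec
    samples = rng.normal(post_mean, 1.0 / np.sqrt(post_prec))
    # ... play the arm whose sampled expected value is largest ...
    arm = int(np.argmax(samples))
    # ... observe a (non-negative) reward and update that arm's posterior.
    reward = rng.exponential(true_means[arm])
    sum_rewards[arm] += reward
    counts[arm] += 1

print("pulls per arm:", counts)        # most pulls land on the largest-mean arm
```

Even though the exponential rewards violate the fixed-variance assumption, sampling a plausible expected value per arm and playing the argmax still concentrates pulls on the largest-mean bandit, which is the point of the "largest expected value" reformulation.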
|
4924 | 4924 | "Peadar is known as @springcoil on Twitter and is an Irish Data Scientist with a Mathematical focus, he is currently based in Luxembourg. \n",
|
4925 | 4925 | "I came across the following blog post on http://danielweitzenfeld.github.io/passtheroc/blog/2014/10/28/bayes-premier-league/ \n",
|
4926 | 4926 | "I quote from him, about his realization about Premier League Football -\n",
|
4927 |      | - "_It occurred to me that this problem is perfect for a Bayesian model. We want to infer the latent paremeters (every team's strength) that are generating the data we observe (the scorelines). Moreover, we know that the scorelines are a noisy measurement of team strength, so ideally, we want a model that makes it easy to quantify our uncertainty about the underlying strengths.\n",
     | 4927 | + "_It occurred to me that this problem is perfect for a Bayesian model. We want to infer the latent parameters (every team's strength) that are generating the data we observe (the scorelines). Moreover, we know that the scorelines are a noisy measurement of team strength, so ideally, we want a model that makes it easy to quantify our uncertainty about the underlying strengths.\n",
4928 | 4928 | "\n",
|
4929 | 4929 | "_So I googled 'Bayesian football' and found this paper, called 'Bayesian hierarchical model for the prediction of football results.' The authors (Gianluca Baio and Marta A. Blangiardo) being Italian, though, the 'football' here is soccer._\n",
|
4930 | 4930 | "\n",
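Before the model code appears further down, it may help to state roughly what the paper's hierarchical model looks like. Paraphrasing Baio and Blangiardo (this is a summary of the referenced paper, not a quote from this notebook), the two scorelines of game $g$ are modelled as Poisson counts whose log-rates decompose into a home advantage plus the attacking strength of the scoring team and the defensive weakness of the conceding team:

$$ y_{gj} \mid \theta_{gj} \sim \text{Poisson}(\theta_{gj}), \qquad j = 1, 2 $$

$$ \log \theta_{g1} = home + att_{h(g)} + def_{a(g)}, \qquad \log \theta_{g2} = att_{a(g)} + def_{h(g)} $$

with the $att$ and $def$ effects constrained to sum to zero across teams, which is exactly the "sum to zero constraint" trick commented in the code below.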
|
|
5238 | 5238 | " tau=tau_def, \n",
|
5239 | 5239 | " size=num_teams, \n",
|
5240 | 5240 | " value=def_starting_points.values) \n",
|
5241 |      | - "# trick to code the sum to zero contraint\n",
     | 5241 | + "# trick to code the sum to zero constraint\n",
5242 | 5242 | "@pymc.deterministic\n",
|
5243 | 5243 | "def atts(atts_star=atts_star):\n",
|
5244 | 5244 | " atts = atts_star.copy()\n",
|
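The "sum to zero constraint" comment refers to centering: the unconstrained `atts_star` values are shifted by their mean so the team attack effects sum to zero, which keeps the model identifiable. The hunk cuts the function off after the `.copy()` line, so the self-contained PyMC 2 sketch below is an assumption about how it continues (it mirrors the standard centering pattern), not a verbatim quote of the notebook:

```python
import numpy as np
import pymc  # PyMC 2.x, the library used throughout this notebook

num_teams = 20
tau_att = pymc.Gamma('tau_att', 0.1, 0.1)

# Unconstrained ("starred") attack strengths, one per team.
atts_star = pymc.Normal('atts_star', mu=0, tau=tau_att, size=num_teams)

# Centering trick: subtracting the mean forces the attack
# effects to sum to zero, making the model identifiable.
@pymc.deterministic
def atts(atts_star=atts_star):
    atts = atts_star.copy()
    atts = atts - np.mean(atts_star)
    return atts
```

The visible hunk builds `defs_star` the same way (`tau=tau_def`, `def_starting_points`), and the same centering would be applied to it to obtain `defs`.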
|