Skip to content

Commit cad706b

Browse files
lheagytrevorcampbelljoelostblomactions-user
authored
deploy latest book (#106)
* welcome (#74) * update the welcome page to indicate this is the python version, include links to the R version * minor edits (newline for better diffs) Co-authored-by: Trevor Campbell <[email protected]> * put headings on the table of contents for easier navigation (#75) * Update metadata and add chapter numbering (#44) * Update _config.yml * Update repo and branch info * Make some visible changes * Include packages we actually use and updated jupyterbook to fix netlify * Test rebuilding notebooks again * Fix typo * Update ToC * Add ibis * Always execute all notebooks * Delete requirements.txt * Update source/_config.yml * Update source/_config.yml * Change path and branch * Increase timeout * Update _config.yml (#90) * Ch3: wrangling (#76) * wip on ch3 * working on wrangling chapter * move chaining content to the intro * update content on summary statistics * update the disucssion on apply * remove the discussion on What is a List * move the assign content to the very end * minor wordsmithing on welcome page * edited learning objs in ch1 for new chaining * update discussion on chaining, edits from Trevor * update discussion of split * remove unnecessary call to File: dir Node: Top This is the top of the INFO tree This (the Directory node) gives a menu of major topics. Typing "d" returns here, "q" exits, "?" lists all INFO commands, "h" gives a primer for first-timers, "mEmacs<Return>" visits the Emacs topic, etc. In Emacs, you can click mouse button 2 on a menu item or cross reference to select it. --- PLEASE ADD DOCUMENTATION TO THIS TREE. (See INFO topic first.) --- * Menu: The list of major topics begins on the next line. Emacs * Ada mode: (ada-mode). The GNU Emacs mode for editing Ada. * Autotype: (autotype). Convenient features for text that you enter frequently in Emacs. * CC Mode: (ccmode). Emacs mode for editing C, C++, Objective-C, Java, Pike, and IDL code. * CL: (cl). Partial Common Lisp support for Emacs Lisp. * Dired-X: (dired-x). Dired Extra Features. * EUDC: (eudc). A client for directory servers (LDAP, PH) * Ebrowse: (ebrowse). A C++ class browser for Emacs. * Ediff: (ediff). A visual interface for comparing and merging programs. * Emacs: (emacs). The extensible self-documenting text editor. * Emacs FAQ: (efaq). Frequently Asked Questions about Emacs. * Emacs MIME: (emacs-mime). The MIME de/composition library. * Eshell: (eshell). A command shell implemented in Emacs Lisp. * Forms: (forms). Emacs package for editing data bases by filling in forms. * Gnus: (gnus). The newsreader Gnus. * IDLWAVE: (idlwave). Major mode and shell for IDL and WAVE/CL files. * MH-E: (mh-e). Emacs interface to the MH mail system. * Message: (message). Mail and news composition mode that goes with Gnus. * PCL-CVS: (pcl-cvs). Emacs front-end to CVS. * RefTeX: (reftex). Emacs support for LaTeX cross-references and citations. * SC: (sc). Supercite lets you cite parts of messages you're replying to, in flexible ways. * Speedbar: (speedbar). File/Tag summarizing utility. * VIP: (vip). An older VI-emulation for Emacs. * VIPER: (viper). The newest Emacs VI-emulation mode. (also, A VI Plan for Emacs Rescue or the VI PERil.) * Widget: (widget). Documenting the "widget" package used by the Emacs Custom facility. * WoMan: (woman). Browse UN*X Manual Pages `Wo (without) Man'. Texinfo documentation system * Info: (info). Documentation browsing system. Miscellaneous * Screen: (screen). Full-screen window manager. * Standards: (standards). GNU coding standards. GNU admin * Autoconf: (autoconf). Create source code configuration scripts Individual utilities * aclocal: (automake)Invoking aclocal. Generating aclocal.m4 * autoconf: (autoconf)autoconf Invocation. How to create configuration scripts * autoreconf: (autoconf)autoreconf Invocation. Remaking multiple `configure' scripts * autoscan: (autoconf)autoscan Invocation. Semi-automatic `configure.ac' writing * config.status: (autoconf)config.status Invocation. Recreating a configuration * configure: (autoconf)configure Invocation. Configuring a package * ifnames: (autoconf)ifnames Invocation. Listing the conditionals in source code GNU programming tools * automake: (automake). Making Makefile.in's Utilities * Bash: (bash). The GNU Bourne-Again SHell. GNU Packages * Tar: (tar). Making tape (or disk) archives. Individual utilities * tar: (tar)tar invocation. Invoking GNU `tar' Software development * Cpp: (cpp). The GNU C preprocessor. * Cpplib: (cppinternals). Cpplib internals. * gcc: (gcc). The GNU Compiler Collection. * gccinstall: (gccinstall). Installing the GNU Compiler Collection. * gccint: (gccint). Internals of the GNU Compiler Collection. * gfortran: (gfortran). The GNU Fortran Compiler. GNU Libraries * libgomp: (libgomp). GNU OpenMP runtime library Programming & development tools * gdbm_dump: gdbm_dump(gdbm). Dump the GDBM database into a flat file. * gdbm_load: gdbm_load(gdbm). Load the database from a flat file. Utilities GNU libraries * gmp: (gmp). GNU Multiple Precision Arithmetic Library. Software libraries * GnuTLS: (gnutls). GNU Transport Layer Security Library. * GnuTLS-Guile: (gnutls-guile). GNU Transport Layer Security Library. Guile bindings. * libidn2: (libidn2). Internationalized domain names (IDNA2008/TR46) processing. * libtasn1: (libtasn1). Library for Abstract Syntax Notation One (ASN.1). * mpfr: (mpfr). Multiple Precision Floating-Point Reliable Library. GNU Packages * mpc: (mpc)Multiple Precision Complex Library. Development * fftw3: (fftw3). FFTW User's Manual. Individual utilities * aclocal-invocation: (automake)aclocal Invocation. Generating aclocal.m4. * autoconf-invocation: (autoconf)autoconf Invocation. How to create configuration scripts * autoheader: (autoconf)autoheader Invocation. How to create configuration templates * autom4te: (autoconf)autom4te Invocation. The Autoconf executables backbone * automake-invocation: (automake)automake Invocation. Generating Makefile.in. * autoreconf: (autoconf)autoreconf Invocation. Remaking multiple ‘configure’ scripts * autoscan: (autoconf)autoscan Invocation. Semi-automatic ‘configure.ac’ writing * autoupdate: (autoconf)autoupdate Invocation. Automatic update of ‘configure.ac’ * config.status: (autoconf)config.status Invocation. Recreating configurations. * configure: (autoconf)configure Invocation. Configuring a package. * ifnames: (autoconf)ifnames Invocation. Listing conditionals in source. * libtool-invocation: (libtool)Invoking libtool. Running the 'libtool' script. * libtoolize: (libtool)Invoking libtoolize. Adding libtool support. * testsuite: (autoconf)testsuite Invocation. Running an Autotest test suite. Software development * Autoconf: (autoconf). Create source code configuration scripts. * Automake: (automake). Making GNU standards-compliant Makefiles. * Automake-history: (automake-history). History of Automake development. * GNU libtextstyle: (libtextstyle). Output of styled text. * GNU libunistring: (libunistring). Unicode string library. * Libtool: (libtool). Generic shared library support script. Localization * idn2: (libidn2)Invoking idn2. Internationalized Domain Name (IDNA2008/TR46) conversion. Encryption * Nettle: (nettle). A low-level cryptographic library. System Administration * certtool: (gnutls)certtool Invocation. Manipulate certificates and keys. * gnutls-cli: (gnutls)gnutls-cli Invocation. GnuTLS test client. * gnutls-cli-debug: (gnutls)gnutls-cli-debug Invocation. GnuTLS debug client. * gnutls-serv: (gnutls)gnutls-serv Invocation. GnuTLS test server. * psktool: (gnutls)psktool Invocation. Simple TLS-Pre-Shared-Keys manager. * srptool: (gnutls)srptool Invocation. Simple SRP password tool. Libraries * libgpg-error: (gnupg). Error codes and common code for GnuPG. GNU Libraries * libgcrypt: (gcrypt). Cryptographic function library. C++ libraries * autosprintf: (autosprintf). Support for printf format strings in C++. GNU Gettext Utilities * ISO3166: (gettext)Country Codes. ISO 3166 country codes. * ISO639: (gettext)Language Codes. ISO 639 language codes. * autopoint: (gettext)autopoint Invocation. Copy gettext infrastructure. * envsubst: (gettext)envsubst Invocation. Expand environment variables. * gettext: (gettext). GNU gettext utilities. * gettextize: (gettext)gettextize Invocation. Prepare a package for gettext. * msgattrib: (gettext)msgattrib Invocation. Select part of a PO file. * msgcat: (gettext)msgcat Invocation. Combine several PO files. * msgcmp: (gettext)msgcmp Invocation. Compare a PO file and template. * msgcomm: (gettext)msgcomm Invocation. Match two PO files. * msgconv: (gettext)msgconv Invocation. Convert PO file to encoding. * msgen: (gettext)msgen Invocation. Create an English PO file. * msgexec: (gettext)msgexec Invocation. Process a PO file. * msgfilter: (gettext)msgfilter Invocation. Pipe a PO file through a filter. * msgfmt: (gettext)msgfmt Invocation. Make MO files out of PO files. * msggrep: (gettext)msggrep Invocation. Select part of a PO file. * msginit: (gettext)msginit Invocation. Create a fresh PO file. * msgmerge: (gettext)msgmerge Invocation. Update a PO file from template. * msgunfmt: (gettext)msgunfmt Invocation. Uncompile MO file into PO file. * msguniq: (gettext)msguniq Invocation. Unify duplicates for PO file. * ngettext: (gettext)ngettext Invocation. Translate a message with plural. * xgettext: (gettext)xgettext Invocation. Extract strings into a PO file. The Algorithmic Language Scheme * Guile Reference: (guile). The Guile reference manual. * R5RS: (r5rs). The Revised(5) Report on Scheme. * take care of colons preceding code blocks * take care of chapter references * add discussion of lists and dicts * add table and discussion on basic data structures in python: * add description of info * some general cleanup in apply, assign * typo fix in the intro * a couple of type fixes: * polish chaining/multiline exps * polish ch3 up to and incl tidy data * polish indexing * more polish ch3 * fix python exercises link * more on groupby * improve groupy and discussion of lambda functions * try re-ordering the assign and apply content * global find replace to remove . in naming conventions * caption on fig24 fixed * polish up to apply * cleanup through Using to create new columns * through the summery * add :tags: [output_scroll] for large code outputs, change figure types * trim vertical whitespace on figures: * Update source/wrangling.md Co-authored-by: Joel Ostblom <[email protected]> * Apply suggestions from code review Co-authored-by: Joel Ostblom <[email protected]> * polishing ch3 * final polish on wrangling * final polish on ch3 joel comments Co-authored-by: Trevor Campbell <[email protected]> Co-authored-by: Joel Ostblom <[email protected]> * added altair_saver extension * update build_html.sh script with new docker image * Ch4: Viz (#77) * code formatting for viz * update viz chapter * updating the viz chapter * comments addressed through faithful dataset * more progress on the viz chapter (part way through morley data) * add back code to create the csv for mauna_loa * a couple minor typo fixes * polishing ch4 * minor polish on ch4 * code tags in learning objs * polish on ch4, fixed number -> percentage in figure labels * re-added other filetypes... * better line formatting in saving section * ignore altair warnings; committed faithful plots * moved faithful plots to img/ * done polishing ch4 Co-authored-by: Trevor Campbell <[email protected]> * removed unused material * Front matter (#96) * preface python * remove foreword * added editors page * fix appendix,references * added py acks * minor ed * Update editors.md add Lindsey bio * Add joels bio Co-authored-by: Lindsey Heagy <[email protected]> Co-authored-by: Joel Ostblom <[email protected]> * Add jupyterlab help section (#101) * Ch1 fig cleanup (#99) * first figures in ch1: * code figures for ch1, including ppt to edit them * update figure sizes * remove old lingering image * removed hidden pptx cache file Co-authored-by: Trevor Campbell <[email protected]> * Ch2 fig cleanup (#102) * update output scrolling for ch2 * update scrolling of large output tables * Ch3 fig cleanup (#103) * figure polishing for ch3 * more ch3 figures Co-authored-by: Trevor Campbell <[email protected]> Co-authored-by: Joel Ostblom <[email protected]> Co-authored-by: GitHub Action <[email protected]>
1 parent 3917877 commit cad706b

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

52 files changed

+2489
-4082
lines changed

Dockerfile

+1
Original file line numberDiff line numberDiff line change
@@ -18,6 +18,7 @@ RUN mamba install --quiet --yes \
1818
'numpy' \
1919
'jinja2' \
2020
'altair_data_server' \
21+
'altair_saver' \
2122
'click' \
2223
'ibis-framework' \
2324
'ghp-import' \

build_html.sh

+1-1
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,2 @@
11
chmod -R o+w source/
2-
docker run --rm -v $(pwd):/home/jovyan ubcdsci/py-intro-to-ds:202212191809333bdc71 /bin/bash -c "jupyter-book build source"
2+
docker run --rm -v $(pwd):/home/jovyan ubcdsci/py-intro-to-ds:20230104230634037f38 /bin/bash -c "jupyter-book build source"

requirements.txt

-3
This file was deleted.

source/_config.yml

+7-15
Original file line numberDiff line numberDiff line change
@@ -1,11 +1,7 @@
1-
#######################################################################################
2-
# Config file for EOSC211 jupyter book
3-
#######################################################################################
41
# Book settings
5-
6-
title: DSCΙ 100
2+
title: "Data Science: A First Introduction (Python Edition)"
73
author: UBC
8-
copyright: "2021" # Copyright year to be placed in the footer
4+
copyright: "2022" # Copyright year to be placed in the footer
95
logo: "" # A path to the book logo
106
# Patterns to skip when building the book. Can be glob-style (e.g. "*skip.ipynb")
117
exclude_patterns: [_build, Thumbs.db, .DS_Store, "*.ipynb_checkpoints"]
@@ -15,10 +11,10 @@ only_build_toc_files: true
1511
#######################################################################################
1612
# Execution settings
1713
execute:
18-
execute_notebooks: "cache" # Whether to execute notebooks at build time. Must be one of ("auto", "force", "cache", "off")
14+
execute_notebooks: "auto" # Whether to execute notebooks at build time. Must be one of ("auto", "force", "cache", "off")
1915
cache: "" # A path to the jupyter cache that will be used to store execution artifacts. Defaults to `_build/.jupyter_cache/`
2016
# exclude_patterns: [] # A list of patterns to *skip* in execution (e.g. a notebook that takes a really long time)
21-
timeout: 30 # The maximum time (in seconds) each notebook cell is allowed to run.
17+
timeout: 90 # The maximum time (in seconds) each notebook cell is allowed to run.
2218
run_in_temp:
2319
false # If `True`, then a temporary directory will be created and used as the command working directory (cwd),
2420
# otherwise the notebook's parent directory will be the cwd.
@@ -65,19 +61,15 @@ latex:
6561
latex_engine: pdflatex # one of 'pdflatex', 'xelatex' (recommended for unicode), 'luatex', 'platex', 'uplatex'
6662
use_jupyterbook_latex: true # use sphinx-jupyterbook-latex for pdf builds as default
6763

68-
6964
#######################################################################################
70-
7165
# Launch button settings
7266
launch_buttons:
7367
binderhub_url: ""
7468

75-
76-
7769
repository:
78-
url: https://github.com/phaustin/eosc211_students # The URL to your book's repository
79-
path_to_book: "" # A path to your book's folder, relative to the repository root.
80-
branch: e211_live_main # Which branch of the repository should be used when creating links
70+
url: https://github.com/UBC-DSCI/introduction-to-datascience-python # The URL to your book's repository
71+
path_to_book: "source" # A path to your book's folder, relative to the repository root.
72+
branch: production # Which branch of the repository should be used when creating links
8173

8274
#######################################################################################
8375
# Advanced and power-user settings

source/_toc.yml

+11-6
Original file line numberDiff line numberDiff line change
@@ -1,15 +1,18 @@
11
format: jb-book
22
root: index.md
3-
options:
4-
numbered: true
53
parts:
6-
- caption: First draft
4+
- caption: Front Matter
75
chapters:
86
- file: preface-text.md
9-
- file: foreword-text.md
7+
#- file: foreword.md
108
- file: acknowledgements.md
9+
- file: acknowledgements-python.md
1110
- file: authors.md
12-
- file: setup.md
11+
- file: editors.md
12+
#- file: setup.md
13+
- caption: Chapters
14+
numbered: 3
15+
chapters:
1316
- file: intro.md
1417
- file: reading.md
1518
- file: wrangling.md
@@ -20,5 +23,7 @@ parts:
2023
- file: regression2.md
2124
- file: clustering.md
2225
- file: inference.md
23-
- file: references.md
26+
- caption: Appendix
27+
chapters:
2428
- file: appendixA.md
29+
#- file: references.md

source/acknowledgements-python.md

+25
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,25 @@
1+
---
2+
jupytext:
3+
cell_metadata_filter: -all
4+
formats: py:percent,md:myst,ipynb
5+
text_representation:
6+
extension: .md
7+
format_name: myst
8+
format_version: 0.13
9+
jupytext_version: 1.13.8
10+
kernelspec:
11+
display_name: Python 3 (ipykernel)
12+
language: python
13+
name: python3
14+
---
15+
16+
# Acknowledgments for the Python Edition
17+
18+
We'd like to thank everyone that has contributed to the development of
19+
[*Data Science: A First Introduction (Python Edition)*](https://ubc-dsci.github.io/introduction-to-datascience-python/).
20+
This is an open source Python translation of the original [*Data Science: A First Introduction*](https://datasciencebook.ca);
21+
the original focused on the R programming language. Both of these books are
22+
used to teach DSCI 100, a new introductory data science course
23+
at the University of British Columbia (UBC).
24+
25+
We will finalize this acknowledgements section after the book is complete!

source/acknowledgements.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ kernelspec:
1313
name: python3
1414
---
1515

16-
# Acknowledgments -- TBD
16+
# Acknowledgments
1717

1818
We'd like to thank everyone that has contributed to the development of
1919
[*Data Science: A First Introduction*](https://datasciencebook.ca).

source/appendixA.md

+1-3
Original file line numberDiff line numberDiff line change
@@ -13,9 +13,7 @@ kernelspec:
1313
name: python3
1414
---
1515

16-
# Appendix
17-
18-
# Downloading files from JupyterHub {#appendixA}
16+
# Downloading files from JupyterHub
1917

2018
This section will help you
2119
save your work from a JupyterHub web-based platform to your own computer.

source/authors.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -13,7 +13,7 @@ kernelspec:
1313
name: python3
1414
---
1515

16-
# About the authors -- TBD
16+
# About the authors
1717

1818
**Tiffany Timbers** is an Assistant Professor of Teaching in the Department of
1919
Statistics and Co-Director for the Master of Data Science program (Vancouver

source/editors.md

+51
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,51 @@
1+
---
2+
jupytext:
3+
cell_metadata_filter: -all
4+
formats: py:percent,md:myst,ipynb
5+
text_representation:
6+
extension: .md
7+
format_name: myst
8+
format_version: 0.13
9+
jupytext_version: 1.13.8
10+
kernelspec:
11+
display_name: Python 3 (ipykernel)
12+
language: python
13+
name: python3
14+
---
15+
16+
# About the editors of the Python Edition
17+
18+
**Trevor Campbell** is an Assistant Professor in the Department of Statistics at
19+
the University of British Columbia. His research focuses on automated, scalable
20+
Bayesian inference algorithms, Bayesian nonparametrics, streaming data, and
21+
Bayesian theory. He was previously a postdoctoral associate advised by Tamara
22+
Broderick in the Computer Science and Artificial Intelligence Laboratory
23+
(CSAIL) and Institute for Data, Systems, and Society (IDSS) at MIT, a Ph.D.
24+
candidate under Jonathan How in the Laboratory for Information and Decision
25+
Systems (LIDS) at MIT, and before that he was in the Engineering Science
26+
program at the University of Toronto.
27+
28+
+++
29+
30+
**Lindsey Heagy** is an Assistant Professor in the Department of Earth, Ocean, and Atmospheric
31+
Sciences and director of the Geophysical Inversion Facility at the University of British Columbia.
32+
Her research combines computational methods in numerical simulations, inversions, and machine
33+
learning to answer questions about the subsurface of the Earth. Primary applications include
34+
mineral exploration, carbon sequestration, groundwater and environmental studies. She
35+
completed her BSc at the University of Alberta, her PhD at the University of British Columbia,
36+
and held a Postdoctoral research position at the University of California Berkeley prior to
37+
starting her current position at UBC.
38+
39+
+++
40+
41+
**Joel Ostblom** is an Assistant Professor of Teaching in the Department of
42+
Statistics at the University of British Columbia.
43+
During his PhD, Joel developed a passion for data science and reproducibility
44+
through the development of quantitative image analysis pipelines for studying
45+
stem cell and developmental biology. He has since co-created or lead the
46+
development of several courses and workshops at the University of Toronto and
47+
is now an assistant professor of teaching in the statistics department at the
48+
University of British Columbia. Joel cares deeply about spreading data literacy
49+
and excitement over programmatic data analysis, which is reflected in his
50+
contributions to open source projects and data science learning resources. You
51+
can read more about Joel on his [personal page](https://joelostblom.com/).

source/img/altair_syntax.png

180 KB
Loading

source/img/code-figures.pptx

386 KB
Binary file not shown.

source/img/completion_menu.png

35.4 KB
Loading
Loading
Loading
Loading

source/img/faithful_plot.png

-118 KB
Loading

0 commit comments

Comments
 (0)