Data Science Links
Careers & Industry
- Data Scientist: The Sexiest Job of the 21st Century
- Data Science Is Different Now
- On Moving from Statistics to Machine Learning, the Final Stage of Grief
- An Intake Form for Data Requests
- Is Data Scientist Still the Sexiest Job of the 21st Century?
- Why Business Data Science Irritates Me
- Big Data is Dead
- Most Data Work Seems Fundamentally Worthless
- Your Organization Probably Doesn’t Want To Improve Things
- I Will F***ing Piledrive You if You Mention AI Again
Data Visualization
- Review of The Visual Display of Quantitative Information
- A Brief History of Data Visualization
- What Ordinary People Need Most from Information Visualization Today
- Beyond Bar and Line Graphs: Time for a New Data Presentation Paradigm
- Market Cafe Magazine
- Visualizing the Uncertainty in Data
- From Data to Viz
- Error Bars Considered Harmful: Exploring Alternate Encodings for Mean and Error
- Fundamentals of Data Visualization
- Learning Data Visualization
- Dataviz Inspiration
- Friends Don’t Let Friends Make Bad Graphs
- 1 Dataset 100 Visualizations
- How People Actually Lie With Charts
- Best Practices for Data Visualisation
- The Mythos of Visualization Literacy
- Scikit-learn Visualization Guide: Making Models Speak
- Reflections on UpSet
- List of Data Visualization Books
- The Data Visualization Catalogue
- Data Visualization Teaching and Learning Materials
History
Philosophy
- Prior Probabilities
- The Well-Posed Problem
- Monkeys, Kangaroos, and N
- Statistical Modeling: The Two Cultures
- Review of Probability Theory: The Logic of Science
- Philosophy of Probability
- Probability That a Number Is Prime
- Who Knows What Evil Lurks in the Hearts of Men? The Bayesian Doesn’t Care
- Kolmogorov’s Axiomatisation and its Discontents
- Levels of Uncertainty
- Philosophy of Statistics
- Abandon Statistical Significance
- Simplicity and Indifference
- What Is Probability?
- Can You Have Confidence in a Confidence Interval?
- Surely God Loves 51 km/h Nearly As Much as 49 km/h?
- Statistical Factuality Versus Practicality Versus Poetry
- Evidence Versus Logic in Scientific Reform
Puzzles & Paradoxes
References & Resources
- Awesome Public Datasets
- Introduction to Modern Statistics (2nd Ed)
- D3 in Depth
- Happy Git and GitHub for the useR
- 3Blue1Brown: Probability
- Causal Inference: The Mixtape
- The Good Research Code Handbook
- Bayes Rules! An Introduction to Applied Bayesian Modeling
- easystats: An R Framework for Easy Statistical Modeling, Visualization, and Reporting
- Elements of Data Science
- An Introduction to Bayesian Thinking
- Statistical Rethinking with brms, ggplot2, and the tidyverse
- Generalized Additive Models in R
- Mathematical Indroduction to Deep Learning: Methods, Implementations, and Theory
- Modern Data Science with R
- Regression Modeling Strategies
- Understanding Deep Learning
- What Is Entropy?
- Active Statistics
- Causal Inference in R
- Review of Statistical Rethinking
- Which Books, Papers, and Blogs Are in the Bayesian Canon?
- Code Review for Statisticians, Data Scientists & Modellers
- Data Creators Club
- Bayesian Data Analysis
- The Truth About Linear Regression
Statistical Methodology
- Stats Can’t Make Modeling Decisions
- The Mythos of Model Interpretability
- Common Statistical Tests Are Linear Models (or: How To Teach Stats)
- The Medical Test Paradox, and Redesigning Bayes’ Rule
- Regression, Fire, and Dangerous Things
- Bayes Rule in Odds Form
- Why Do Tree-Based Models Still Outperform Deep Learning on Tabular Data?
- There Are No Magic Outcome Variables
- Never Test For Normality
- The Shortcomings of Standardized Regression Coefficients
- Preventing Common Misconceptions About Bayes Factors
- Simulating Confounders, Colliders and Mediators
- Go Get the Data
- Three Advantages of Non-AI Models
- Log Transforms, Geometric Means and Estimating Population Totals
- Directed Acyclic Graphs
- Statistician’s Time Series Hack
- Tell Me What You Really Want: How to Identify the Real Business Question
- First Time Seeing a Rare Event
- Solving for the Hidden Data
- What Size Is That Correlation?
- Sensitivity Counts Against You
- A/B Tests for Engineers
- Using Dimensional Analysis To Check Probability Calculations
- Outlier Detection
- Why Effect Sizes Selected for Significance are Inflated
- What Good is Analysis of Variance?
- Stepwise Selection of Variables in Regression Is Evil