Data Science: References

Most recent update: 2016-03-26

“Data science” is a big topic, one that encompasses the academic discipline of Statistics, as well as data collection, data carpentry (a.k.a. munging or cleaning), data visualization, and communication (including data journalism).

This is an attempt to catalogue a variety of data science references. Rather than try to group them thematically, I’ve added a variety of metatags and annotations.

  • exploratory data analysis
  • data journalism
  • data science
  • data visualization

Additionally, some of the more prominent and prolific authors will get their own sections (and the metatags are applied to their work in general).

References

Augur, Hannah (2016) “Beginner’s Guide to the History of Data Science”, Dataconomy.com, 2016-03-11.

  • data science

Barabba, V.P. (1990) “Through a Glass Less Darkly”, Presidential Address at the Annual Meeting of the American Statistical Association, Aug. 7, 1990. Printed in the Journal of the American Statistical Association, [Volume 86, Issue 413, 1991](http://amstat.tandfonline.com/doi/abs/10.1080/01621459.1991.10474995). (An alternate version that is not behind a paywall is on the ASA website, in the Presidential Papers section {PDF}.)

  • data science

Barlow, Mike (2015) Learning to Love Data Science: Explorations of Emerging Technologies and Platforms for Predictive Analytics, Machine Learning, Digital Manufacturing and Supply Chain Optimization, O’Reilly Media.

  • data science

#### Alberto Cairo

Cairo, Alberto (2013) The Functional Art: An Introduction to Information Graphics and Visualization, New Riders.

Cairo, Alberto (2016) The Truthful Art: Data, Charts, and Maps for Communication, New Riders.


Chang, Winston (2012) R Graphics Cookbook, O’Reilly.

  • data visualization
  • R

#### William S. Cleveland

  • data science
  • data visualization

Cleveland, William S. (1993) Visualizing Data, Hobart Press.

Cleveland, William S. (1994) The Elements of Graphing Data (revised edition), Hobart Press.


Conway, Drew (2013) “The Data Science Venn Diagram”.

  • data science

Darmon, David (2013?) “Statistics, Data Science, and Silver”, Third Order Scientist, 2013-08-11

  • data science

Deming, W. Edwards (1965) “Principles of Professional Statistical Practice”. Ann. Math. Statist., 36 (1965), no. 6, 1883–1900.

  • data science

Donoho, David (2015) “50 years of Data Science”, 2015-09-18.

  • data science

Evergreen, Stephanie D.H. (2014) Presenting Data Effectively: Communicating Your Findings for Maximum Impact, Sage.

  • data visualization

#### Stephen Few

  • data journalism
  • data science
  • data visualization

Few, Stephen (2009) Now You See It: Simple Visualization Techniques for Quantitative Analysis, Analytics Press.

Few, Stephen (2012) Show Me the Numbers: Designing Tables and Graphs to Enlighten (second edition), Analytics Press.

Few, Stephen (2013) Information Dashboard Design: Displaying Data for At-A-Glance Monitoring (second edition), Analytics Press.

Few, Stephen (2015) Signal: Understanding What Matters in a World of Noise, Analytics Press.


Foreman, John W. (2014) Data Smart: Using Data Science to Transform Information into Insight. Wiley.

  • data science

Franck, Christopher (2013) “Is Nate Silver a Statistician?”, AmStatNews, 2013-10-01.

  • data science

Friendly, Michael and David Meyer (2016) Discrete Data Analysis with R: Visualization and Modeling Techniques for Categorical and Count Data, CRC Press.

  • R

Grolemund, Garrett (2014) Hands-On Programming With R, O’Reilly Media.

  • R

Grolemund, Garrett and Hadley Wickham (2016) R for Data Science.

  • data science
  • R

Hayes, Bob (2015) “Getting More Insights from Data: Nine Facts about the Practice of Data Science”, businessoverbroadway.com, 2015-12-14.

  • data science

Hayes, Bob (2016) “Top 10 Skills in Data Science”, businessoverbroadway.com, 2016-03-12.

  • data science

Hays, Constance L. (2004) “What Wal-Mart Knows About Customers’ Habits”, New York Times, 2004-11-14

  • data science

Hilfiger, John Jay (2015) Graphing Data with R, O’Reilly Media.

  • data visualization
  • R

Ishikawa, Kaoru (?) Seven Basic Tools of Quality

  • data visualization
  • exploratory data analysis

Jaffe, Andrew (2016) “The Evolution of a Data Scientist”, simplystatistics.org, 2016-03-21.

  • data science

Kennedy, Pagan (2013) “Who Made That Universal Product Code?”, New York Times, 2013-01-04.

  • big data

Knaflic, Cole Nussbaumer (2015) Storytelling with Data: A Data Visualization Guide for Business Professionals, Wiley.

Loukides, Mike (2010) “What is data science? The future belongs to the companies and people that turn data into products.”, oreilly.com, 2010-06-02.

  • data science

Matloff, Norm (2016) “Some Comments on Donoho’s ‘50 Years of Data Science’”, matloff.wordpress.com, 2016-01-23.

  • data science

Mimno, David (2014) “Data carpentry”, blog entry at mimno.org, 2014-08-19.

  • data carpentry
  • data science

Naur, Peter (1974) Concise Survey of Computer Methods, Studentlitteratur. (A summary of the book is here.)

  • data science

New York University, Data Science Department (?) “What is Data Science?”

  • data science

Ojeda, Tony; Sean Patrick Murphy; Benjamin Bengfort; and Abhijit Dasgupta (2014) Practical Data Science Cookbook, Packt Publishing.

  • data science
  • R

#### Roger D. Peng

  • data science
  • R

Peng, Roger D. (2015) Exploratory Data Analysis with R, Leanpub.

Peng, Roger D. (2014-2016) R Programming for Data Science, Leanpub.

Peng, Roger D. (2015-2016) Report Writing for Data Science in R, Leanpub.

Peng, Roger D. (2016) “Non-tidy data”, simplystatistics.org, 2016-02-17.

Peng, Roger D. and Elizabeth Matsui (2015) The Art of Data Science: A Guide for Anyone Who Works with Data, Leanpub.


Rickert, Joseph (2013) “Nate Silver addresses assembled statisticians at this year’s JSM”, Revolution Analytics, 2013-08-08.

  • data science

Robbins, Naomi B. (2004) Creating More Effective Graphs (reprinted 2013), Chart House.

  • data science
  • data visualization

Saunders, Todd (2014) “Data Science and Data Scientists: What’s in a Name?”, CBIG Consulting, 2013-11-11.

  • data science

Schutt, Rachel & O’Neil, Cathy (2014) Doing Data Science: Straight Talk From the Frontline. O’Reilly.

  • data science

#### Simply Statistics blog

  • data science

Simply Statistics (2012) “Schlep blindness in statistics”, 2012-05-28.

Simply Statistics (2012) “Statistics/statisticians need better marketing”, 2012-08-14.

Simply Statistics (2013) “Data scientist is just a sexed up word for statistician”, 2013-08-08.


Statistical & Scientific Thinking (2015) “Basic Principles of Using Data”, Statistical & Scientific Thinking, 2015-08-25.

  • data science

Statistics Views (2013) “Nate Silver: What I need from statisticians”, Statistics Views, 2013-08-23.

  • data science

Stray, Jonathan (2016) The Curious Journalist’s Guide to Data, at www.gitbook.com

  • data journalism
  • data science

Toomey, Dan (2014) R for Data Science, Packt Publishing.

  • data science
  • R

#### Edward R. Tufte

  • data visualization

Tufte, Edward R. (1983) The Visual Display of Quantitative Information, Graphics Press.

Tufte, Edward R. (2001) The Visual Display of Quantitative Information (2nd edition), Graphics Press.

Tufte, Edward R. (1990) Envisioning Information, Graphics Press.

Tufte, Edward R. (1997) Visual Explanations: Images and Quantities, Evidence and Narrative, Graphics Press.

Tufte, Edward R. (2006) Beautiful Evidence, Graphics Press.


#### John Tukey

  • exploratory data analysis
  • data science
  • data visualization

Tukey, John (1962) “The Future of Data Analysis”, The Annals of Mathematical Statistics, 33(1), pp. 1–67.

Tukey, John (1977) Exploratory Data Analysis, Addison Wesley.


#### Hadley Wickham

  • data visualization
  • R

Grolemund, Garrett and Hadley Wickham (2016) R for Data Science.

Wickham, Hadley (2015) Advanced R, CRC Press.

Wickham, Hadley (2015) R Packages, O’Reilly Media.

Wickham, Hadley (2016) ggplot2: Elegant Graphics for Data Analysis (2nd edition), Springer.


Wong, Dona M. (2010) The Wall Street Journal Guide to Information Graphics: The Dos & Don’ts of Presenting Data, Facts, and Figures, W.W. Norton & Company.

  • data visualization

#### Nathan Yau

Yau, Nathan (2011) Visualize This: The FlowingData Guide to Design, Visualization, and Statistics, Wiley.

Yau, Nathan (2013) Data Points: Visualization That Means Something, Wiley.


Zumel, Nina and John Mount (2014) Practical Data Science with R, Manning.

  • data science
  • R

### Miscellany

-30-

Written on March 26, 2016