# (PART) Data Science {-}
# This thing called "data science" {#datasciencedefined}
## Theory
David Donoho (2015) [_50 Years of Data Science_](http://courses.csail.mit.edu/18.337/2015/docs/50YearsDataScience.pdf), based on a presentation at the Tukey Centennial workshop, Princeton NJ Sept 18 2015.
* reprinted in [_Journal of Computational and Graphical Statistics_](http://amstat.tandfonline.com/toc/ucgs20/26/4?nav=tocList), Volume 26, No. 4 (2017), with a variety of discussion papers and responses, including:
- Jenny Bryan and Hadley Wickham, ["Data Science: A Three Ring Circus or a Big Tent?"](https://arxiv.org/ftp/arxiv/papers/1712/1712.07349.pdf) {discussion of Donoho, _50 Years of Data Science_}
Hadley Wickham (2018) [Readings in Applied Data Science](https://github.com/hadley/stats337), course materials for Stanford Stats337 (spring 2018)
Iain Carmichael and J.S. Marron (2018) "Data science vs. statistics: two cultures?" [@Carmichael_Marron_2018]
David Robinson (2018-01-09) ["What's the difference between data science, machine learning, and artificial intelligence?"](http://varianceexplained.org/r/ds-ml-ai/)
<blockquote class="twitter-tweet" data-lang="en"><p lang="en" dir="ltr">Let's try that again <a href="https://t.co/1G4wHvGvdd">pic.twitter.com/1G4wHvGvdd</a></p>— Data Science Renee (/@/BecomingDataSci) <a href="https://twitter.com/BecomingDataSci/status/773001808096661504?ref_src=twsrc%5Etfw">September 6, 2016</a></blockquote>
Mango Solutions (2018-08-15) ["Demystifying Data Science Terminology"](https://www.mango-solutions.com/blog/demystifying-data-science-terminology)
Martin Monkman (2019-06-02) ["Same name, different bird"](https://martinmonkman.com/post/2019-06-02_same-name/)
## Philosophy
Angela Bassa (2017) [Data Alone Isn’t Ground Truth … and why you should always carry a healthy dose of skepticism in your back pocket](https://medium.com/@angebassa/data-alone-isnt-ground-truth-9e733079dfd4)
Tim Davies and Mark Frank (2013) [‘There’s no such thing as raw data’. Exploring the sociotechnical life of a government dataset](https://eprints.soton.ac.uk/352115/), conference paper from Web Science 2013, France (02 - 04 May 2013)
* [alternate source 1](https://dl.acm.org/citation.cfm?id=2464472)
* [alternate source 2](http://students.ecs.soton.ac.uk/mwra1g13/msc/comp6037/timed_ex_pdf/p75-davies.pdf)
Bertrand Russell, "The Social Responsibilities of Scientists" [@10.2307/1705325]
***
## Using R for Data Science
Hadley Wickham & Garrett Grolemund (2016) _R for Data Science_ [@Wickham_Grolemund2016]
Roger Peng, _R Programming for Data Science_ [@Peng2018]
* Roger Peng's [other books on LeanPub](https://leanpub.com/u/rdpeng)
Chester Ismay and Albert Y. Kim, 2019-02-24, [_Modern Dive: Statistical Inference via Data Science (A moderndive into R and the tidyverse)_](http://moderndive.com/) [@Ismay_Kim_2018] (was _An Introduction to Statistical and Data Sciences via R_)
JD Long and Paul Teetor, 2019-09-26, [_R Cookbook, 2nd Edition_](https://rc2e.com/)
Chester Ismay and Patrick C. Kennedy, 2018-05-23, [Getting used to R, RStudio, and R Markdown](https://ismayc.github.io/rbasics-book/)
Gordon Shotwell, 2019-12-30, ["Why I use R: They said the war was over..."](https://blog.shotwell.ca/posts/why_i_use_r/) -- a well-articulated explanation of why R is the best tool for data science
### Using R for Data Journalism
[.Rddj: Hand-curated, high quality resources for doing data journalism with R](https://rddj.info/)
[R for Journalists](http://www.scoop.it/t/r-for-journalists) at scoop.it
***
## The Practice of Data Science & Statistics
[Data science terminology](https://ubc-mds.github.io/resources_pages/terminology/) -- University of British Columbia, Master of Data Science program
Steph de Silva and John Ormerod, [The Bayesian and The Frequentist](https://www.thebayesianandthefrequentist.com/2019/03/12/to-whom-it-may-concern/) {blog}
Hadley Wickham, [Stats 337: Readings in Applied Data Science](https://github.com/hadley/stats337) -- reading list for Stanford University course, Spring 2018.
### Data Science and Public Policy
[Using Big Data to Solve Economic and Social Problems](http://www.equality-of-opportunity.org/bigdatacourse/) -- course at The Equality of Opportunity Project
### Data science careers
Jonny Brooks-Bartlett (2018-03-28) [Here’s why so many data scientists are leaving their jobs](https://towardsdatascience.com/why-so-many-data-scientists-are-leaving-their-jobs-a1f0329d7ea4) -- a splash of cold-water realism
Nate Oostendorp (2019-03-01) ["Radical Change Is Coming To Data Science Jobs"](https://www.forbes.com/sites/forbestechcouncil/2019/03/01/radical-change-is-coming-to-data-science-jobs/#5755e19bdfcc), forbes.com
***
## The skills of data science
Of course, how you set out to learn data science hinges on how you define data science. A typology based on data users might be helpful; knowing what sort of data scientist you are will shape what you might want to learn.
### Data science leadership
Thomas H. Davenport and Jeanne G. Harris, _Competing on Analytics: The New Science of Winning_, Harvard Business School Press, January 2007. [@Davenport_Harris_2007]
* Thomas H. Davenport, ["Competing on Analytics"](https://hbr.org/2006/01/competing-on-analytics), _Harvard Business Review_, January 2006.
Angela Bassa, ["Managing a Data Science Team"](https://hbr.org/2018/10/managing-a-data-science-team), _Harvard Business Review_, 2018-10-24
[Executive Data Science](https://www.coursera.org/specializations/executive-data-science) (Coursera)
[Building a Data Science Team](https://www.coursera.org/learn/build-data-science-team) (Coursera)
### Business analytics & business intelligence
Sahil Arora, ["Top Data Analytics Skills Required to Become a Data Analyst"](https://www.digitalvidya.com/blog/data-analytics-skills/), Digital Vidya, 2017-03-24
Samantha Leonard, ["6 Must-Have Skills For Data Analysts"](https://www.northeastern.edu/levelblog/2018/08/31/6-must-have-skills-data-analyst/), Northeastern University, 2018-08-31
Jay Gendron (2016) [_Introduction to R for Business Intelligence_](https://www.oreilly.com/library/view/introduction-to-r/9781785280252/)
### Data science
[Data Science](https://www.coursera.org/specializations/jhu-data-science), Johns Hopkins University via Coursera
[Mango Solutions' R training](https://www.mango-solutions.com/additional-solutions/r-training) provides structure by user proficiency.
Chris Engelhardt, [data_sci_guide](https://github.com/Chris-Engelhardt/data_sci_guide) -- A wealth of data science learning resources. "The overarching goal here is to provide anyone interested in learning data science with a wealth of open source, industry-best learning materials and learning tracks."
***