-
Notifications
You must be signed in to change notification settings - Fork 11
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
add, edit, some minor formatting changes
- Loading branch information
Showing
5 changed files
with
197 additions
and
1 deletion.
There are no files selected for viewing
File renamed without changes.
File renamed without changes.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,153 @@ | ||
# Data Sources & How to Read Them {#datasources} | ||
|
||
|
||
```{r echo = FALSE} | ||
library(knitr) | ||
opts_chunk$set(message = FALSE, warning = FALSE, cache = TRUE) | ||
options(width = 100, dplyr.width = 100) | ||
library(ggplot2) | ||
theme_set(theme_light()) | ||
``` | ||
|
||
|
||
|
||
|
||
## Introduction | ||
|
||
What is data science without _data_? Links to tools to import data from a variety of sources, along with a few indexes and compendiums of data sources. | ||
|
||
--- | ||
### Sources | ||
|
||
#### listings | ||
|
||
University of Alberta Libraries, Economics: [List of databases](http://guides.library.ualberta.ca/c.php?g=329741&p=2334221) | ||
|
||
Simon Fraser University Library: [Gender, Sexuality & Women's Studies Information Resources: Facts & Data](http://www.lib.sfu.ca/help/research-assistance/subject/gsws/factsdata) | ||
|
||
#### open data sources | ||
|
||
[United Nations Population Prospects](https://esa.un.org/unpd/wpp/) - detailed country population data | ||
|
||
* [populationpyramid.net](https://www.populationpyramid.net/) uses this data | ||
|
||
[OECD world data, by country](https://data.oecd.org/) | ||
|
||
[Gapminder](https://www.gapminder.org/data/) - all indicators displayed in Gapminder World | ||
|
||
--- | ||
|
||
### R packages | ||
|
||
|
||
##### `cancensus` | ||
|
||
[Census of Canada (including the National Household Survey)](https://github.com/mountainMath/cancensus) | ||
|
||
|
||
|
||
#### `cansim` | ||
|
||
**package** | ||
|
||
[github](https://github.com/mountainMath/cansim) | ||
|
||
**articles** | ||
|
||
Dmitry Shkolnik (2018-08-01) [The CANSIM package, Canadian tourism, and slopegraphs](https://www.dshkol.com/2018/cansim-package-tourism-slopegraphs/) | ||
|
||
|
||
#### `CANSIM2R` | ||
|
||
[CANSIM2R: Directly Extracts Complete CANSIM Data Tables](https://cran.r-project.org/web/packages/CANSIM2R/index.html) | ||
|
||
github: [CANSIM2R](https://github.com/MarcoLugo/CANSIM2R) | ||
|
||
* Andrew Clarke (2017-08-09) [StatCan API's Discovered](https://www.mytinyshinys.com/2017/08/09/statcanapi/) | ||
|
||
|
||
##### `gapminder` | ||
|
||
[gapminder: Data from Gapminder](https://cran.r-project.org/web/packages/gapminder/index.html) An excerpt of the data available at [Gapminder.org]. For each of 142 countries, the package provides values for life expectancy, GDP per capita, and population, every five years, from 1952 to 2007. | ||
|
||
|
||
##### `Lahman` | ||
|
||
[Lahman: Sean 'Lahman' Baseball Database](https://cran.r-project.org/web/packages/Lahman/) Provides the tables from the 'Sean Lahman Baseball Database' as a set of R data.frames. It uses the data on pitching, hitting and fielding performance and other tables from 1871 through 2015, as recorded in the 2016 version of the database. | ||
|
||
|
||
--- | ||
### R readers | ||
|
||
**articles** | ||
|
||
[R database interfaces](http://www.burns-stat.com/r-database-interfaces/) | ||
|
||
|
||
#### `rio` | ||
|
||
**package** | ||
|
||
CRAN page: _currently only development version_, see tidyverse link below | ||
|
||
vignette: [Import, Export, and Convert Data Files](https://cran.r-project.org/web/packages/rio/vignettes/rio.html) | ||
|
||
|
||
|
||
#### `googledrive` | ||
|
||
**package** | ||
|
||
CRAN page: _currently only development version_, see tidyverse link below | ||
|
||
tidyverse page: [`googledrive`](https://tidyverse.github.io/googledrive/) | ||
|
||
|
||
|
||
#### `foreign` | ||
|
||
**package** | ||
|
||
CRAN page: [foreign: Read Data Stored by Minitab, S, SAS, SPSS, Stata, Systat, Weka, dBase, ...]( https://CRAN.R-project.org/package=foreign) | ||
|
||
**articles** | ||
|
||
* [How to open an SPSS file into R](http://www.milanor.net/blog/how-to-open-an-spss-file-into-r/), by Davide Massidda (2014-03-26) | ||
|
||
|
||
|
||
#### Stata files | ||
|
||
**package `read.dta`** | ||
|
||
Reads a file in Stata version 5–12 binary format into a data frame. | ||
|
||
CRAN page: [`read.dta`: Read Stata Binary Files](http://stat.ethz.ch/R-manual/R-devel/library/foreign/html/read.dta.html) | ||
|
||
|
||
**package readstata13** | ||
|
||
Function to read and write the 'Stata' file format. | ||
|
||
CRAN Page: [readstata13: Import 'Stata' Data Files](readstata13: Import 'Stata' Data Files) | ||
|
||
|
||
|
||
|
||
|
||
#### `TSdbi` and related packages | ||
|
||
**package** | ||
|
||
CRAN page: [TSdbi: Time Series Database Interface]( https://CRAN.R-project.org/package=TSdbi) | ||
|
||
Note: `TSdbi` has some related extension packages: | ||
|
||
* CRAN page: [TSdata: 'TSdbi' Illustration](https://cran.r-project.org/web/packages/TSdata/index.html) | ||
* This package gives an overview and usage examples for all the `TSdbi` family of packages | ||
|
||
* CRAN page: [TSPostgreSQL: 'TSdbi' Extensions for 'PostgreSQL'](https://cran.r-project.org/web/packages/TSPostgreSQL/index.html) | ||
|
||
* CRAN page: [TSsdmx: 'TSdbi' Extension to Connect with 'SDMX'](https://cran.r-project.org/web/packages/TSsdmx/index.html) | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
File renamed without changes.