Commit 6c9d35f

fixes boston talk slides
1 parent 0ccc4c6 commit 6c9d35f

2 files changed: +63 -28 lines changed

odsc-boston/odsc-open-gov-beer.Rmd

Lines changed: 26 additions & 14 deletions
@@ -1,7 +1,7 @@
 ---
 title: "<p style='color:#ffffff; background-color:#009fe1;'>Open Government Data and Beer Analytics</p> <p>🏛 + 🍺</p> <p>Open Data Science Conference | East</p>"
 author: "Jasmine Dumas | <a href='https://twitter.com/jasdumas'>@jasdumas</a> | <a href='http://jasdumas.github.io/'>jasdumas.github.io</a>"
-date: "Fridad, May 5th, 2017"
+date: "Friday, May 5th, 2017"
 output:
 slidy_presentation: default
 beamer_presentation: default
@@ -19,6 +19,7 @@ library(ggplot2)
 library(ttbbeer)
 library(tidyr)
 library(plotly)
+library(emo)
 ```

 ## Hi Boston!
@@ -56,17 +57,25 @@ library(plotly)
 - measure customer preferences and predict seasonality
 - gather insights on industry performance

-<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/4/42/William_Sealy_Gosset.jpg/240px-William_Sealy_Gosset.jpg" style="position: center; right: 0; left: 0; top: 1"></img>
+<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/4/42/William_Sealy_Gosset.jpg/1200px-William_Sealy_Gosset.jpg" style="position: absolute; right: 0; top: 0; width: 450px"></img>
+
+<img src="https://upload.wikimedia.org/wikipedia/commons/thumb/4/41/Student_t_pdf.svg/300px-Student_t_pdf.svg.png" style="position: center; right: 3"></img>

 ## Discovering analysis-ready beer datasets can be difficult

-- **Datasets**: [Craft Beer Sample Data](http://www.craftbeeranalytics.com/beer-data.html), [SNAP-RateBeer*](https://snap.stanford.edu/data/web-RateBeer.html) | **Accessible to academic researchers only**
-- **API's**: [RateBeer](http://www.ratebeer.com/json/ratebeer-api-agreement.asp), [Untapped](https://untappd.com/api/docs), [The Beer Mapping Project](http://beermapping.com/api/), [BreweryDB](http://www.brewerydb.com/apps) | **Restrictions on usage or require a user account**
-- **Projects about beer**: [yhat Beer Advocate Recommender](http://blog.yhat.com/posts/recommender-system-in-r.html) and [Crowd-sourced Beer Distribution Map](http://www.seekabrew.com/distro/index.html)
+- **Datasets**:
+- [Craft Beer Data](https://www.kaggle.com/nickhould/craft-cans) `r emo::ji("free")`
+- [SNAP-RateBeer](https://snap.stanford.edu/data/web-RateBeer.html) `r emo::ji("arrow_forward")` **Accessible to academic researchers only**
+
+- **API's**:
+- [RateBeer](http://www.ratebeer.com/json/ratebeer-api-agreement.asp)
+- [Untapped](https://untappd.com/api/docs) `r emo::ji("arrow_forward")` **Restrictions on usage or require a user account**
+- [The Beer Mapping Project](http://beermapping.com/api/)
+- [BreweryDB](http://www.brewerydb.com/apps)

-<img src="img/fred-beer.gif" style="position: center; right: 0; left: 0; top: 1"></img>
+![](img/fred-beer.gif)

-## How to search for analysis-ready datasets in general
+## How to generally search for analysis-ready datasets

 - [Google public data explorer](https://www.google.com/publicdata/directory)
 - [Github](https://github.com/jasdumas/awesome-public-datasets)
@@ -93,7 +102,7 @@ The "clearinghouse" for open U.S. government data is located at [data.gov](https

 ## Examples of datasets that are not analysis-ready

-- Datasets that have inconsistent formats, mixed column types, and cryptic data fields make it difficult for analysis
+- Datasets that have **inconsistent formats, mixed column types, and cryptic data fields** make it difficult for analysis

 <img src="img/TTB-Reports.png" style="position: center; right: 0; left: 0; top: 1"></img>

@@ -111,22 +120,25 @@ The "clearinghouse" for open U.S. government data is located at [data.gov](https

 ## I developed a R package for beer statistics, called `ttbbeer`

-- This R data package provides **one dataset** for materials used at U.S. breweries as listed in the Beer Monthly Statistical Releases, and **eight datasets** for historical tax rates of distilled spirits, wine, beer, champagne, and tobacco from the U.S. Department of the Treasury, Alcohol and Tobacco Tax and Trade Bureau (TTB)
-
-- Available on [CRAN](https://cran.r-project.org/web/packages/ttbbeer/index.html) and [Github](https://github.com/jasdumas/ttbbeer)
+- This R data package provides data from the **U.S. Department of the Treasury, Alcohol and Tobacco Tax and Trade Bureau (TTB)**:
+- **one dataset** for materials used at U.S. breweries as listed in the Beer Monthly Statistical Releases
+- **eight datasets** for historical tax rates of distilled spirits, wine, beer, champagne, and tobacco
+- Available on [CRAN](https://cran.r-project.org/web/packages/ttbbeer/index.html) [![](http://cranlogs.r-pkg.org/badges/grand-total/ttbbeer)](http://cran.rstudio.com/web/packages/ttbbeer/index.html) and [Github](https://github.com/jasdumas/ttbbeer)

 ![](img/gatsby-drink.gif)

 ## How I developed the `ttbbeer` package

-- Hand transcribing from PDF documents to an excel workbook 😭
-- Web-scraping using the R package [`rvest`](https://github.com/hadley/rvest) 😀
+- Hand transcribing from PDF documents to an excel workbook `r emo::ji("cry")`
+- Web-scraping using the R package [`rvest`](https://github.com/hadley/rvest) `r emo::ji("smile")`
+- Data cleaning with `stringr`
+- Date Fixes `# originally "101/01/1951"`

 ![](img/jlaw-beer.gif)

 ## Insights from this data

-```{r, echo=FALSE, fig.height=6, fig.width=8, message=FALSE, warning=FALSE}
+```{r, echo=FALSE, fig.height=6.5, fig.width=10, message=FALSE, warning=FALSE}

 data("beermaterials")
