This is a work-in-progress page where I collect some of my projects to show my skills with statistical analysis, visualizations, and coding. I use R, several tidyverse packages, R Markdown, rtweet, httr and jsonlite to work with APIs, and dashboard tools such as Tableau and Google Data Studio. I also created a GitHub repository containing the SQL queries necessary to complete a popular tutorial.
When politicians tweet exclamation points
Was it me, or did Trump tweet a lot of exclamation points? In this project, I try to answer this question. I used rtweet to scrape a few thousand tweets from the Twitter handle of 10 politicians, then I used ggplot to visualize the result. Click here to see the project.
White suicide in the United States
In this project, I revisit my research on the link between suicide and employment among white residents of the United States in 2000-2016. I used Tableau to show interactively the trend in suicide and the relationship between suicide and employment in each state. The Tableau dashboard is available at this link.
Does the popularity of a name predict its age?
This is a fun project I did to practice using APIs with R. I collected data on the most popular names in the United States in the last 100 years (source: the Social Security Administration) and used Agify’s API to find the predicted age of a person having those names. I plot the data with ggplot2. So, does the popularity of a name predict its age? Well, if we look at the most popular names in the last century, not so much. See the project at this link.
Suicide trends in the United States by race
In this project, I use another dashboard tool (Google Data Studio) to show the racial trends in suicide in the United States. Overall, suicide increased for almost two decades before declining in 2018. However, when you take race into account, you realize that this decline did not occur among all groups. I think these recent trends will generate some very interesting research in the years ahead. See the project at this link.
Most academics do not work with databases, so they do not use SQL for their research. SQL is a key tool to learn if you want to explore a career outside of academia. There are some good news: SQL is quite straightforward, and there are many free resources to learn it. One of the best is the tutorial on the data platform Mode. I created a GitHub repository to collect all the SQL queries from the Mode tutorial. It's a work in progress, but eventually this will become a great one-page resource for everyone who want to learn, refresh, or quickly find a solution to anything SQL-related.