Let’s scrape Reddit data in R!

Manpreet Singh
4 min read · May 25, 2021

Welcome back! R is an awesome programming language for data science, so let’s do some data processing with it! In this project we’ll scrape some data from Reddit and format it. It’s a pretty basic project, but a great one nonetheless. Fun fact: I’ve created a tool for myself (definitely a bit more intricate than this) that I use on an everyday basis, so some components of this project are definitely reusable in other projects. With that out of the way, let’s get to coding!

Requirements

Before going further, make sure you have RStudio installed on your machine. Check out this article I wrote the other month on how to install it:

I’m hoping you also have some experience with R. With that out of the way, let’s get started!

Building The Project

Let’s start off by installing the following packages (if needed):

install.packages("jsonlite")
install.packages("tidyverse")
install.packages("dplyr")
install.packages("httr")
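
If you’d rather not reinstall packages you already have, here is a minimal sketch that installs only what’s missing. The helper name `missing_packages` is made up for this example, and the sketch assumes the HTTP client we want is the httr package. It also only triggers the install in an interactive session (like the RStudio console), which is a design choice on my part, not a requirement.

```r
# Packages this project relies on (httr is assumed to be the HTTP client)
pkgs <- c("jsonlite", "tidyverse", "dplyr", "httr")

# Hypothetical helper: returns the names of packages that are not installed yet
missing_packages <- function(pkgs) {
  # requireNamespace() returns TRUE if the package can be loaded, FALSE otherwise
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

to_install <- missing_packages(pkgs)

# Only install when running interactively and something is actually missing
if (interactive() && length(to_install) > 0) {
  install.packages(to_install)
}
```

Side note: dplyr already ships as part of the tidyverse, so installing it separately is harmless but not strictly needed.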
