Let’s scrape Reddit data in R!

4 min read · May 25, 2021

Welcome back! R is an awesome programming language for data science, so let’s do some data processing with it! In this project we’ll scrape some data from Reddit and format it. It’s a pretty basic project, but a great one nonetheless. Fun fact: I’ve created a similar tool for myself (definitely a bit more intricate than this) that I use on an everyday basis, so some components of this project are reusable in other projects. With that out of the way, let’s get to coding!

Requirements

Before going further, make sure you have RStudio installed on your machine. Check out this article I wrote the other month on how to install it:

How To Install R & RStudio On MacOS / Windows / Linux

R is a very powerful tool for a ton of different data operations, but you already know that, so you don’t need a 3…

preettheman.medium.com

I’m hoping you also have some experience with R. With that out of the way, let’s get started!

Building The Project

Let’s start off by installing the following packages (if needed):

install.packages("jsonlite")
install.packages("tidyverse")
install.packages("dplyr")
install.packages("httr")
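
Since the packages only need installing "if needed", here is a minimal sketch of one way to skip the ones you already have. The helper names `missing_packages` and `install_if_missing` are made up for this example and are not part of any package; the sketch only relies on base R's `requireNamespace()` and `install.packages()`.

```r
# Hypothetical helper: return the packages from `pkgs` that are not installed.
# requireNamespace() returns FALSE (instead of erroring) when a package is missing.
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

# Hypothetical helper: install only what is missing, then report what was installed.
install_if_missing <- function(pkgs) {
  to_install <- missing_packages(pkgs)
  if (length(to_install) > 0) {
    install.packages(to_install)
  }
  invisible(to_install)
}
```

You could then call `install_if_missing(c("jsonlite", "tidyverse", "dplyr", "httr"))` once at the top of your script. That list assumes the HTTP package intended for this project is `httr`; note too that `dplyr` already ships as part of `tidyverse`, so listing it separately is harmless but optional.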

Written by Manpreet Singh