R: Table() Function in R

The table function in R is a great way to aggregate data for better evaluation and visualization. It is basically a pivot table like function for R. For those not familiar with pivot tables, I do have quick intro here: Excel: Intro to Pivot Tables

Building a table in R is easy. The syntax is simply table() with the name of variable you wish to aggregate inside the parenthesis.

In this example, I will be working with the imbalanced data set. You can download it here: imbalanced

Note: I’m importing data from an excel file in this example. You will need the readxl library installed to follow along. For instructions on installing libraries: R: Installing Packages with Dependencies

Let’s load the data and take a look at it:

tab1

This data set has 1000 records, with 3 columns (or variables). The data represents a listing of possible fraudulent credit card transactions.

So what if you just want to know how many records are Fraud or Not-Fraud

Simply use the table() function with the variable df$Fraud

tab2

Next we can try it with location

tab3

You can even use more than one variable with the function:

tab4

 

Advertisements