This post looks at the variables in the dataset one at a time to get a clearer picture of how each is distributed before combining it with other variables. The US dataset was also subset to keep only observations with a severity level of 4 (indicating a significant impact on traffic, such as a long delay). This was done to avoid computational difficulties with the Altair package in R.
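As a rough illustration of the subsetting step, here is a minimal sketch in base R. The data frame `accidents` and its columns (`ID`, `Severity`, `State`) are hypothetical stand-ins for the real US dataset, which is not shown here, so the names would need adjusting to match the actual file.

```r
# Toy stand-in for the US dataset; the column names are assumptions,
# not taken from the real data.
accidents <- data.frame(
  ID = c("A-1", "A-2", "A-3", "A-4", "A-5"),
  Severity = c(2, 4, 3, 4, 1),
  State = c("OH", "CA", "TX", "FL", "NY"),
  stringsAsFactors = FALSE
)

# Keep only the observations with a severity level of 4.
severe <- accidents[accidents$Severity == 4, ]

# Compare row counts to see how much the subset shrinks the data.
nrow(accidents)
nrow(severe)
```

Filtering before plotting means the smaller `severe` data frame, rather than the full dataset, is what gets passed on to the charting code.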