Initial Data Visualization
Last updated
Last updated
Note: Of the following plots, some were made in R and some in Python. Of course the same plots can be achieved in both languages, but they might look a bit different. Don't worry to much about recreating the exact same plots as we did.
Now that we got a feeling for the data, it’s time to explore and visualize it before diving into cleaning. Working with raw data helps identify potential issues and anomalies that need to be addressed during the cleaning process. Data visualization helps reveal patterns, trends, and insights that may not be immediately obvious from the raw data alone. By creating bar charts and other visual aids, you’ll enhance your understanding of the information and uncover valuable insights that can guide your analysis. Let’s dive in and see what the raw data has to show!
First we want to look at our sample by plotting some demographics (age, gender, descent). Think about your choice of plot type!
This is how a bar chart of the victims gender might look like:
Here is an example for the victim's gender distribution:
Often times we get unexpected results especially when working with data that we didn’t collect ourselves. Causes can be very diverse so that's why it's important to understand the underlying data before making conclusions. Can you explain everything in your plots or do you find any anomalies?
: Matplotlib's pyplot
module offers a variety of functions to create visually appealing charts.
: Quick guide to creating visually pleasing charts in Python.
For more, check and out.