Getting started
For this project, you will work with crime data from Los Angeles covering the years 2020 to 2023. Two versions of the crime dataset are available, and you will need to choose one of them:
The original crime dataset, which includes all reported crimes.
The cleaned crime dataset, where crimes involving sensitive topics (such as sexual abuse or other potentially triggering content) have been removed.
The data source and variable explanations can be found .
You only need to download and work with one of these datasets, based on your preference. Note, however, that the cleaned dataset is not fully representative of the actual crime data because it excludes certain categories of crimes. Interpret any conclusions drawn from this version with caution, and do not infer strong causal relationships from it, because of the missing information.
In addition to your selected crime dataset, you will also need to download the weather dataset, which includes weather data for the same time period. This will be useful for analyzing how weather conditions may have influenced crime patterns.
Once you've downloaded the weather dataset and your selected crime dataset, the next step is to place them in your working directory, load them, and begin the Exploratory Data Analysis (EDA). This process will help you understand the structure and contents of the data, laying the foundation for more detailed analysis later on.
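As a rough illustration of a first EDA pass, the sketch below reads a CSV file and reports its row count, column names, and the number of empty cells per column. It is only one possible starting point, not a prescribed method. The file names `crime_data.csv` and `weather_data.csv` and the helper `summarize_csv` are hypothetical placeholders, and the sketch assumes the downloaded files are UTF-8 CSV files with a header row. It uses only the Python standard library; a data analysis library would work just as well.

```python
import csv
from collections import Counter
from pathlib import Path


def summarize_csv(path):
    """Return row count, column names, and per-column missing-value counts.

    Hypothetical helper for a first look at a dataset; assumes a CSV file
    with a header row.
    """
    missing = Counter()
    n_rows = 0
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        columns = list(reader.fieldnames or [])
        for row in reader:
            n_rows += 1
            for col in columns:
                value = row.get(col)
                # Treat absent, empty, or whitespace-only cells as missing.
                if value is None or value.strip() == "":
                    missing[col] += 1
    return {
        "rows": n_rows,
        "columns": columns,
        "missing": {col: missing[col] for col in columns},
    }


# Hypothetical file names: replace them with the names of the files you downloaded.
for name in ("crime_data.csv", "weather_data.csv"):
    path = Path(name)
    if path.exists():
        summary = summarize_csv(path)
        print(f"{name}: {summary['rows']} rows, {len(summary['columns'])} columns")
        for col in summary["columns"]:
            print(f"  {col}: {summary['missing'][col]} missing")
    else:
        print(f"{name} not found in {Path.cwd()}")
```

If a file is reported as not found, check that it sits in the directory printed in the message. Columns with many missing values are worth noting early, since they affect which analyses are feasible later on.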