Pareto Analysis

We saw earlier that the vessel count distributions are highly right-skewed, meaning that there are a few ports that handle significantly more traffic than the rest. This suggests that there is a small portion of all ports that is responsible for the majority of global maritime trade. The Pareto Principle (commonly known as the 80-20 rule) states that roughly 80% of the consequences come from 20% of the causes. Your task is to examine if it applies here.


🔧 Your tasks:

  • Here you need the cutoff index and not the port name, as you want to know how many ports come before this cutoff. For example, if cutoff port has index 10, then 9 ports account for 80% of results.

You can use .sum() for the first task. .sort_values() to sort the dataset, make sure to reset the index (Find out why and how on your own)

You can use .cumsum() for calculating cumulative index.

Last updated