# Structure the Data

As a reminder, here is the task and the output we expect:

* [ ] Plot the 15 most significant ports, ranked by the total number of vessels each port handled. The resulting diagram should look something like this:

<figure><img src="https://2669499530-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FnYNN3nXNuXMJpHACcH73%2Fuploads%2FBBdPpxbTU3XwwMWpieef%2Fimage.png?alt=media&#x26;token=56b97ba8-9236-4925-b88b-0989acba800d" alt="" width="563"><figcaption></figcaption></figure>

* [ ] Give us an extract of your table and explain what you observe. Keep in mind that we haven't cleaned the dataset yet. Do you already notice any issues with the data?
* [ ] 🤔 Let us know what your thought process is, and extensively comment your code!

> <img src="https://2669499530-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FnYNN3nXNuXMJpHACcH73%2Fuploads%2Ft1yAGmUambZeYVQvPSeu%2Fp.png?alt=media&#x26;token=01872756-9ca8-44f9-9ec1-1ff5f70ce561" alt="" data-size="line">
>
> To sort the rows, use `.sort_values()`. Set the `by` parameter to the column to sort by, and set `ascending` to `True` or `False` to choose ascending or descending order.
>
> To get the top *n* rows you can use `.head(n)`.

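The hint above can be sketched on a small toy table. This is only an illustration, not the solution to the task: the DataFrame contents and the column names `port` and `vessels` are made up here, so replace them with whatever your own table uses.

```python
import pandas as pd

# Hypothetical toy data: the values and the column names "port" and
# "vessels" are assumptions for illustration, not the real dataset.
df = pd.DataFrame(
    {
        "port": ["A", "B", "C", "D", "E"],
        "vessels": [120, 340, 90, 410, 230],
    }
)

# Sort the rows by vessel count, largest first.
ranked = df.sort_values(by="vessels", ascending=False)

# Keep only the first n rows of the sorted table
# (n = 3 for this toy example; the task asks for 15).
top_n = ranked.head(3)

print(top_n)
```

Because the table is sorted in descending order before `.head(n)` is called, the first *n* rows are the *n* largest. A bar chart could then be drawn from the result, for example with `top_n.plot.bar(x="port", y="vessels")`, which requires matplotlib to be installed.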