๐ŸŽฌ
Data Science - Wintersemester 24/25
  • Welcome
  • Whatโ€™s Data Science and How Do I Do It?
    • ๐Ÿ“†Timeline
    • ๐Ÿดโ€โ˜ ๏ธR Overview
      • ๐Ÿ“ฉInstallation
      • ๐Ÿˆโ€โฌ›GitHub Setup
      • ๐Ÿฅ—DataCamp Courses
    • ๐ŸPython Overview
      • ๐Ÿ“ฉInstallation
      • ๐Ÿˆโ€โฌ›GitHub Setup
      • ๐Ÿ“ฆVirtual Environment Setup
      • ๐Ÿฅ—DataCamp Courses
  • Introduction to Your Project
    • About the Project Guide
    • What is this Project About?
  • Exploratory Data Analysis (EDA)
    • Getting started
    • Discovering the Data ๐Ÿ”Ž
      • Initial Exploration Tasks
      • Initial Data Visualization
    • Data Cleaning and Transformation
      • Cleaning the Crime Dataset๐Ÿ‘ฎ๐Ÿผ
      • Cleaning the Weather Dataset๐ŸŒฆ๏ธ
    • Data Visualization
      • Crime Rate Over Time
      • Crime Types
    • Grouping and Merging Data
    • Linear Regression
    • Impress us!
    • Internship Complete!
  • Advanced
    • Introduction
    • K-Means Clustering
      • The Clustering Model
      • Visualize the clusters
    • Impress us!
  • โœ…Exercise Checklist
  • Legal Disclaimer
Powered by GitBook
On this page
  1. Advanced

Impress us!

Optional

PreviousVisualize the clustersNextExercise Checklist

Last updated 6 months ago

We are really proud of you, you reached the end!

"End? No, the journey doesn't end here. XGBoost is just another path... One that we all must take."

~ Gandalf, probably... if he knew data science

So in that sense, we encourage you to go on even further.

There are still numerous algorithms we have not covered so far.

If you do not know how to find one, it always helps to look at the , if you are stuck on some problem concerning machine learning. may be helpful to find other methods for classification.

If you are using R do not worry, most algorithms you found on the sci-kit website are also available in R. This page is only for getting inspiration as it is the best overview of existing algorithms, but you do not have to do any programming in Python.

The possibilities are overwhelming, so if you cannot decide on a specific algorithm, you can use XGBoost - short for extreme gradient boosting - a powerful boosting method that is regularly used by winners in competitions on Kaggle. To learn more about it, visit the .

You can learn more about the implementation in R on this .

sci-kit learn online documentation
This page
following page
page