Kaggle Challange

TITANIC- Machine Learnign from disaster

This is the legendary Titanic ML competition. The competition is simple: use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

I adopted Exploratory Data Analysis for data sets analyzation to sum up the main characteristics by using Visualization method. Further, I leveraged Feature engineering to optimize various features in the data set.

Import the necessary modules

Responsive image

The Data

The data consists of two groups: a training set (train.csv) and test set (test.csv)

Load and process the Training Data

Responsive image

Create the Model and Train

Responsive image

To visualize the correlation with gender, we will execute the following:

Responsive image
Responsive image

Among male passengers, 80% died (20% survived); among female passengers, only 30% died (70% survived). Among all survivors, 70% were female and 30% were male. Perhaps the reason being female and children were rescued first.

Responsive image