This is the legendary Titanic ML competition. The competition is simple: use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.
I adopted Exploratory Data Analysis for data sets analyzation to sum up the main characteristics by using Visualization method. Further, I leveraged Feature engineering to optimize various features in the data set.
The data consists of two groups: a training set (train.csv) and test set (test.csv)
Among male passengers, 80% died (20% survived); among female passengers, only 30% died (70% survived). Among all survivors, 70% were female and 30% were male. Perhaps the reason being female and children were rescued first.