Blog posts

2021

2019

XGBoost

2 minute read

Published:

From the latest post about Machine Learning, I dealt with boosting and bagging. Here, in this post, I want to introduce XGBoost, one of boosting algorithms.

2018

Bagging, Boosting

2 minute read

Published:

When study Machine Learning, there are some confusing concept which is not easily understand. In my case, I got confused with the terms Bagging and Boosting. So, let me explain what I studied about Bagging, Boosting and related terms.

Kaggle - GA Customer Revenue prediction - EDA

21 minute read

Published:

There is an ongoing competition in Kaggle. I want to share the exploration of the data as learned from other competitor’s kernel. Of course I will not include the detailed feature engineering and modeling here and now.

Kaggle - NYC taxi fare description (1)

6 minute read

Published:

As I study machine learning and data science, I have interest to solving Kaggle problem. My goal with Kaggle is to improve my data science knowledge and skill. Before this, my first object is to be accustomed to Kaggle and this kind of problem solving. So firstly, I will analyze how others win and get better score. NYC taxi fare prediction competition is the first problem from Kaggle that I want to study. The goal of this competition is to predict NYC taxi fare, and here in this post, will share how others solved and what I learned from.

Sampling

less than 1 minute read

Published:

There is a term “Sampling” in statistics. In this page, will see the concept of sampling and how it is adjusted to machine learning.

Entropy

less than 1 minute read

Published:

When study Machine Learning, there are some confusing concept which is not easily understand. Want to share those terms in this blog. 1st thing I want to share is Entropy mainly learned from Bishop and Shannon.