Data Science Portfolio
A portfolio of my data science projects, built for course assignments and personal interest.
Attitudes Toward Wife Beating
logistic regression
statistical test
EDA
Examines how education level, economic status, area of residence, and underage marriage among women affect attitudes toward wife beating, and whether these relationships can be described by a statistical model. A final project for a Pacmann Analytics & Data Science course.
Health Insurance Risk Analysis
probability
EDA
An analysis of the variables that affect health insurance charges. A final project for a Pacmann Analytics & Data Science course.
NY Citi Bike Performance
data viz
data cleaning
EDA
An analysis of the NY Citi Bike program's performance in 2014-2016, along with an exploration of user characteristics. Visualizations are presented as a Tableau dashboard. A final project for a Pacmann Analytics & Data Science course.
NYC311 Housing and Building Complaint Analysis
data viz
data cleaning
scikit-learn
decision tree
flask
plotly
An analysis of complaints received by the NYC Department of Housing Preservation and Development (HPD), intended to help manage the high volume of complaints by identifying the most pressing complaint and the area the HPD should focus on. A final project for Purwadhika Data Science and Machine Learning.
Links
Campus Recruitment
EDA
data viz
An analysis of the academic and employability factors that influence candidates' job placement. A project for the IBM Exploratory Data Analysis for Machine Learning course.
House Price Prediction
regression
property
Predicts house prices with linear regression, based on physical attributes related to property sales. Regularization methods such as Ridge, Lasso, and Elastic Net were evaluated. A project for the IBM Supervised Learning: Regression course.
Browsing or Purchasing?
classification
decision tree
retail
Predicts whether a customer's browsing session will end in a purchase, based on an online shop's customer browsing behavior over a one-year period. A project for the IBM Supervised Learning: Classification course.
Online Retail Customer Segmentation
clustering
kmeans
agglomerative clustering
retail
How should a retailer tailor offers to its customers? What does the transaction data reveal about customers' purchasing behavior? We group online retail customers using clustering techniques, with the help of Recency, Frequency, and Monetary (RFM) analysis. A project for the IBM Unsupervised Learning course.
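As a rough illustration of the RFM step, the sketch below computes recency, frequency, and monetary values per customer from a list of transactions. It is not the project's actual code: the function name `compute_rfm`, the tuple layout, and the sample rows are all hypothetical, and frequency is simply counted per transaction row.

```python
from collections import defaultdict
from datetime import date


def compute_rfm(transactions, snapshot_date):
    """Compute RFM values per customer.

    `transactions` is an iterable of (customer_id, purchase_date, amount)
    tuples. This layout is an assumption made for the example.
    """
    last_purchase = {}
    frequency = defaultdict(int)
    monetary = defaultdict(float)

    for customer_id, purchase_date, amount in transactions:
        # Track the most recent purchase date for each customer.
        if customer_id not in last_purchase or purchase_date > last_purchase[customer_id]:
            last_purchase[customer_id] = purchase_date
        frequency[customer_id] += 1      # number of transaction rows
        monetary[customer_id] += amount  # total amount spent

    return {
        cid: {
            "recency": (snapshot_date - last_purchase[cid]).days,
            "frequency": frequency[cid],
            "monetary": monetary[cid],
        }
        for cid in last_purchase
    }


# Made-up sample data, for illustration only.
sample = [
    ("A", date(2011, 12, 1), 20.0),
    ("A", date(2011, 12, 5), 30.0),
    ("B", date(2011, 11, 1), 100.0),
]
rfm = compute_rfm(sample, snapshot_date=date(2011, 12, 10))
```

The resulting three values per customer would typically be scaled before being passed to a clustering algorithm such as k-means or agglomerative clustering.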
The Most Impactful Attack On Titan Episodes in Each Season
EDA
google trends
pytrends
fandom stuff
Which Attack On Titan season had the most impact? Which episodes garnered the most attention in each season? We answer both questions using data from Google Trends.
Others
Portfolio of my other projects in Python.
Twitter Autobase Bot
tweepy
API
fandom stuff
A simple Twitter autobase bot that posts incoming Direct Messages.
Features
- Posts incoming DMs that contain the trigger word.
- Posts incoming DMs with media.
- Splits posts longer than 280 characters and posts them as a thread.
- Sends a DM back for each incoming DM.
- Deletes the incoming DMs once they are posted.
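The thread-splitting feature can be sketched roughly as follows. This is a minimal illustration, not the bot's actual implementation: the function name `split_into_thread` is hypothetical, and it assumes the text should be broken on whitespace, with a hard cut only for single words longer than the limit.

```python
def split_into_thread(text, limit=280):
    """Split `text` into chunks of at most `limit` characters.

    Chunks break on whitespace where possible; a single word longer
    than `limit` is cut into pieces.
    """
    chunks = []
    current = ""
    for word in text.split():
        # Hard-cut any word that cannot fit in one post by itself.
        while len(word) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(word[:limit])
            word = word[limit:]
        candidate = word if not current else current + " " + word
        if len(candidate) <= limit:
            current = candidate
        else:
            # The word does not fit: close the current chunk, start a new one.
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks


# Example: a 499-character message is split into two posts.
message = " ".join(["word"] * 100)
parts = split_into_thread(message)
```

Each chunk after the first would then be posted as a reply to the previous one to form the thread.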
Links
- Source code on my GitHub
- Tutorial on my Dev page
Twitter RT Bot
tweepy
API
sqlite
fandom stuff
A simple Twitter bot that collects tweets containing a certain keyword, stores them in a database file, and retweets them during a set time window. Built in Python using Tweepy.
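The store-then-retweet flow might look something like the sketch below, which covers only the SQLite side and the time-window check. The table layout, function names, and window handling are assumptions made for illustration, not taken from the bot's source; the Twitter API calls are left out.

```python
import sqlite3
from datetime import time


def init_db(conn):
    # Hypothetical schema: one row per collected tweet.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS tweets ("
        "tweet_id INTEGER PRIMARY KEY, "
        "text TEXT NOT NULL, "
        "retweeted INTEGER NOT NULL DEFAULT 0)"
    )
    conn.commit()


def store_tweet(conn, tweet_id, text):
    # INSERT OR IGNORE skips tweets that were already collected.
    conn.execute(
        "INSERT OR IGNORE INTO tweets (tweet_id, text) VALUES (?, ?)",
        (tweet_id, text),
    )
    conn.commit()


def pending_tweets(conn):
    # Tweets not yet retweeted, oldest ID first.
    rows = conn.execute(
        "SELECT tweet_id FROM tweets WHERE retweeted = 0 ORDER BY tweet_id"
    ).fetchall()
    return [row[0] for row in rows]


def mark_retweeted(conn, tweet_id):
    conn.execute("UPDATE tweets SET retweeted = 1 WHERE tweet_id = ?", (tweet_id,))
    conn.commit()


def in_window(now, start, end):
    """Return True if `now` falls in [start, end), including windows past midnight."""
    if start <= end:
        return start <= now < end
    return now >= start or now < end


# Example with an in-memory database; the bot itself uses a database file.
conn = sqlite3.connect(":memory:")
init_db(conn)
store_tweet(conn, 2, "second tweet")
store_tweet(conn, 1, "first tweet")
store_tweet(conn, 1, "duplicate, ignored")
```

During the window, the bot would loop over `pending_tweets(conn)`, retweet each ID through Tweepy, and call `mark_retweeted` after each success.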
Links
- Source code on my GitHub