Skip to content

adambushman/data-science-projects

Repository files navigation

Data Science Portfolio Banner

Who am I?

I'm a data enthusiast, finding joy, fulfillment, and personal growth in oscillating between the roles of a data engineer, analyst, scientist, and visualizer. My unique background has enriched my ability to provide value across business functions. I'm honored that you've taken an interest in my work!

Portfolio Purpose

This portfolio showcases my data science projects, highlighting my skills and expertise in extracting insights, solving complex problems, and creating impactful solutions. Through a combination of raw code, rendered notebooks, and documentation, it demonstrates my ability to work across diverse tools and technologies in R, Python, and Julia to ultimately deliver business value.

Projects

Each project in this portfolio leverages a unique modeling approach and showcases my ability to adapt to diverse challenges using R, Python, and Julia. For each, you'll find links to the raw code and rendered notebooks, providing insight into the methods, results, and takeaways.

☯️ Hybrid

Projects spanning multiple languages, modeling approaches, etc., for solving complex business problems

Name Description Language(s) Modeling Approach(es) Links
Delivery Model Standardization for Swire Coca-Cola Identifying an improved segmentation strategy R, Python Kmeans, Hierarchical, Elastic Net, PCA Notebooks, Code
Home Credit Default Risk Estimating likelihood of default R, Python Random Forest, SVM, Elastic Net, PCA Notebooks, Code
NBA Player Position Roles Redefining NBA positions and classifing those for incoming prospects R KMeans, PCA, Nueral Network Notebooks, Code

📈 Regression

Supervised learning projects where the target variable is continuous

Name Description Language(s) Modeling Approach Links
Sacramento Housing Predicting house prices for value R Penalized Regression Notebooks, Code
Food Delivery Times Predicting time from order to delivery for restaurant R Boosted Trees Notebooks, Code
Hotel Rates Predicting average nightly hotel rates R Support Vector Regression Notebooks, Code
Car Prices Predicting car prices R, Python, Julia Linear Regression Code
Laptop Prices Predicting laptop pricing R Decision Tree Code

🎨 Classification

Supervised learning projects where the target variable is categorical

Name Description Language(s) Modeling Approach Links
Palmer Penguins Classifying penguin species R Neural Network Notebooks, Code
Zoo Animal Classes Predicting classes of zoo animals R Decision Tree Code
Employee Attrition Classifying employees to leave and stay at the company R Decision Tree -
Breast Cancer Predicting existance of breast cancer among patients Python Decision Tree -

🌀 Clustering

Unsupervised learning projects for deriving insights from unknown groupings

Name Description Language(s) Modeling Approach Links

🕰️ Time Series

Supervised learning projects where main predictor is time

Name Description Language(s) Modeling Approach Links
Medicare Perscriptions Predicting volumes of medicare perscriptions R ARIMA Notebooks, Code

📕 Natural Language Processing (NLP)

Unsupervised learning projects synthesizing insights from text

Name Description Language(s) Modeling Approach Links

❓ Other Methods

Name Description Language(s) Modeling Approach Links
Alzheimer's Disease Reducing dimensionality of alzheimer's predictors R PCA Notebooks, Code

Common Resources

The following resources were constant companions throughout my data science journey. I owe a thanks to all authors and maintainers.

Stat Quest (YouTube)

{tidymodels} suite

{scikitlearn} suite

Julia Documentation

DuckDB

About

Exploring data science algorithms in R, Python, and Julia

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages