The following are a few projects that I have worked on in my free time. They are in no particular order.

SQL Web Analytics for Political Advocacy

Description: Utilized SQLite to delve deeper into user behavior, device segmentation, and referral sources through extensive exploratory data analysis. Implemented joins, window functions, and comparison operators to extract actionable insights that informed strategic decisions.

Stacks: SQLite, Tableau

Skills: Exploratory data analysis, SQL querying with joins, window functions, and comparison operators

github repository web-analytics-sql miniature from the user k10sj02


SQL Online Commerce Cohort Analysis

Description: Utilized SSMS and Azure Data Studio to uncover insights into customer behavior, including cohort retention analysis and percentage breakdowns over time for online commerce database. Techniques such as CTEs, Temporary Tables, Window Functions, and the PIVOT function are employed to handle data transformation, calculation, and presentation. Created cohort dashboard in Tableau. Ideal for understanding online commerce trends and optimizing business strategies.

Stacks: SQLite, Tableau

Skills: CTEs, Temporary Tables, Window Functions, and the PIVOT function

github repository sql-cohort-analysis miniature from the user k10sj02


Global COVID-19 Trend Analysis

Description: Built an ETL pipeline using MySQL and MySQL Workbench. Utilized SQLAlchemy in Python for data transformation, cleaning, and loading into a MySQL data warehouse. Performed exploratory data analytics on the data warehouse to identify global COVID-19 trends and visualized results in a Tableau dashboard.

Stacks: MySQL, MySQL Workbench, Python (pandas), Tableau

Skills: ETL (Extract, Transform, Load), Data warehousing, Data cleaning and transformation, Exploratory data analysis (EDA), SQL, Data visualization

github repository covid-tracking-project-sql miniature from the user k10sj02


Personal blog

Description: A blog (where you are right now!) where I write about what’s on my mind.

Stacks: HTML/CSS, Hugo, Papermod theme, and Github Actions

github repository thedataplaybook miniature from the user k10sj02


8 Week SQL Challenge: Mastering PostgreSQL with Docker

Project: Completed the 8 Week SQL Challenge (Data with Danny) using PostgreSQL and Docker. Analyzed and cleaned eight diverse datasets, including banking and sales, utilizing various SQL techniques like window functions and Common Table Expressions (CTEs).

Stacks: PostgreSQL, Docker

Skills: Data analysis with SQL, Data cleaning and manipulation, Window functions, Common Table Expressions (CTEs), Containerization

github repository serious-sql miniature from the user k10sj02


Real Estate Analytics: King County Housing Sales Dashboard

Description: Developed an interactive dashboard for analyzing King County housing sales. Includes line chart for trends, histograms for distribution analysis, map for geographical patterns, heatmap for multi-dimensional insights.

Stacks: Tableau

Skills: Data Visualization, Dashboard design, Interactive development, Data analysis


Predicting Gender Self-Perception: An Analysis of American Masculinity

Description: Preprocessed a national US survey dataset on masculinity, focusing on data loading, mapping, cleaning, and exploration. Applied predictive analytics, including regression analysis, random forests, and XGBoost, to predict the gender self-perception of survey respondents. Employed advanced machine learning techniques, such as feature engineering and hyperparameter tuning, to optimize model performance.

Stacks: Python (pandas, matplotlib, scikit-learn, geopandas)

Skills: Data preprocessing, Data cleaning, Exploratory data analysis, Predictive modeling, Machine learning, Feature engineering, Hyperparameter tuning, Geospatial analysis

github repository us_views_of_masculinity miniature from the user k10sj02


British Columbia Labor Market Research

Description: Processed CSV datasets containing Canadian provincial labor market statistics through import, cleaning, and merging. Created visualizations focusing on monthly employment changes across Canadian provinces, highlighting sectors like accommodation, food services, and retail trade.

Stacks: R (tidyverse, dplyr, lubridate)

Skills: Data wrangling, Data cleaning and merging, Data visualization, Economic analysis

github repository BC_Labor_Statistics_Exploration miniature from the user k10sj02