Welcome to the data playbook
- This is a space dedicated to micro-blogging about topics that interest me and showcasing my data analysis projects.
Background As a data analyst on critical projects, I optimize PostgreSQL queries to ensure end users receive accurate data because even a single second can make a difference in time-sensitive analyses. In resource-constrained environments, cloud computing costs can also limit access to essential tools, slowing decision-making, reducing team transparency, and even threatening a project's success. That's why SQL performance tuning matters. Over the past year, I've gained substantial experience in query optimization, yet I was surprised by how scarce practical resources were when I was actively tackling these challenges. Many nuances were obscure, and some even eluded more senior colleagues. ...
Background A small thing I've been thinking more about lately: git commit messages. I put together this commit style guide for myself to reduce friction. I often spend a good minute deciding what I actually want to base a commit on, so I asked a teammate how they approach it. What stood out wasn't that there's a single "right way". It's that even very experienced engineers optimize commits differently depending on scale. ...
Let's be real: data visualization can be frustrating, even for power analysts. I often get so buried in the numbers that stepping back to cOmMuNiCaTe insights can feel like a chore. It's like making a delicious meal but skipping the plating; no matter how good it tastes, it won't impress if it looks like mush. But visuals matter. A lot. The good news? You don't have to sacrifice hours to create dashboards that are sharp, clear, and beautiful. Here are five go-to tips to level up your visualizations without burning out. The tips are tailored to Tableau users but can be repurposed for your preferred data visualization tool. ...
I took Andy Kriebel's Makeover Monday challenge a step further by creating a quick tutorial on how to present a dashboard. Surprisingly, I couldn't find a comprehensive resource on this topic online, so I'm hoping other netizens will find this helpful. Presenting a dashboard, especially when it's compact like a one-page overview, requires a structured approach to convey insights effectively and keep the audience engaged. A typical presentation time for this kind of dashboard is around 15-20 minutes, as this provides enough time to cover the dashboard's key elements without overwhelming your audience. ...
Why Learn OOP? Lately, I've been revisiting some fundamentals to reinforce Python's role as a powerful tool for object-oriented programming (OOP). My mentor encouraged me to use Python like a developer, not just as an analyst, and I completely agree. Understanding data structures is essential, but incorporating OOP principles makes you a much stronger problem solver. Surprisingly, many analysts either never learn OOP theory or only encounter it after they've already become proficient in Python. ...
I'll skip the standard SQL techniques and assume we're all familiar with the basics. In today's data-driven world, mastering advanced SQL isn't just useful for day-to-day analysis; it's crucial for standing out in live-coding interviews, where you're expected to navigate and manipulate complex datasets quickly and confidently.

1️⃣ GROUP BY vs. DISTINCT vs. Window Functions

| Feature | GROUP BY | DISTINCT | Window Functions |
| --- | --- | --- | --- |
| Use Case | Aggregates | Removes duplicates | Ranking, cumulative sums |
| Performance | Medium | Fast | Can be slow with large data |

Example: Count unique customers per region. ...
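Here is one way the three approaches compared above might look for that example, counting unique customers per region. This is a minimal sketch, not the post's own example. The `orders` table, its columns, and the sample rows are hypothetical. It runs the SQL through Python's built-in `sqlite3` module rather than PostgreSQL, and the window-function query assumes SQLite 3.25 or newer.

```python
import sqlite3

# Hypothetical sample data: an in-memory table of orders with repeat customers.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE orders (region TEXT, customer_id INTEGER);
INSERT INTO orders VALUES
  ('East', 1), ('East', 1), ('East', 2),
  ('West', 3), ('West', 3);
""")

# GROUP BY: aggregate directly, counting distinct customers in each region.
group_by_rows = conn.execute("""
    SELECT region, COUNT(DISTINCT customer_id) AS unique_customers
    FROM orders
    GROUP BY region
    ORDER BY region
""").fetchall()

# DISTINCT: remove duplicate (region, customer) pairs first, then count rows.
distinct_rows = conn.execute("""
    SELECT region, COUNT(*) AS unique_customers
    FROM (SELECT DISTINCT region, customer_id FROM orders)
    GROUP BY region
    ORDER BY region
""").fetchall()

# Window function: count rows per partition over the de-duplicated pairs,
# then collapse the repeated per-row results with SELECT DISTINCT.
window_rows = conn.execute("""
    SELECT DISTINCT region,
           COUNT(*) OVER (PARTITION BY region) AS unique_customers
    FROM (SELECT DISTINCT region, customer_id FROM orders)
    ORDER BY region
""").fetchall()

print(group_by_rows)
print(distinct_rows)
print(window_rows)
```

All three queries return the same per-region counts here. The window-function version is the most verbose for this task, so in practice it earns its place mainly when the per-row detail needs to be kept alongside the count.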
As part of an analytics project for a client, I used R and various libraries to import, clean, and analyze datasets related to labor market statistics in British Columbia. This project involves reading data from CSV files, performing data cleansing operations, merging datasets, and generating visualizations such as box plots and a bar chart showing the monthly change in employment across Canadian provinces. The analysis focuses on specific sectors, including accommodation and food services, wholesale and retail trade, and other services, providing valuable insights into the labor market dynamics. In this post, we'll walk through the steps to analyze employment data across various sectors and provinces in Canada using R. ...
In this project, I generated insights specifically tailored for a web team within a prominent political advocacy organization. The queries cover various aspects such as user engagement, revenue analysis, and page views over time. Additionally, I designed visualizations in Tableau and provided recommendations based on my findings. Exploratory Data Analysis (EDA) Summary This section outlines the steps taken to explore and analyze web analytics data using SQL queries. The analysis provides insights into user engagement, popular pages, device segmentation, and referral sources. ...
Context From 2020 to 2021, I volunteered at the COVID Tracking Project, where our efforts significantly enhanced the accuracy of data collection in the USA. By auditing, compiling, and analyzing sub-national, daily data sources, we delivered accurate, real-time insights to government officials. Our team built over 50 collaborative datasets and corrected inconsistencies in more than 40,000 data points, informing life-saving national testing and immunization strategies to combat COVID-19. Dashboard This dashboard, inspired by Alex the Analyst, exemplifies the impact of our work in providing reliable data that informed public health decisions worldwide, playing a crucial role in creating transparent and globally accessible COVID-19 statistics. ...
Introduction As a Python developer, managing dependencies and ensuring a clean development environment has always been a top priority for me. Virtual environments have become my go-to tool for handling project-specific dependencies, avoiding conflicts, and maintaining a tidy global Python environment. In this blog post, I'll share the benefits I've experienced using virtual environments and walk you through a quick tutorial on using virtualenvwrapper, an extension that has significantly enhanced my workflow. ...