Data analysis dates back to early civilizations, where records of trade and taxes were manually kept. In the 18th and 19th centuries, statistics emerged, with pioneers like Gauss and Nightingale shaping modern analysis. The 20th century saw breakthroughs with relational databases and SQL, revolutionizing data management. By the 1990s, business intelligence and data warehouses enabled deeper insights. Today, data analysis is a critical tool in finance, healthcare, and beyond, shaping the future of technology and business strategies. Thus, general workforce should take actions to upskill in order be a part of this unprecedented era.
This project demonstrates core analytical skills using real tools on simulated business cases. Practice makes better skillsets — thanks for checking them out!
I love the Skill Share projects, though I got lost in the thousands of ideas 😵💫. Skillsharing provides a platform for expressing ideas and spreading knowledge. These projects are critical in building our online personal brands while connecting with the community.
Time at American Family Insurance, I gained hands-on experience with IBM Mainframe databases, Apex CRM, Guidewire, Salesforce, and more. This journey opened my eyes to how technology is our most powerful assistant 🧠💻.
Years in school helped me develop analytical thinking and structured decision-making. It’s not just about knowledge, but understanding how technology works and how we can use it effectively.
Fast forward to today: Imagine you're working at DoorDash to uncover insights for a promo campaign. Data engineers prepare the datasets—now it’s our turn 🎯.
Pivot Tables are essential for:
For example, seniors order online more, which I truly believes as my dad orders from Walmart online all the time. Pivot tables helped me identify campaign trends and family behavior patterns.
🎧 Listen to my project walkthrough to gain more insights that could inform your next campaign. With my previous experiences across industries, I believe some of my insights are worth considered.
Being part of the Data Career Jumpstart community has sharpened my skills. Seeing how others approach data differently constantly inspires me. I keep refining my work, because everyone’s story—and analysis—brings something new.
In Exploratory Data Analysis (EDA), one dataset can tell a hundred stories—each shaped by the perspective of the analyst 🔬📚.My Confession Looking at my basic dashboard, you all can tell that I am not so much into arts but thanks God, we have Tableau/ Power BI to do this hard work for me.
This Tableau project explores how early academic performance—measured by 4th grade MCAS math scores—relates to long-term outcomes such as college enrollment across Massachusetts public and charter schools.
📌 Project Title:
Longitudinal Analysis of MCAS 4th Grade Math Performance and College Enrollment Across Massachusetts School Districts
🎯 Objective:
Examine whether 4th grade MCAS math scores correlate with college enrollment outcomes across public and charter schools.
📊 Data Overview:
🔍 Key Findings:
📈 Next Steps:
🏛️ Potential Impact:
This project underscores how Tableau can bring life to raw academic data, helping stakeholders uncover meaningful patterns in education policy.
How much have countries borrowed from the World Bank? Since childhood, I often heard about this organization and its efforts to support global prosperity by providing loans to poorer nations. Today, by exploring a dataset from their website, we can uncover some interesting insights about global debt, repayment, and financial obligations worldwide.
SQL Query:
There were 31 countries which paid more than 50 % of their debts to WorldBank, which was easily seen by scrolling down the result. For practicing purposes, I created another sub-query and result shows the same 31.
We can see that China repaid a huge amount out of their loans, and Macedonia, paid 95%, which meant they are incredibly reliable. Also the one who pays a smallest portion was Korea, at 0%. I wonder if there is any discrepancies in this dataset, but it was downloaded from WorldBank so, maybe in further studies we can dig deeper later.
Query Result:
SQL Query:
Overall, Vietnam repaid to IDA $149 billion, which was 15% of the total owed $1,601 billion. We heard here and there stories like some countries loaned us, e.g., Japan $6 billion to create a highway, but the salary paid to the Japanese engineers were extremely high. So with these numbers, I would further explore data integrity from both sides of aid sources.
Query Result:
SQL Query:
From our query result, we can look at the case of Afghanistan, there are 5004 transactions fully disbursed and around 35% of it, 1854 transactions fully repaid. Up until the published date of this dataset, there were another 3195 transactions in progress of being disbursed. There are 667 repaying transactions. On the other hand, Africa, has around 121 repaid transactions
Query Result:
India has received the highest loan amount. No wonder, over the past 20 years, Indian talent has been widely recognized in computer science, mathematics, and engineering. Some cities have become unexpectedly wealthier due to increased investment in infrastructure and the growth of capitalism. However, the issue with foreign aid to Asian countries is that, despite the large sums provided, regional economic disparities have not significantly improved. In fact, in some cases, they have worsened.
Query Result:
SQL Query:
We can see that Afganistan paid off around 33% of their transactions. For further analysis, we can have some stacked column charts to compare the percentage of repaid versus total number of transactions for all countries.
Query Result:
SQL Query:
Ever curious about how these project used their money, and how did it impacted their community? We need to know the projects' names and hopefully we can retrieve some insights from their local newspapers or reports about them. We can take a look at Ho Chi Minh city, where I grew up from to see if I can spot any familiar names of the transactions, in later queries.
Query Result:
SQL Query:
I recognize our numbers was hard to read from afar so I revise the currency format. I also have some stem plot using SQL, weirdo right but funny.
Query Result:
📌 Insight:
India received the highest number of loans globally, while my home country, Vietnam, ranked 5th.
Growing up in the 1980s–1990s, my hometown had no skyscrapers.Today, it's rising as the new dragon of Asia. Tech giants are taking notice: Google may soon open a headquarters in Vietnam. NVIDIA acquired the AI division of Vingroup. Qualcomm bought a division of VinAI.
🎖️ Historical Context
My father lived through two wars and remembers how hard it was for Vietnam to join the WTO in the early 2000s. Our government had to prove our economy was an open economy, where capital belonged to the people.
And Vietnam was approved to join.Once known for corruption, Vietnam has made major progress in the past 5 years. The government’s top political priority today:
🚫 Eradicating corruption
The dataset had formatting issues due to CSV export:
WHERE
, ORDER BY
, and LIMIT
GROUP BY
aggregationsOver the past 40 years, Vietnam has gone from the ground up— from a war-torn nation to an emerging tech economy. This case study sharpened my early SQL skills and reminded me how far we’ve come.
Imagine I am the CFO of Mars Health, analyzing patient data from 1998–2008. These patients had diabetes and were admitted to the hospital. We want to understand how many were readmitted within 30 days, which increases financial strain and worsens outcomes due to comorbidities like cardiovascular or renal disease.
SQL Query:
In the proctology department, which specializes in colorectal surgeries, patients often require 4 procedures per stay. This aligns with my experience interpreting for patients—colonoscopy alone may involve vitals, imaging, anesthesia, and more.
SQL Result:
SQL Query:
Despite having fewer procedures per stay, the cardiology department served 5,352 patients. I still need to confirm whether this figure represents one week or one month of data.
SQL Result:
SQL Query:
This query identifies departments with more than 50 admissions. There appear to be about 15 such departments.
SQL Result:
SQL Query:
SQL Result:
The analysis shows no major racial disparity in the number of procedures per patient.
SQL Query:
SQL Result:
The query in problem statement #5 is an example of a subquery. Let's now explore how we can revise the query as shown below.
In this query, we join the two tables—`demographics` and `health`—on the common column `patient_nbr`.
We created a new table called `inner_table`, which doesn't automatically refresh when the data in the original tables is updated.
Next, we examine CTE (Common Table Expressions) by rewriting the materialized view query as a CTE.
Now, let's use a Window Function in the query to observe how long it takes for MySQL to process the request.
Feature | Window Functions | Common Table Expressions (CTEs) | Materialized Views |
---|---|---|---|
Purpose | Calculate running totals, ranks, etc., over a set of rows | Simplify complex queries by breaking them into readable blocks | Store results of a query physically for faster access |
Volatile/Static | Dynamic—recomputed with each query | Dynamic—evaluated at runtime | Static—must be refreshed manually or periodically |
Storage | No extra storage | No extra storage | Occupies disk space |
Performance | Efficient for row-by-row calculations | Good for logical clarity; neutral on performance | Can greatly improve performance for complex joins |
Use Case | Ranking, moving averages, percentiles | Recursive queries, simplification | Dashboards, heavy aggregations reused often |
Based on our analysis, it is evident that cardiology and nephrology departments have the highest patient volumes and the most procedures performed. These departments represent critical areas of focus for operational improvements.
These insights are crucial for directing future strategies to improve departmental efficiency and patient care.
These insights are not just theoretical but directly applicable to the project and could significantly contribute to improving business processes and enhancing overall performance.