Data analysis dates back to early civilizations, where records of trade and taxes were manually kept. In the 18th and 19th centuries, statistics emerged, with pioneers like Gauss and Nightingale shaping modern analysis. The 20th century saw breakthroughs with relational databases and SQL, revolutionizing data management. By the 1990s, business intelligence and data warehouses enabled deeper insights. Today, data analysis is a critical tool in finance, healthcare, and beyond, shaping the future of technology and business strategies. Thus, general workforce should take actions to upskill in order be a part of this unprecedented era.
This project demonstrates core analytical skills using real tools on simulated business cases. Practice makes better skillsets ā thanks for checking them out!
I enjoy working on Skill Share projectsāthey offer a great way to express ideas, share knowledge, and build a personal brand while connecting with others in the community. That said, itās easy to feel overwhelmed by the sheer volume of ideas available.
At American Family Insurance, I gained practical experience working with IBM mainframe databases, Apex CRM, Guidewire, and Salesforce. That exposure helped me understand how technology can act as a powerful assistant when solving real business problems.
My academic background trained me in analytical thinking and structured decision-making. Itās not just about collecting knowledgeāit's about applying technology thoughtfully and effectively to deliver impact.
Today, imagine working at DoorDash on a promotional campaign. The data engineers have already prepared the datasetsānow itās your turn to uncover insights that can drive business decisions.
Pivot Tables help you explore patterns in your data. They're especially useful for:
For example, I discovered that older adults tend to order more frequently onlineāa pattern that aligned with my own observations. Tools like Pivot Tables allowed me to validate this trend using real data.
If you're interested, I also recorded a short walkthrough of this project where I discuss the findings and share ideas that might inspire your next campaign.
Being part of the Data Career Jumpstart community has been a great learning experience. Itās inspiring to see how differently people approach the same dataset. With Exploratory Data Analysis (EDA), even one dataset can tell many stories depending on who's analyzing it.
As for meāIām not a visual designer, but tools like Tableau and Power BI help bridge that gap. They allow me to create dashboards that communicate clearly, even if design isnāt my strongest suit.
This Tableau project explores how early academic performanceāmeasured by 4th grade MCAS math scoresārelates to long-term outcomes such as college enrollment across Massachusetts public and charter schools.
š Project Title:
Longitudinal Analysis of MCAS 4th Grade Math Performance and College Enrollment Across Massachusetts School Districts
šÆ Objective:
Examine whether 4th grade MCAS math scores correlate with college enrollment outcomes across public and charter schools.
š Data Overview:
š Key Findings:
š Next Steps:
šļø Potential Impact:
This project underscores how Tableau can bring life to raw academic data, helping stakeholders uncover meaningful patterns in education policy.
How much have countries borrowed from the World Bank? Since childhood, I often heard about this organization and its efforts to support global prosperity by providing loans to poorer nations. Today, by exploring a dataset from their website, we can uncover some interesting insights about global debt, repayment, and financial obligations worldwide.
SQL Query:
There were 31 countries which paid more than 50 % of their debts to WorldBank, which was easily seen by scrolling down the result. For practicing purposes, I created another sub-query and result shows the same 31.
We can see that China repaid a huge amount out of their loans, and Macedonia, paid 95%, which meant they are incredibly reliable. Also the one who pays a smallest portion was Korea, at 0%. I wonder if there is any discrepancies in this dataset, but it was downloaded from WorldBank so, maybe in further studies we can dig deeper later.
Query Result:
SQL Query:
Overall, Vietnam repaid to IDA $149 billion, which was 15% of the total owed $1,601 billion. We heard here and there stories like some countries loaned us, e.g., Japan $6 billion to create a highway, but the salary paid to the Japanese engineers were extremely high. So with these numbers, I would further explore data integrity from both sides of aid sources.
Query Result:
SQL Query:
From our query result, we can look at the case of Afghanistan, there are 5004 transactions fully disbursed and around 35% of it, 1854 transactions fully repaid. Up until the published date of this dataset, there were another 3195 transactions in progress of being disbursed. There are 667 repaying transactions. On the other hand, Africa, has around 121 repaid transactions
Query Result:
India has received the highest loan amount. No wonder, over the past 20 years, Indian talent has been widely recognized in computer science, mathematics, and engineering. Some cities have become unexpectedly wealthier due to increased investment in infrastructure and the growth of capitalism. However, the issue with foreign aid to Asian countries is that, despite the large sums provided, regional economic disparities have not significantly improved. In fact, in some cases, they have worsened.
Query Result:
SQL Query:
We can see that Afganistan paid off around 33% of their transactions. For further analysis, we can have some stacked column charts to compare the percentage of repaid versus total number of transactions for all countries.
Query Result:
SQL Query:
Ever curious about how these project used their money, and how did it impacted their community? We need to know the projects' names and hopefully we can retrieve some insights from their local newspapers or reports about them. We can take a look at Ho Chi Minh city, where I grew up from to see if I can spot any familiar names of the transactions, in later queries.
Query Result:
SQL Query:
I recognize our numbers was hard to read from afar so I revise the currency format. I also have some stem plot using SQL, weirdo right but funny.
Query Result:
š Insight:
India received the highest number of loans globally, while my home country, Vietnam, ranked 5th.
Growing up in the 1980sā1990s, my hometown had no skyscrapers.Today, it's rising as the new dragon of Asia. Tech giants are taking notice: Google may soon open a headquarters in Vietnam. NVIDIA acquired the AI division of Vingroup. Qualcomm bought a division of VinAI.
šļø Historical Context
My father lived through two wars and remembers how hard it was for Vietnam to join the WTO in the early 2000s. Our government had to prove our economy was an open economy, where capital belonged to the people.
And Vietnam was approved to join.Once known for corruption, Vietnam has made major progress in the past 5 years. The governmentās top political priority today:
š« Eradicating corruption
The dataset had formatting issues due to CSV export:
WHERE
, ORDER BY
, and LIMIT
GROUP BY
aggregationsOver the past 40 years, Vietnam has gone from the ground upā from a war-torn nation to an emerging tech economy. This case study sharpened my early SQL skills and reminded me how far weāve come.
Imagine I am the CFO of Mars Health, analyzing patient data from 1998ā2008. These patients had diabetes and were admitted to the hospital. We want to understand how many were readmitted within 30 days, which increases financial strain and worsens outcomes due to comorbidities like cardiovascular or renal disease.
SQL Query:
In the proctology department, which specializes in colorectal surgeries, patients often require 4 procedures per stay. This aligns with my experience interpreting for patientsācolonoscopy alone may involve vitals, imaging, anesthesia, and more.
SQL Result:
SQL Query:
Despite having fewer procedures per stay, the cardiology department served 5,352 patients. I still need to confirm whether this figure represents one week or one month of data.
SQL Result:
SQL Query:
This query identifies departments with more than 50 admissions. There appear to be about 15 such departments.
SQL Result:
SQL Query:
SQL Result:
The analysis shows no major racial disparity in the number of procedures per patient.
SQL Query:
SQL Result:
The query in problem statement #5 is an example of a subquery. Let's now explore how we can revise the query as shown below.
In this query, we join the two tablesā`demographics` and `health`āon the common column `patient_nbr`.
We created a new table called `inner_table`, which doesn't automatically refresh when the data in the original tables is updated.
Next, we examine CTE (Common Table Expressions) by rewriting the materialized view query as a CTE.
Now, let's use a Window Function in the query to observe how long it takes for MySQL to process the request.
Feature | Window Functions | Common Table Expressions (CTEs) | Materialized Views |
---|---|---|---|
Purpose | Calculate running totals, ranks, etc., over a set of rows | Simplify complex queries by breaking them into readable blocks | Store results of a query physically for faster access |
Volatile/Static | Dynamicārecomputed with each query | Dynamicāevaluated at runtime | Staticāmust be refreshed manually or periodically |
Storage | No extra storage | No extra storage | Occupies disk space |
Performance | Efficient for row-by-row calculations | Good for logical clarity; neutral on performance | Can greatly improve performance for complex joins |
Use Case | Ranking, moving averages, percentiles | Recursive queries, simplification | Dashboards, heavy aggregations reused often |
Based on our analysis, it is evident that cardiology and nephrology departments have the highest patient volumes and the most procedures performed. These departments represent critical areas of focus for operational improvements.
These insights are crucial for directing future strategies to improve departmental efficiency and patient care.
These insights are not just theoretical but directly applicable to the project and could significantly contribute to improving business processes and enhancing overall performance.