Ha Hoang Hao

Data & Research Analyst · Python Developer

I analyze complex data to surface insights that drive decisions — and build tools that turn analysis into repeatable workflows.

About me

I'm a data professional based in Ho Chi Minh City, Vietnam, with 4 years of experience turning messy, complex datasets into clean insights and automated workflows.

My work sits at the intersection of statistical analysis and data engineering — I don't just analyze data, I build the tools and pipelines to process it at scale. That mindset led me to create survy, an open source Python library for automated data process-analysis with AI-powered workflows, and surveydb — a SQL storage engine that turns isolated survey files into a unified, queryable database.

I'm drawn to problems where technical depth and research thinking both matter — where the answer isn't just in the data, but in how you build the system to get there.

My Projects

survy
Open source Python library for automated survey data processing, transformation and analysis with a clean, scriptable API. Shipping with AI integration extension - enable LLM-powered data analyzing workflows using the agent skills pattern.
PythonPolarsAISPSSPyPIGithub Actions
surveydb
A standalone Python + SQL storage engine for survey data. Ingests any project as standardized CSV files and normalizes it into a unified, queryable database — enabling cross-project analysis that was previously impossible without manual work.
PythonPolarsSQLPowerBIDocker
hhhao.dev
The site you're viewing here.
Next.jsShadcn

My Story

Insight Asia - Quantitative Research Executive

2024 — 2026

This is where I learned what survey data really looks like at scale — and how painful it is to process without the right tools. I owned the data side of the pipeline: from questionnaire programming through to cleaning, transforming, analysis, and dashboards. Every project had its own schema, its own quirks, its own manual steps that nobody had ever bothered to fix. That friction is what eventually became survy — a library I built to automate the parts of the workflow I found myself repeating across every single project.

Deli - Data & Research Analyst

2022 — 2024

My first job out of university. I spent two years close to the business — tracking sales performance, mapping trends across regions and product lines, and gathering ground-level feedback from sales reps, distributors, and end users to inform product decisions. It taught me how real data looks: messy, inconsistent, and nothing like textbook examples. That's also where I learned that the most valuable thing you can do with data isn't the analysis — it's removing the manual work standing between raw data and insight.