Data Platform Engineer

Yuhao Dai

Building scalable data infrastructure and AI systems. Based in Tokyo, Japan.

About

Data Platform Engineer with 7+ years building cloud-native, scalable data infrastructure. I've owned platform-wide architecture at organizations including Novo Nordisk and Coca-Cola Bottlers Japan, specializing in lakehouse modernization, CI/CD pipelines, and data governance.

Currently exploring the intersection of LLM engineering and enterprise data platforms—building agentic workflows that automate day-to-day operations.

English TOEFL 115
Japanese JLPT N1
Chinese Native

Experience

2025 — Now

Associate Director, Technical Consulting

Anaqua K.K.

Managing 11 technical consultants. Building GenAI-powered tools for enterprise data migration. Implementing agentic workflows to improve processes.

2023 — 2024

Senior Data Engineer — Platform Owner

Novo Nordisk Pharma Ltd.

Owned Common Data Platform for Region Japan. Led 5-member platform team. Rebuilt normalization and mart layers. Implemented Dagster, dbt, Snowflake stack.

2022 — 2023

Senior Data Engineer & Assistant Manager

Coca-Cola Bottlers Japan Inc.

Led team of 9. Enabled PB-scale data lakehouse. Automated pipelines on Azure Synapse with PySpark/Databricks. Platform transition from SAP.

2021 — 2022

Data Scientist & Infrastructure Engineer

bitFlyer K.K.

Solo DW migration to Snowflake. Established Data Analysis office. ML modeling for user clustering and AML anomaly detection.

Contract Projects

2025 — 2026

BI Platform Migration

Global Luxury Retail

Leading full BI platform migration from PowerBI to Looker for luxury brand analytics. Designed LookML data models aligned with HQ standards, migrated 50+ dashboards, and built DevOps automation reducing deployment time by 70%.

Looker LookML BigQuery dbt Monte Carlo
2025

Enterprise RAG System

AI Startup

Built RAG chatbot systems processing 10K+ documents using LangChain. Developed semantic chunking tools for unstructured Excel data. Created internal LLM tooling including Slack bot, knowledge base UI, and automated deploy pipelines.

Python LangChain FastAPI Azure Terraform
2024 — 2025

Voice AI Platform

AI Startup

Architected voice AI platform handling 1000+ daily calls. Built Terraform IaC across 5 Azure projects. Designed phone AI agent prompts and voice synthesis backend. Led engineering team while establishing CI/CD with GitHub Actions.

Python TypeScript FastAPI Next.js Azure Terraform
2024

Data Vault Platform

Retail Tech

Implemented Data Vault 2.0 architecture for retail analytics platform. Designed and maintained scalable data models serving 100+ business users. Optimized query performance reducing report generation time by 60%.

dbt Snowflake Data Vault 2.0 Python

Skills

Data Platform

  • Snowflake
  • dbt
  • Dagster
  • Airflow
  • Azure Data Factory

Cloud

  • Azure
  • GCP
  • AWS
  • Docker
  • Kubernetes

Languages

  • Python
  • SQL
  • Scala
  • Java
  • Spark

AI / ML

  • LLM Agents
  • RAG
  • PyTorch
  • TensorFlow
  • scikit-learn

Certifications

AWS Solutions Architect Associate
GCP Professional Cloud Architect
GCP Professional Data Engineer
GCP Professional ML Engineer
Azure AI Engineer Associate

Education

BA, English Literature

Doshisha University, Kyoto

2017 — 2021