Data and Analytics Engineer specializing in ELT pipeline design, dimensional data modeling, and modern data stack tooling. Built GA4 and AppsFlyer data infrastructure for products with 10M+ users and $5M+ monthly revenue. M.S. in Analytics, Northeastern University. Trilingual in English, Chinese, and Japanese (JLPT N1).
I'm a Data and Analytics Engineer with deep expertise in ELT pipeline design, dimensional data modeling, and modern data stack tooling (dbt, BigQuery, Airflow). My background spans the full data lifecycle, from event instrumentation and pipeline ingestion to transformation, modeling, and self-serve dashboards.
At Newga Network, I built and maintained GA4 and AppsFlyer data infrastructure supporting products with 10M+ users and $5M+ monthly revenue. I designed star schema dimensional models for player behavior, UA performance, and monetization, and established A/B testing frameworks that improved D7 retention by 15%.
Currently completing my M.S. in Analytics at Northeastern University, while running an independent game studio where I own the entire data stack from instrumentation to mart-layer modeling. Open to Data Engineer and Analytics Engineer roles across the US and Japan.
End-to-end ELT pipelines with dbt, Airflow, and BigQuery: incremental models, snapshots, data quality tests, and full lineage.
Star schema dimensional modeling for player behavior, UA, and monetization. Deep expertise in gaming data architecture.
Built LLM-powered applications using LangChain, GPT-4o, FAISS, and Streamlit. Daily AI tools user (Claude, Cursor).
Trilingual in English, Chinese, and Japanese (JLPT N1). Experience across US, China, and Japan markets.
End-to-end data pipelines, analytics infrastructure, and live products I've built and shipped.
End-to-end ELT pipeline built on real GA4 and AppsFlyer data from live mobile game titles. Covers the full modern data stack from raw event ingestion to business-ready marts.
LLM-powered conversational chatbot for e-commerce product discovery. Built with RAG architecture using FAISS vector store for semantic search over product catalog data.
Designed and maintained the full analytics data infrastructure for mobile games serving 10M+ global users. Owned everything from event instrumentation to executive dashboards.
I build and ship mobile games and AI apps independently, and own the full data infrastructure for each title, from GA4 instrumentation to dbt-modeled analytics marts.
A consumer social app where pets come alive through AI. LLM + computer vision give each pet a unique personality that responds to photos and chats.
A hybrid casual puzzle fusing poker and mahjong tile-matching. Full product ownership from design through live operations and analytics.
Number-matching puzzle combining blackjack mechanics with satisfying tile-clear gameplay. Owned full product lifecycle from concept through analytics.
Zen tile-matching with island-building progression. Designed engagement loops and retention mechanics, tracked via full analytics pipeline.
Strategic mahjong fused with board game territory mechanics. Android live; iOS under App Store review.
Analyzed box office trends through three business lenses. Produced a film where each character visualizes the same dataset differently.
Time series modeling of AAPL and HON using ARIMA, regression, and moving averages. Compared simulated trading strategies with comprehensive performance analysis.
Monte Carlo simulation and chi-square testing to find optimal betting strategies across best-of-3/5/7 series formats.
EDA, ANOVA, and Lasso regression on coffee bean prices. Includes STL decomposition and MAPE-evaluated forecasting models.
Compared Logistic Regression, Random Forest, and SVM using K-Fold and Repeated K-Fold to investigate performance stability.
Interactive Looker Studio dashboard analyzing 40 years of UK bicycle accident data to surface policy-relevant insights.
Open to Data Engineer and Analytics Engineer roles in the US and Japan. Based in San Jose, CA; open to relocation.
I build reliable data infrastructure that teams can trust. From ELT pipeline design and dimensional modeling to self-serve dashboards, I own the full data stack end to end.
10M+ users in pipeline · $5M+ monthly revenue tracked · 6 live apps shipped · Gaming domain expertise.
San Jose, CA · Open to relocation · EN / ZH / JP (N1)