Hillary Chang

About

Hi! My name is Hillary Chang. I am a performance engineer at NVIDIA, where I design and execute performance benchmarks for distributed AI workloads on GPU clusters, profile GPU utilization and interconnect latency, and build automation pipelines for large-scale benchmarking and analysis.

Prior to that, I built and operated core cloud infrastructure at Oracle Cloud Infrastructure, developing services and tooling for the Service Gateway, owning on-call across 200+ global regions, and leading deployments for distributed control plane services.

I hold a Master's in Data Science from UC Berkeley with a concentration in AI/ML, and a Bachelor's in Data Science with a Mathematics minor from UC San Diego. I am passionate about the intersection of high-performance computing, machine learning systems, and scalable infrastructure.

Experience

May 2026 — Present

Santa Clara, CA

Performance Engineer, NVIDIA

Design and execute performance benchmarks for distributed AI workloads (LLM training and inference) on multi-node GPU clusters using CUDA, NCCL, and PyTorch.
Build Python and Bash automation pipelines to orchestrate large-scale benchmarking and performance analysis across GPU architectures.
Analyze RDMA/RoCE networking performance to optimize multi-node GPU communication and scaling efficiency.
Perform low-level GPU profiling with Nsight Systems and Nsight Compute to identify bottlenecks in utilization and interconnect latency.

Jul 2025 — May 2026

Santa Clara, CA

Software Engineer, Oracle Cloud Infrastructure

Engineered core infrastructure for Oracle Cloud's Service Gateway, improving scalability and fault tolerance for high-throughput enterprise networking systems.
Owned on-call for 200+ global regions, resolving Sev1/Sev2 production incidents using Grafana dashboards, Lumberjack logs, and internal debugging tools.
Led global deployments and region build workflows, managing rollout, validation, and rollback for control plane services.
Debugged complex distributed system failures including Terraform state drift, API throttling, and network misconfigurations across multi-region environments.

Dec 2024 — Jul 2025

Costa Mesa, CA

Software Engineer Intern, Siemens

Developed 30+ cross-platform CMake modules for STEP Adapter and Parasolid-STEP Translator, enabling CAD file translation across Windows, Linux, and Intel/ARM macOS.
Unblocked Linux and macOS development for the CAD translation team by delivering the first cross-platform build system, replacing a Windows-only proprietary toolchain.
Diagnosed and resolved 10+ platform-specific build failures, including missing Perl-generated headers, macOS compiler incompatibilities, and linker symbol errors, in a legacy C++ codebase with no existing CMake precedent.
Traced and translated 20+ library dependency graphs from proprietary .def files into CMake targets, resolving linker, compiler, and header generation issues across three OS toolchains.

Jun 2023 — Dec 2023

San Diego, CA

Software Engineer Intern, Wind River Systems

Built a code style checking tool enforcing DO-178C aviation safety standards, adopted by 1,000+ developers across engineering teams.
Automated 100+ critical validation tests for VxWorks real-time operating system, reducing manual QA efforts and error rates.
Integrated regex-based code parsing and Doxygen/ApiGen documentation standards to automate code quality validation across the codebase.

Sep 2022 — Sep 2023

San Diego, CA

Data Scientist, WCSNG Lab, UC San Diego

Engineered a real-time data streaming pipeline using AWS Kinesis, improving data throughput by 20% for indoor localization research.
Configured and deployed 10+ Raspberry Pi units for continuous data collection and secure real-time tracking of localization devices.
Defined advanced analytics and ML project objectives in collaboration with faculty, translating sensor data into actionable insights for wireless sensing research.

Jun 2022 — Sep 2022

Santa Clara, CA

Machine Learning Research Assistant, Santa Clara University

Engineered a YOLOv5 deep learning model achieving 90%+ accuracy on image recognition by training on COCO and custom datasets.
Built a web crawler using YouTube API and FFmpeg to download and preprocess video data, enabling large-scale dataset collection and analysis.
Automated extraction and preprocessing of 7,000+ images with data augmentation techniques to improve model robustness and generalization.
Fine-tuned hyperparameters using precision, recall, and F1-score metrics to optimize detection accuracy across diverse datasets.

Aug 2021 — Dec 2021

San Jose, CA

Software Engineer Intern, Intelligent Fiber Optic Systems

Applied Kalman filter signal processing to denoise 30,000+ rows of sensor data from Fiber Bragg Grating (FBG) optical systems.
Proposed and integrated multi-sensor fusion methods to improve accuracy across multiple FBG sensor channels.
Developed MATLAB visualization scripts to analyze sensor data patterns, creating plots and graphs for engineering review.
Presented technical findings and strategic product recommendations to the CEO and senior management, informing future R&D direction.

Projects

Cash Score

Alternative Credit Scoring from Transaction Data

Full-stack ML system that classifies raw bank transactions using NLP, engineers behavioral features, and predicts credit risk. Achieved 98.93% transaction classification accuracy and 0.84 ROC-AUC on delinquency prediction.

PythonXGBoostLightGBMFastTextTF-IDFTransformersPandasScikit-learn

BERT Transfer Learning Analysis

Exposing the Limits of Transfer Learning

Research demonstrating that TF-IDF outperformed fine-tuned BERT by 8% on domain-specific tasks, exposing negative transfer from domain mismatch between Amazon reviews and banking intents.

PyTorchTransformersBERTTF-IDFNLP

Introduction to Sailing

Interactive Data Visualization

Interactive educational website using D3.js visualizations to make Olympic sailing accessible to newcomers, featuring a global choropleth of Olympic participation, interactive boat diagrams, and a racing rules matching game.

D3.jsJavaScriptHTML/CSSData VisualizationInteractive Design

Flight Price Predictor

Multi-Model ML Pipeline for Airfare Estimation

Machine learning system predicting flight ticket prices using multi-airline data, comparing Linear Regression, Random Forest, and XGBoost to achieve a 43% error reduction over baseline.

PythonXGBoostRandom ForestScikit-learnPandas

Language Detection Model

Competition Winner, Top 5 of 300+

High-performance Spanish vs. French language classifier ranking top 5 out of 300+ competitors, built with 240+ engineered linguistic features on an 800,000+ row dataset.

PythonNumPyPandasFeature Engineering

Recipe Calorie Predictor

Nutritional Analysis and ML Regression

ML pipeline predicting calorie content from recipe attributes, achieving 99.54% R-squared accuracy with Random Forest regression on 83,000+ recipes from Food.com.

PythonScikit-learnRandom ForestPandasPlotly

Protein & Calorie Analysis

Statistical Investigation of Recipe Nutrition

Statistical analysis of 83,782 Food.com recipes investigating the relationship between protein levels and calorie counts using hypothesis testing and missingness analysis.

PythonPandasPlotlyPermutation TestingStatistics

Travel Log

Full-Stack Web Application

Full-stack travel logging platform with user authentication, RESTful APIs, and complete CRUD functionality built with a team of 4.

PythonFlaskMySQLJinjaBcryptREST APIs

Archived Projects

Recognition

Awards & Certifications

Grace Hopper Celebration Scholar

AnitaB.org

NCWIT Aspirations in Computing Award

National Center for Women & IT

#1 NorcalHacks Hackathon

NorcalHacks

Kaggle Competition, Top 2 Percentile

Kaggle

Google Data Science Certificate

Google

Python for Data Science & AI Certificate

IBM

Skills

Technical Stack

Languages

PythonJavaSQLHTML/CSS/JSRBashC

Frameworks & Libraries

PyTorchFlaskPandasNumPyDaskSparkMatplotlibScikit-learnXGBoostFastText

Systems & Cloud

CUDANCCLAWS KinesisTerraformLinuxDistributed SystemsRDMA/RoCE

Tools & Platforms

GitCMakeGrafanaVS CodeJUnitMATLAB

Domains

Machine LearningDeep LearningNLPComputer VisionPerformance EngineeringSignal Processing

Personal

Personal Interests

Chinese Bamboo Flute (Dizi), Level 8, Central Conservatory of Music

Outside of engineering, I love staying active. I rowed NCAA Division I at UC San Diego, played competitive tennis (USTA NorCal #3, 4-year varsity top singles, 2-year captain, 3-year MVP, nationals competitor), swam varsity, and played basketball competitively. I also play the Chinese bamboo flute (dizi), certified Level 8 of 9 on the Central Conservatory of Music exam with Honors.

I love training for races. I completed a Spartan Race and am currently training for a triathlon and half marathon. Other hobbies include surfing, rock climbing, hiking, golf, reading, and writing.

I also love volunteering. I tutored special needs students in robotics and programming at Friends of Children with Special Needs, coached Special Olympics basketball, and worked as a certified lifeguard for over 3 years.

Experience

Performance Engineer, NVIDIA

Software Engineer, Oracle Cloud Infrastructure

Software Engineer Intern, Siemens

Software Engineer Intern, Wind River Systems

Data Scientist, WCSNG Lab, UC San Diego

Machine Learning Research Assistant, Santa Clara University

Software Engineer Intern, Intelligent Fiber Optic Systems

Projects

Cash Score

BERT Transfer Learning Analysis

Introduction to Sailing

Flight Price Predictor

Language Detection Model

Recipe Calorie Predictor

Protein & Calorie Analysis

Travel Log

Archived Projects

Awards & Certifications

Grace Hopper Celebration Scholar

NCWIT Aspirations in Computing Award

#1 NorcalHacks Hackathon

Kaggle Competition, Top 2 Percentile

Google Data Science Certificate

Python for Data Science & AI Certificate

Technical Stack

Languages

Frameworks & Libraries

Systems & Cloud

Tools & Platforms

Domains

Personal Interests

Involvement