Hillary Chang

Specializing in GPU clusters and distributed cloud infrastructure at scale.

About Me

Hi! My name is Hillary Chang. I am a performance engineer at NVIDIA, where I design and execute performance benchmarks for distributed AI workloads on GPU clusters, profile GPU utilization and interconnect latency, and build automation pipelines for large-scale benchmarking and analysis.

Prior to that, I built and operated core cloud infrastructure at Oracle Cloud Infrastructure, developing services and tooling for the Service Gateway, owning on-call across 200+ global regions, and leading deployments for distributed control plane services.

I hold a Master's in Data Science from UC Berkeley with a concentration in AI/ML, and a Bachelor's in Data Science with a Mathematics minor from UC San Diego. I am passionate about the intersection of high-performance computing, machine learning systems, and scalable infrastructure.

Hillary Chang

Experience

NVIDIA

Performance Engineer
May 2026 - Present

Santa Clara, CA

Design and execute performance benchmarks for distributed AI workloads (LLM training and inference) on multi-node GPU clusters using CUDA, NCCL, and PyTorch

Build Python and Bash automation pipelines to orchestrate large-scale benchmarking and performance analysis across GPU architectures

Analyze RDMA/RoCE networking performance to optimize multi-node GPU communication and scaling efficiency

Perform low-level GPU profiling with Nsight Systems and Nsight Compute to identify bottlenecks in utilization and interconnect latency

Oracle Cloud Infrastructure

Software Engineer
Jun 2025 - May 2026

Santa Clara, CA

Engineered core infrastructure for Oracle Cloud's Service Gateway, improving scalability and fault tolerance for high-throughput enterprise networking systems

Owned on-call for 200+ global regions, resolving Sev1/Sev2 production incidents using Grafana dashboards, Lumberjack logs, and internal debugging tools

Led global deployments and region build workflows, managing rollout, validation, and rollback for control plane services

Debugged complex distributed system failures including Terraform state drift, API throttling, and network misconfigurations across multi-region environments

Siemens

Software Engineer Intern
Dec 2024 - Jun 2025

Costa Mesa, CA

Developed 30+ cross-platform CMake modules for STEP Adapter and Parasolid-STEP Translator, enabling CAD file translation across Windows, Linux, and Intel/ARM macOS

Unblocked Linux and macOS development for the CAD translation team by delivering the first cross-platform build system, replacing a Windows-only proprietary toolchain

Diagnosed and resolved 10+ platform-specific build failures, including missing Perl-generated headers, macOS compiler incompatibilities, and linker symbol errors, in a legacy C++ codebase with no existing CMake precedent

Traced and translated 20+ library dependency graphs from proprietary .def files into CMake targets, resolving linker, compiler, and header generation issues across three OS toolchains

Wind River Systems

Software Engineer Intern
Jun 2023 - Dec 2023

San Diego, CA

Built a code style checking tool enforcing DO-178C aviation safety standards, adopted by 1,000+ developers across engineering teams

Automated 100+ critical validation tests for VxWorks real-time operating system, reducing manual QA efforts and error rates

Integrated regex-based code parsing and Doxygen/ApiGen documentation standards to automate code quality validation across the codebase

WCSNG Lab, UC San Diego

Data Scientist
Sep 2022 - Sep 2023

San Diego, CA

Engineered a real-time data streaming pipeline using AWS Kinesis, improving data throughput by 20% for indoor localization research

Configured and deployed 10+ Raspberry Pi units for continuous data collection and secure real-time tracking of localization devices

Defined advanced analytics and ML project objectives in collaboration with faculty, translating sensor data into actionable insights for wireless sensing research

Santa Clara University

Machine Learning Research Assistant
Jun 2022 - Sep 2022

Santa Clara, CA

Engineered a YOLOv5 deep learning model achieving 90%+ accuracy on image recognition by training on COCO and custom datasets

Built a web crawler using YouTube API and FFmpeg to download and preprocess video data, enabling large-scale dataset collection and analysis

Automated extraction and preprocessing of 7,000+ images with data augmentation techniques to improve model robustness and generalization

Fine-tuned hyperparameters using precision, recall, and F1-score metrics to optimize detection accuracy across diverse datasets

Intelligent Fiber Optic Systems

Software Engineer Intern
Aug 2021 - Dec 2021

San Jose, CA

Applied Kalman filter signal processing to denoise 30,000+ rows of sensor data from Fiber Bragg Grating (FBG) optical systems

Proposed and integrated multi-sensor fusion methods to improve accuracy across multiple FBG sensor channels

Developed MATLAB visualization scripts to analyze sensor data patterns, creating plots and graphs for engineering review

Presented technical findings and strategic product recommendations to the CEO and senior management, informing future R&D direction

Projects

Cash Score

Alternative Credit Scoring from Transaction Data

Full-stack ML system that classifies raw bank transactions using NLP, engineers behavioral features, and predicts credit risk. Achieved 98.93% transaction classification accuracy and 0.84 ROC-AUC on delinquency prediction.

PythonXGBoostLightGBMFastTextTF-IDFTransformersPandasScikit-learn

BERT Transfer Learning Analysis

Exposing the Limits of Transfer Learning

Research demonstrating that TF-IDF outperformed fine-tuned BERT by 8% on domain-specific tasks, exposing negative transfer from domain mismatch between Amazon reviews and banking intents.

PyTorchTransformersBERTTF-IDFNLP

Introduction to Sailing

Interactive Data Visualization

Interactive educational website using D3.js visualizations to make Olympic sailing accessible to newcomers, featuring a global choropleth of Olympic participation, interactive boat diagrams, and a racing rules matching game.

D3.jsJavaScriptHTML/CSSData VisualizationInteractive Design

Flight Price Predictor

Multi-Model ML Pipeline for Airfare Estimation

Machine learning system predicting flight ticket prices using multi-airline data, comparing Linear Regression, Random Forest, and XGBoost to achieve a 43% error reduction over baseline.

PythonXGBoostRandom ForestScikit-learnPandas

Language Detection Model

Competition Winner, Top 5 of 300+

High-performance Spanish vs. French language classifier ranking top 5 out of 300+ competitors, built with 240+ engineered linguistic features on an 800,000+ row dataset.

PythonNumPyPandasFeature Engineering

Recipe Calorie Predictor

Nutritional Analysis and ML Regression

ML pipeline predicting calorie content from recipe attributes, achieving 99.54% R-squared accuracy with Random Forest regression on 83,000+ recipes from Food.com.

PythonScikit-learnRandom ForestPandasPlotly

Protein & Calorie Analysis

Statistical Investigation of Recipe Nutrition

Statistical analysis of 83,782 Food.com recipes investigating the relationship between protein levels and calorie counts using hypothesis testing and missingness analysis.

PythonPandasPlotlyPermutation TestingStatistics

Travel Log

Full-Stack Web Application

Full-stack travel logging platform with user authentication, RESTful APIs, and complete CRUD functionality built with a team of 4.

PythonFlaskMySQLJinjaBcryptREST APIs

Awards & Certifications

Grace Hopper Celebration Scholar

AnitaB.org

NCWIT Aspirations in Computing Award

National Center for Women & IT

#1 NorcalHacks Hackathon

NorcalHacks

Kaggle Competition, Top 2 Percentile

Kaggle

Google Data Science Certificate

Google

Python for Data Science & AI Certificate

IBM

Skills

Languages

PythonJavaSQLHTML/CSS/JSRBashC

Frameworks & Libraries

PyTorchFlaskPandasNumPyDaskSparkMatplotlibScikit-learnXGBoostFastText

Systems & Cloud

CUDANCCLAWS KinesisTerraformLinuxDistributed SystemsRDMA/RoCE

Tools & Platforms

GitCMakeGrafanaVS CodeJUnitMATLAB

Domains

Machine LearningDeep LearningNLPComputer VisionPerformance EngineeringSignal Processing

Personal Interests

NCAA D1 Women's Rowing, UC San Diego

NCAA D1 Women's Rowing, UC San Diego

Outside of engineering, I love staying active. I rowed NCAA Division I at UC San Diego, played competitive tennis (USTA NorCal #3, 4-year varsity top singles, 2-year captain, 3-year MVP, nationals competitor), swam varsity, and played basketball competitively. I also play the Chinese bamboo flute (dizi), certified Level 8 of 9 on the Central Conservatory of Music exam with Honors.

I love training for races. I completed a Spartan Race and am currently training for a triathlon and half marathon. Other hobbies include surfing, rock climbing, hiking, golf, reading, and writing.

I also love volunteering. I tutored special needs students in robotics and programming at Friends of Children with Special Needs, coached Special Olympics basketball, and worked as a certified lifeguard for over 3 years.

Involvement

Teaching Assistant

UCSD Data Science & Mathematics Departments

· Mar 2023 – Jun 2025

Hosted office hours, led discussions, created course materials, and graded for multiple courses across both departments

IT Specialist

UC San Diego ITS / ResNet

· Aug 2023 – Jul 2025

Provided technical support, resolved software issues through troubleshooting and desktop support

CSES Board Member

Computer Science & Engineering Society, UCSD

· Sep 2023 – Sep 2025

Helped host career fairs and events for students and members to network with industry professionals

HDSI Student Council

Halıcıoğlu Data Science Institute, UCSD

· Sep 2024 – Sep 2025

Hosted community events and socials for data science students, gathered student body feedback to improve department resources

Grace Hopper Celebration Scholar

GHC 2024

· Oct 2024

Selected out of hundreds of applicants to attend with a full scholarship

MIT Beaver Works Summer Institute

Embedded Security & Hardware Hacking

· Jul 2021 – Aug 2021

Implemented secure firmware distribution systems, coded in C and Python for security assessments of embedded systems

View Project