Publications
* denotes equal contribution
2026
-
DynaFlow: Transparent and Flexible Intra-Device Parallelism via Programmable Operator Scheduling9th Annual Conference on Machine Learning and Systems (MLSys’26, to appear), 2026 -
MuxTune: Efficient Multi-Task LLM Fine-Tuning via Spatial-Temporal Backbone MultiplexingIn 23nd USENIX Symposium on Networked Systems Design and Implementation (NSDI’26, to appear) , 2026
2025
-
Magneton: Optimizing Energy Efficiency of ML Systems via Differential Energy DebuggingarXiv preprint, 2025