Overview

Broadly, I am interested in designing systems mechanisms and policies to handle trade-offs in cost, performance, and efficiency for emerging applications. Specifically, I have worked on projects related to cloud/edge resource management, big data frameworks, deep learning inference, distributed training, neural architecture search, and AR/VR. My recent work has a strong focus on improving system support for deep learning and on the practical applications of deep learning in AR/VR.

Mobile Deep Learning

My work on mobile deep learning tackles the performance issues that arise in running complex models, using cloud-based serving, and supporting secure inference. This line of research is generously supported by the National Science Foundation.


Mobile Augmented and Virtual Reality

With deep learning excelling in a variety of tasks that are essential to environment understanding, I am very excited to shape the future of mobile computing through algorithms and systems innovations. Ultimately, I would like to build an end-to-end framework that facilitates the AR development. I am actively looking for sponsors for this line of research. If you are interested in my work, please don't hesitate to reach out!


Distributed Deep Learning

My work on distributed deep learning focuses on balancing the training cost and accuracy trade-offs. I am particularly interested in providing cost-effective training solutions for deep learning practitioners via cloud-based GPU clusters. This line of work is currently supported by the National Science Foundation and was previously supported by Google Cloud. I am looking for new sponsors for continuing this research!


Cloud Resource Management

Most of my early work is on resource management mechanisms and policies for traditional and distributed clouds.