I’m a PhD student in Brown CS.

My sensei are Stan Zdonik, Seny Kamara, and Tarik Moataz. I also collaborate with Ugur Cetintemel, Carsten Binnig and Tim Kraska in the Database Group, Emanuel Zgraggen in the Graphics Group, and Eli Upfal and Lorenzo De Stefani in the Theory Group.

I am interested in the theories and designs of big data systems that are intelligent and safe. My research spans a broad area covering cryptography, data science/machine learning, and big data systems. I’m building encrypted data systems that are provably secure at Sifr Systems with some fantastic people. We’re much more secure than CryptDB.

Some old gigs

Open-source projects



My collection of ariticles covering cryptography, data science, and database systems.

Security and Cryptography

Signal Search.
Engelman, Kamara, Moataz and Zhao.
April 2017. [Software]

Data Science

Behavior of Large Random Graph.
Zhao, supervised by Prof. Paul Dupius.
Randomized Algorithms for Counting, Integration and Optimzation, Brown University, April 2017.

Investigating the Effect of the Multiple Comparisons Problem in Visual Analysis.
Zgraggen, Zhao, Zeleznik, and Kraska.
CHI, April 2018. [Video], [Software], [Review on Medium]

Controlling False Discoveries During Interactive Data Exploration.
Zhao, De Stefani, Zgraggen, Binnig, Upfal and Kraska.
SIGMOD, May 2017. [Review on Medium]

Safe Visual Data Exploration.
Zhao, Zgraggen, De Stefani, Binnig, Upfal and Kraska.
SIGMOD Demo, May 2017.

Towards Sustainable Insights, or Why Polygamy is Bad for You.
Binnig, De Stefani, Kraska, Upfal, Zgraggen and Zhao.
CIDR, January 2017. [Code], [Review by Adrian Coyler]

Towards a Benchmark for Interactive Data Exploration.
Eichmann, Zgraggen, Zhao, Binnig, Kraska.
IEEE Data Engineering Bulletin, 2016.

VisTrees: Fast Indexes for Interactive Data Exploration.
El-Hindi, Zhao, Binnig and Kraska.
SIGMOD HILDA, June 2016.


Bridging the Gap between HPC and Big Data frameworks.
Anderson, Smith, Sundaram, Capota, Zhao, Dulloor, Satish and Willke.
VLDB, 2017. [Spark performance tool] [Machine learning code]

Larger-than-memory Data Management on Modern Storage Hardware for In-memory OLTP Database Systems.
Ma, Arulraj, Zhao, Pavlo, Dulloor, Giardino, Parkhurst, Gardner, Doshi and Zdonik.
SIGMOD DaMoN, June 2016.

Data Tiering in Heterogeneous Memory Systems.
Dulloor, Roy, Zhao, Sundaram, Satish, Sankaran, Jackson and Schwan.
EuroSys, April 2016. [Code]