I’m a PhD student in Brown CS.
My sensei are Stan Zdonik, Seny Kamara, and Tarik Moataz. I also collaborate with Ugur Cetintemel, Carsten Binnig and Tim Kraska in the Database Group, Emanuel Zgraggen in the Graphics Group, and Eli Upfal and Lorenzo De Stefani in the Theory Group.
- I’m building encrypted data systems that are provably secure at Sifr Systems with some fantastic people. We’re much more secure than CryptDB.
- I maintain an external advisory role for Blockchain Warehouse. This gig gets me to talk about blockchain for fun and profit.
- I provide consultation on machine learning at Critical Future.
Some old gigs
- Microsoft Research & AI, 2017
- Intel Labs, 2015
- Hadapt (Acquired by Teradata), 2013-14
- WalmartLabs, 2012
I am interested in the theories and designs of big data systems that are intelligent and safe. My research spans a broad area covering cryptography, data science/machine learning, and big data systems.
In this spirit I have dabbled in constraint learning for puzzle-solving AI, false-discovery control in data science, approximate data structures for visualization, database design on hybrid memory, consistency control for stochastic machine learning algorithms, and searchable encryption on mobile text messaging.
Security and Cryptography
Behavior of Large Random Graph.
Zhao, supervised by Prof. Paul Dupius.
Randomized Algorithms for Counting, Integration and Optimzation, Brown University, April 2017.
Investigating the Effect of the Multiple Comparisons Problem in Visual Analysis.
Zgraggen, Zhao, Zeleznik, and Kraska.
CHI, April 2018. [Video], [Software], [Review on Medium]
Controlling False Discoveries During Interactive Data Exploration.
Zhao, De Stefani, Zgraggen, Binnig, Upfal and Kraska.
SIGMOD, May 2017. [Review on Medium]
Towards Sustainable Insights, or Why Polygamy is Bad for You.
Binnig, De Stefani, Kraska, Upfal, Zgraggen and Zhao.
CIDR, January 2017. [Code], [Review by Adrian Coyler]
Towards a Benchmark for Interactive Data Exploration.
Eichmann, Zgraggen, Zhao, Binnig, Kraska.
IEEE Data Engineering Bulletin, 2016.
VisTrees: Fast Indexes for Interactive Data Exploration.
El-Hindi, Zhao, Binnig and Kraska.
SIGMOD HILDA, June 2016.
Bridging the Gap between HPC and Big Data frameworks.
Anderson, Smith, Sundaram, Capota, Zhao, Dulloor, Satish and Willke.
VLDB, 2017. [Spark performance tool]
Larger-than-memory Data Management on Modern Storage Hardware for In-memory OLTP Database Systems.
Ma, Arulraj, Zhao, Pavlo, Dulloor, Giardino, Parkhurst, Gardner, Doshi and Zdonik.
SIGMOD DaMoN, June 2016.