Firas Abuzaid
Photo credits: Hector Garcia-Molina

Hi, I’m Firas Abuzaid; thanks for visiting my website. I’m a 4th-year Ph.D. student in Computer Science at Stanford University, co-advised by Matei Zaharia and Peter Bailis.

As a member of the FutureData Systems Group and the Stanford DAWN Project, I focus on the intersection of systems and machine learning: how to take high-level machine learning tasks and build software systems to improve the efficiency of these tasks without sacrificing their accuracy.

I spent the first year of my Ph.D. at MIT CSAIL, working under Matei Zaharia and Sam Madden as part of the MIT DB group.

Prior to MIT, I was at Stanford, where I completed my B.S. and M.S. in Computer Science. I worked with Chris Ré on database systems and machine learning systems.

You can check out my CV here.

Latest Work


22x speed-ups for parsing JSON, Avro, and Parquet data.

S. Palkar, F. Abuzaid, P. Bailis, and M. Zaharia. Filter Before You Parse: Faster Analytics on Raw Data with Sparser, VLDB 2018.

[Code] [Blog Post]


100x speedups for CNN evaluation on video streams.

D. Kang, J. Emmons, F. Abuzaid, P. Bailis, and M. Zaharia. NoScope: Optimizing Neural Network Queries over Video at Scale, VLDB 2017.

[Code] [Blog Post]


A project on training deep decision trees at scale. Compatible with Spark MLlib 1.6+.

F. Abuzaid, J. Bradley, F. Liang, A. Feng, L. Yang, M. Zaharia, and A. Talwalkar. Yggdrasil: An Optimized System for Training Deep Decision Trees at Scale, NIPS 2016.

[Code] [Spark Package] [Slides from ML Systems Workshop, NIPS 2016] [Talk from Spark Summit 2016]

Contact Me

Feel free to shoot me an email at; you can also hit me up on Twitter. Or, if you're on campus, stop by my office in Gates 432!