I am interested in enabling the efficient management of big data by designing novel high performance data systems. To that end, I am currently wrapping up a PhD at the intersection of data systems and machine learning as a member of the Data Systems Group at the Computer Science and Artificial Intelligence Lab (CSAIL) of the Massachusetts Institute of Technology. During my time at MIT, I have worked on projects with Prof. Tim Kraska, Prof. Michael Cafarella and Prof. Samuel Madden. I have also interned at Intel as a Graduate Research intern in the summer of 2021, and at Amazon Web Services as an Applied Scientist intern in the summers of 2023 and 2025.
I am currently looking for an industry position starting June 2026.
Before joining MIT, I earned my Bachelor's of Science in Engineering (B.S.E.) in Electrical Engineering from Princeton University, alongside a certificate (minor) in Applications of Computing. For my undergraduate thesis, I had the honor of working with Prof. Margaret Martonosi on efficient memory consistency testing, as well as on formal verification for the DECADES project.
![]() |
05/2025 - 08/2025 | Amazon Web Services Applied Scientist Intern - Boston, MA
|
![]() |
05/2023 - 08/2023 | Amazon Web Services Applied Scientist Intern - Boston, MA
|
![]() |
06/2021 - 08/2021 | Intel Corporation Graduate Research Intern - Munich, Germany
|
![]() |
06/2019 - 08/2019 | McKinsey & Company Summer Business Analyst Intern - Athens, Greece
|
|
06/2018 - 08/2018 | EY (Ernst & Young) Performance Improvement Intern - Athens, Greece
|
|
06/2022 - Present Expected 05/2026 |
Massachusetts Institute of Technology
|
![]() |
09/2020 - 05/2022 |
Massachusetts Institute of Technology
|
| 09/2016 - 06/2020 |
Princeton University — Princeton, NJ
|
![]() |
2025/09/04 | From Logs to Causal Inference: Diagnosing Large Systems
Markos Markakis, Brit Youngmann, Trinity Gao, Ziyu Zhang, Rana Shahout, Peter Baile Chen, Chunwei Liu, Ibrahim Sabek, and Michael Cafarella 51st International Conference on Very Large Data Bases (VLDB 2025), London, UK |
![]() |
2025/09/03 | Improving DBMS Scheduling Decisions with Fine-grained Performance Prediction on
Concurrent Queries
Ziniu Wu, Markos Markakis, Chunwei Liu, Peter Baile Chen, Balakrishnan (Murali) Narayanaswamy, Tim Kraska, and Samuel Madden 51st International Conference on Very Large Data Bases (VLDB 2025), London, UK |
![]() |
2025/05/10 | Cloud Data Processing with Cost-Efficient Latency SLOs using Probabilistic Query
Performance Prediction
Markos Markakis, Ziniu Wu, Tim Kraska 1st Hellenic American Meeting of Early-Career Researchers (HAMER 2025), Cambridge, MA |
![]() |
2024/06/14 | Press ECCS to Doubt (Your Causal Graph)
Markos Markakis, Ziyu Zhang, Rana Shahout, Trinity Gao, Chunwei Liu, Ibrahim Sabek, Michael Cafarella 1st Workshop on Governance, Understanding and Integration of Data for Effective and Responsible AI (GUIDE-AI 2024), Santiago, Chile |
![]() |
2024/06/11 | Sawmill: From Logs to Causal Diagnosis of Large Systems
Markos Markakis, An Bo Chen, Brit Youngmann, Trinity Gao, Ziyu Zhang, Rana Shahout, Peter Baile Chen, Chunwei Liu, Ibrahim Sabek, Michael Cafarella 2024 ACM SIGMOD/PODS International Conference on Management of Data (SIGMOD 2024), Santiago, Chile |
![]() |
2024/05/23 | Sawmill: From Logs to Causal Diagnosis of Large Systems
Markos Markakis, An Bo Chen, Brit Youngmann, Trinity Gao, Ziyu Zhang, Rana Shahout, Peter Baile Chen, Chunwei Liu, Ibrahim Sabek, Michael Cafarella North East Database (NEDB) Day 2024, Boston, MA |
![]() |
2024/05/23 | Virtualizing Cloud Data Infrastructures with BRAD
Geoffrey X. Yu, Ziniu Wu, Ferdinand Kossmann, Tianyu Li, Markos Markakis, Amadou L Ngom, Tim Kraska, Samuel Madden North East Database (NEDB) Day 2024, Boston, MA |
![]() |
2024/05/23 | Press ECCS to Doubt (Your Causal Graph)
Markos Markakis, Ziyu Zhang, Rana Shahout, Trinity Gao, Chunwei Liu, Ibrahim Sabek, Michael Cafarella North East Database (NEDB) Day 2024, Boston, MA |
![]() |
2024/04/03 | Sawmill: Extracting Log Data for Causal Diagnosis of Large Systems
Markos Markakis, Brit Youngmann, Trinity Gao, Ziyu Zhang, Rana Shahout, Peter Baile Chen, Chunwei Liu, Ibrahim Sabek, Michael Cafarella CSAIL Alliances Annual Meeting 2024, Cambridge, MA |
![]() |
2023/08/31 | TreeLine: An Update-In-Place Key-Value Store for Modern Storage
Geoffrey X. Yu*, Markos Markakis*, Andreas Kipf*, Per-Ake Larson, Umar Farooq Minhas, and Tim Kraska 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada |
![]() |
2023/08/29 | Check Out the Big Brain on BRAD: Simplifying Cloud Data Processing with Learned
Automated Data Meshes
Tim Kraska*, Tianyu Li*, Samuel Madden*, Markos Markakis*, Amadou Ngom*, Ziniu Wu*, and Geoffrey X. Yu* 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada |
![]() |
2023/03/10 | Learning-Based Creation of Data Mesh Architectures
Tim Kraska*, Tianyu Li*, Samuel Madden*, Markos Markakis*, Amadou Ngom*, Ziniu Wu*, and Geoffrey X. Yu* North East Database (NEDB) Day 2023, Boston, MA |
![]() |
2023/03/10 | Automatically Extracting and Annotating Models From Scientific Publications and
Code
Markos Markakis, Chunwei Liu, Peter Baile Chen, Michael Cafarella North East Database (NEDB) Day 2023, Boston, MA |
![]() |
2023/03/10 | TreeLine: An Update-In-Place Key-Value Store for Modern Storage
Geoffrey X. Yu*, Markos Markakis*, Andreas Kipf*, Per-Ake Larson, Umar Farooq Minhas, and Tim Kraska North East Database (NEDB) Day 2023, Boston, MA |
![]() |
2022/10/20 | TreeLine: An Update-In-Place Key-Value Store for Modern Storage
Geoffrey X. Yu*, Markos Markakis*, Andreas Kipf*, Per-Ake Larson, Umar Farooq Minhas, and Tim Kraska DSAIL Retreat, Cambridge, MA |
| 2025/04 - Present | Proceedings of the VLDB Endowment (PVLDB), Volume 19 (For VLDB 2026) | |
| 2025 | ACM Computing Surveys (CSUR), Volume 57 (2025) |