Recent News
- In Spring 2023, I left academia to concentrate fully on building the Internet Computer at DFINITY.
- In June 2022, I left AWS Lambda.
- The paper I co-authored at AWS with SeongJae Park on data-aware
memory management in Linux was accepted at ACM HPDC (19% acceptance ratio).
- Our paper on indexing in large-scale data processing systems was accepted at IPDPS in the first reviewing round (10% acceptance ratio).
The paper is a collaboration with Databricks, CWI, and TU Delft. Many thanks for the help.
- Our group in Leiden and I were featured in the NWO I/O Magazine.
- I have been awarded an NWO VENI grant of €250,000 for my project "Practical Performance Reproducibility in Cloud Systems Research" (acceptance ratio ~12%).
- Our paper on running reproducible big data experiments in clouds was accepted and presented at NSDI in February 2020. Slides can be found here. Many thanks to the teams at the University of Utah and UC Santa Cruz for the help.
General Information
I am a senior researcher at DFINITY, building the Internet Computer.
Previously, I was an assistant professor at LIACS, Leiden University, working in the Computer Systems group and an Amazon Visiting Academic working with AWS Lambda.
I am a computer scientist with experience in both industry and academia, building large-scale distributed systems and infrastructure.
My focus is on empirical performance evaluation with a strong emphasis on understanding, evaluating, and improving the performance of large-scale systems.
I am interested in and have experience with serverless platforms, data processing systems, general distributed (storage) systems, and cloud computing.
I have taught several courses on these topics and trained and supervised students at all academic levels.
Previously, I was a postdoctoral researcher in the Computer Systems
Section at the Department of Computer Science of the Vrije Universiteit Amsterdam in the
Massivizing Computer Systems Group, led by prof.dr.ir. Alexandru Iosup.
In the summer of 2019, I was a research visitor in the Flux Research Group at the University of Utah, working closely with dr. Robert Ricci.
In the summer of 2017, I worked as a research intern at Databricks, where I designed and implemented a distributed index on Spark in collaboration
with prof.dr. Peter Boncz.
Previously, I briefly worked as a postdoctoral researcher under the supervision of prof.dr.ir. Henri Bal
on scalable IoT infrastructures.
In March 2017, I received my PhD from the Vrije Universiteit with the thesis "Optimizing the Execution of Many-Task Computing Applications Using In-Memory Distributed File Systems",
under the supervision of Dr.-Ing. habil. Thilo Kielmann and prof.dr.ir. Henri Bal.
In 2012, I completed my MSc with honors ("cum laude") in high-performance distributed computing at Vrije Universiteit Amsterdam, with a thesis on
GPU-accelerated video encoding, under the supervision of dr. Frank Seinstra.
I received my BSc diploma in 2009 in my home country, Romania, at the University of Bucharest, Faculty of Mathematics and Computer Science.
Please drop me a line if you want to collaborate or want to do a BSc or MSc project under my supervision. For more information on the kind of
projects I supervise, please see my Projects or Publications.
Grants
- NWO VENI - PI, Practical Performance Reproducibility in Cloud Systems Research -- €250,000. Project on studying how to achieve reproducible performance when running experiments in clouds.
- SURFsara, 2019-2020 - PI, DLPerf, Pilot Project for 500K Cartesius Cluster hours. Project on studying the performance bottlenecks of Deep-Learning Workloads.
- SURFsara, 2019 - PI, Granular Graph Processing, 50K compute hours for usage in the SURFsara HPCCloud. Project on assessing the impact of FaaS and Serverless paradigms on Graph Analytics.
- Google, 2018 - PI, Granular Graph Processing, $5,000 grant for usage in the Google Cloud. Project on assessing the impact of FaaS and Serverless paradigms on Graph Analytics.
- SURFsara, 2018 - PI, BDCloudVar, 70K compute hours in the SURFsara cloud. Project on studying the effects of performance variability on Big Data workloads.
- NWO, 2017-2018 - PI, HPGraph, Pilot Project for 500K Cartesius Cluster hours. Project on studying the HPC and Big Data convergence.
- Intel, 2017-2018 - PI, Intel gift for unlimited use of an Intel KNL cluster of 256 nodes.
Awards
Best paper award IEEE IUCC conference | 2017 |
Best e-Science Service or Project, IEEE eScience conference | 2015 |
IEEE TCSC CCGrid Scale Challenge Finalist | 2015 |
Best poster award, IEEE CLUSTER conference | 2014 |
Service
Conference/Workshop Organization
Reproducibility Co-Chair | UCC | 16th International Conference on Utility and Cloud Computing | 2021 |
Reproducibility Co-Chair | PDSW | 6th International Parallel Data Systems Workshop | 2021 |
Co-Chair | HotCloudPerf | 4th Workshop on Hot Topics in Cloud Computing Performance | 2021 |
Co-Chair | HotCloudPerf | 3rd Workshop on Hot Topics in Cloud Computing Performance | 2020 |
Junior Program Chair | ISPDC | International Symposium on Parallel and Distributed Computing | 2019 |
Co-Chair | CCIW | 1st Workshop on Converged Computing Infrastructure | 2019 |
Co-Chair | HotCloudPerf | Workshop on Hot Topics in Cloud Computing Performance | 2019 |
Program Committee Member
IPDPS | International Parallel and Distributed Processing Symposium | 2022 |
SC | AD/AE track: Intl. Conf. for High-Performance Computing, Networking, Storage, and Analysis | 2020 |
LOD | International Conference on Machine Learning, Optimization, and Data Science | 2020 |
CPS-IoTBench | Workshop on Benchmarking Cyber-Physical Systems and Internet of Things | 2020 |
ICPE | International Conference on Performance Engineering | 2020, 2021 |
CCGrid | International Symposium on Cluster, Cloud and Grid Computing | 2019, 2020, 2022 |
CLUSTER | International Conference on Cluster Computing | 2019 |
EuroPar | European Conference on Parallel and Distributed Computing | 2019 |
HotCloudPerf | Workshop on Hot Topics in Cloud Computing Performance | 2018-2021 |
SCRAMBL | Workshop on Scalable Computing For Real-Time Big Data Applications | 2017, 2019 |
EuroSys | European Conference on Computer Systems, shadow PC | 2017 |
Journal Reviewer
ACM ToCS | Transactions on Computer Systems | 2020-ongoing |
IEEE TPDS | Transactions on Parallel and Distributed Systems | 2018-ongoing |
IEEE ToSE | Transactions on Software Engineering | 2018-ongoing |
IEEE Access | The Multidisciplinary Open Access Journal | 2018-ongoing |
Elsevier FGCS | Future Generation Computer Systems | 2015-ongoing |
External Reviewer
IPDPS | International Parallel & Distributed Processing Symposium | 2018 |
CCGrid | International Symposium on Cluster, Cloud and Grid Computing | 2018 |
ICPP | International Conference on Parallel Processing | 2017 |
CLUSTER | International Conference on Cluster Computing | 2014 |
Education
Teacher
Leiden University: MSc Distributed Data Processing Systems | 2020-present |
Leiden University: BSc Compiler Construction | 2020-present |
Co-Teacher
VU Amsterdam: BSc Systems Architecture | 2017-2019 |
VU Amsterdam: MSc Distributed Systems | 2017-2019 |
VU Amsterdam: BSc Research-first Honors Program | 2018-2019 |
Teaching Assistant
MSc Distributed Systems | 2016 |
MSc Large-scale Computing Infrastructure | 2014-2016 |
MSc Internet Programming | 2015 |
MSc Cluster and Grid Computing | 2013 |
MSc Computer Graphics | 2012 |
Publications
- SeongJae Park, Madhuparna Bhowmik, Alexandru Uta. DAOS: Data-aware Operating System. Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2022, Minneapolis, Minnesota, United States.
-
Yuxuan Zhao, Alexandru Uta:
Tiny Autoscalers for Tiny Workloads: Dynamic CPU Allocation for Serverless Functions. In Proceedings of the 22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid), 2022, Taormina, Italy.
-
Jayjeet Chakraborty, Ivo Jimenez, Sebastiaan Alvarez Rodriguez, Alexandru Uta, Jeff LeFevre, Carlos Maltzahn:
Skyhook: Towards an Arrow-Native Storage System. In Proceedings of the 22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid), 2022, Taormina, Italy.
-
Alexandru Uta, Bogdan Ghit, Ankur Dave, Jan Rellermeyer, Peter Boncz: In-Memory Indexed Caching for Distributed Data Processing. In Proceedings
of the 36th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2022, Lyon, France.
-
Amin Moradi, Alexandru Uta: Reproducible Model Sharing for AI Practitioners. In Fifth Workshop on Distributed Infrastructures for
Deep Learning (DIDL21). In conjunction with ACM Middleware '21, virtual, Canada.
-
Sebastiaan Alvarez Rodriguez, Jayjeet Chakraborty, Ivo Jimenez, Jeff LeFevre, Carlos Maltzahn, Alexandru Uta: Zero-Cost, Arrow-Enabled Data Interface for Apache Spark. 10th Workshop on Scalable Cloud Data Management, co-located with IEEE BigData 2021.
-
Yuxuan Zhao, Dmitry Duplyakin, Robert Ricci, Alexandru Uta: Cloud Performance Variability Prediction. In HotCloudPerf 2021, co-located with ICPE.
-
Alexandru Uta, Kristian Laursen, Alexandru Iosup, Paul Melis, Damian Podareanu, Valeriu Codreanu: Beneath the SURFace: An MRI-like View into the Life of a 21st-Century Datacenter.
;login: USENIX Magazine 45(3), 2020.
-
Dmitry Duplyakin, Alexandru Uta, Aleksander Maricq, Robert Ricci: In Datacenter Performance, The
Only Constant Is Change. 2020, The 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGrid), May 11-14,
Melbourne, Australia.
-
Alexandru Uta, Alexandru Custura, Dmitry Duplyakin, Ivo Jimenez, Jan Rellermeyer, Carlos Maltzahn, Robert Ricci, Alexandru Iosup:
Is Big Data Performance Reproducible In Modern Cloud Networks?, 2020, USENIX Networked Systems
Design and Implementation (NSDI), February 25-27, Santa Clara, USA.
- Ahmed Musaafir, Henk Dreuning, Alexandru Uta, Ana Varbanescu: A Sampling-Based Tool for Scaling Graph Datasets, 2020, ACM International
Conference on Performance Engineering (ICPE), 20-25 April, Edmonton, Canada.
-
Roshan Bharath-Das, Marc X. Makkes, Alexandru Uta, Lin Wang, Henri Bal: A Programming Environment for Heterogeneous Stream Analytics,
2019, IEEE International Conference on Big Data (IEEE BigData), December 9-12, Los Angeles, USA.
-
Roshan Bharath-Das, Marc X. Makkes, Alexandru Uta, Lin Wang, Henri Bal: Aves: A Decision Engine for Energy-efficient Stream Analytics across Low-power Devices,
2019, IEEE International Conference on Big Data (IEEE BigData), December 9-12, Los Angeles, USA.
-
Michel Cojocaru, Alexandru Uta, Ana-Maria Oprescu: MicroValid: A Validation Framework for Automatically Decomposed Microservices, 2019 11th IEEE International
Conference on Cloud Computing (CloudCom), 11-13 December, Sydney, Australia.
-
Dmitry Duplyakin, Alexandru Uta, Aleksander Maricq, and Robert Ricci: On Studying CPU Performance of CloudLab Hardware.
Midscale Education and Research Infrastructure and Tools (MERIT), October 7, 2019, Chicago, USA.
-
Alexandru Uta, Bogdan Ghit, Ankur Dave, Peter Boncz: [Demo] Low-latency Spark Queries on Updatable Data, 2019 ACM SIGMOD
International Conference on Management of Data, 1-5 July, Amsterdam, The Netherlands.
-
Lucian Toader, Alexandru Uta, Alexandru Iosup: Graphless: Toward Serverless Graph Processing, 2019 IEEE International
Symposium on Parallel and Distributed Computing (ISPDC), 5-7 June, Amsterdam, The Netherlands.
-
Michel Cojocaru, Alexandru Uta, Ana-Maria Oprescu: Attributes Assessing the Quality of Microservices Automatically Decomposed from Monolithic Applications, 2019 IEEE International
Symposium on Parallel and Distributed Computing (ISPDC), 5-7 June, Amsterdam, The Netherlands.
-
Maria Voinea, Alexandru Uta, Alexandru Iosup: POSUM: A Portfolio Scheduler for MapReduce Workloads, 2018 IEEE International Conference
on Big Data (IEEE BigData), December 10-13, Seattle, USA.
-
Alexandru Uta, Ana Lucia Varbanescu, Ahmed Musaafir, Chris Lemaire, Alexandru Iosup:
Exploring HPC and Big Data Convergence: a Graph
Processing Study on Intel Knights Landing, 2018 IEEE International Conference on Cluster Computing (CLUSTER),
September 2018, Belfast.
-
Alexandru Uta, Sietse Au, Alexey Ilyushkin, Alexandru Iosup:
Elasticity in Graph Analytics? A Benchmarking
Framework for Elastic Graph Processing, 2018 IEEE International Conference on Cluster Computing (CLUSTER),
September 2018, Belfast.
-
Erwin van Eyk, Lucian Toader, Sacheendra Talluri, Laurens Versluis, Alexandru Uta, Alexandru Iosup:
Serverless is More: From PaaS to Present Cloud Computing, IEEE Internet Computing, Sept, 2018.
-
Alexandru Iosup, Alexandru Uta, Laurens Versluis, Georgios Andreadis, Erwin van Eyk, Tim Hegeman, Sacheendra Talluri, Vincent van Beek, Lucian Toader: Massivizing Computer Systems: a Vision to Understand, Design, and Engineer Computer Ecosystems through and beyond Modern Distributed Systems, ICDCS, Vienna, Austria, July 2-5, 2018.
-
Alexandru Uta, Harry Obaseki: A Performance Study of Big Data Workloads in Cloud Datacenters with Network Variability, 1st Workshop on Hot Topics in Cloud Computing Performance, in conjunction with ICPE, 9 April
2018, Berlin.
-
Sietse Au, Alexandru Uta, Alexey Ilyushkin, Alexandru Iosup:
An Elasticity Study of Distributed Graph Processing, 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid),
May 2018, Washington.
-
Nicolae Vladimir Bozdog, Marc X. Makkes, Alexandru Uta, Roshan Bharath Das, Aart Van Halteren and Henri Bal:
SenseLE: Exploiting Spatial Locality in Decentralized Sensing Environments,
16th IEEE International Conference on Ubiquitous Computing and Communications (IUCC 2017), Guangzhou, China, December 12-15, 2017
(best paper award)
-
Marc X. Makkes, Alexandru Uta, Roshan Bharath Das, Vladimir Bozdog and Henri Bal:
P2-SWAN: Real-time Privacy Preserving Computation for IoT Ecosystems,
IEEE, 1st International Conference on Fog and Edge Computing (ICFEC 2017), May 2017, Madrid
-
Alexandru Uta, Ove Danner, Cas van der Weegen, Ana-Maria Oprescu, Andreea Sandu, Stefania Costache, Thilo Kielmann:
MemEFS: A network-aware elastic in-memory runtime distributed file system,
Future Generation Computer Systems, 2017, https://doi.org/10.1016/j.future.2017.03.017
-
Alexandru Uta, Ana-Maria Oprescu, Thilo Kielmann:
Towards Resource Disaggregation - Memory Scavenging for Scientific Workloads,
2016 IEEE International Conference on Cluster Computing (CLUSTER), September 2016, Taipei
-
Alexandru Uta, Andreea Sandu, Stefania Costache, Thilo Kielmann:
MemEFS: an elastic in-memory runtime file system for escience applications,
2015 IEEE 11th International Conference on e-Science (e-Science), August/September 2015, Munich (best eScience service or project)
-
Alexandru Uta, Andreea Sandu, Stefania Costache, Thilo Kielmann:
Scalable In-Memory Computing,
2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid),
May 2015, Shenzhen
-
Alexandru Uta, Andreea Sandu, Thilo Kielmann:
Overcoming data locality: An in-memory runtime file system with symmetrical data distribution,
Future Generation Computer Systems, 2015, https://doi.org/10.1016/j.future.2015.01.013
-
Alexandru Uta, Andreea Sandu, Thilo Kielmann:
POSTER: MemFS: An in-memory runtime file system with symmetrical data distribution,
2014 IEEE International Conference on Cluster Computing (CLUSTER), September 2014, Madrid (best poster award)
-
Alexandru Uta, Andreea Sandu, Ion Morozan, Thilo Kielmann:
In-memory runtime file systems for many-task computing,
International Workshop on Adaptive Resource Management and Scheduling for Cloud Computing,
co-located with PODC 2014, Paris
This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other
copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases,
these works may not be reposted without the explicit permission of the copyright holder.
Alexandru Uta