Rana Alotaibi,
Yuanyuan Tian,
Stefan Grafberger,
Jesus Camacho-Rodriguez,
Nicolas Bruno,
Brian Kroth,
Sergiy Matusevych,
Ashvin Agrawal,
Mahesh Behera,
Ashit Gosalia,
Cesar Galindo-Legaria,
Milind Joshi,
Milan Potocnik,
Beysim Sezgin,
Xiaoyu Li,
Carlo Curino
(2024).
Towards Query Optimizer as a Service (QOaaS) in a Unified LakeHouse Platform: Can One QO Rule Them All?.
Conference on Innovative Data Systems Research (CIDR).
Stefan Grafberger,
Paul Groth,
Julia Stoyanovich,
Sebastian Schelter
(2021).
Data Distribution Debugging in Machine Learning Pipelines.
The VLDB Journal — The International Journal on Very Large Data Bases (Special Issue on Data Science for Responsible Data Management).
Sebastian Schelter,
Stefan Grafberger,
Philipp Schmidt,
Tammo Rukat,
Mario Kiessling,
Andrey Taptunov,
Felix Biessmann,
Dustin Lange
(2019).
Differential Data Quality Verification on Partitioned Data.
International Conference on Data Engineering (ICDE).
Sebastian Schelter,
Stefan Grafberger,
Philipp Schmidt,
Tammo Rukat,
Mario Kiessling,
Andrey Taptunov,
Felix Biessmann,
Dustin Lange
(2018).
Deequ - Data Quality Validation for Machine Learning Pipelines.
Machine Learning Systems workshop at the conference on Neural Information Processing Systems (NeurIPS).