Publications

(2021). HedgeCut: Maintaining Randomized Trees for Low-Latency Machine Unlearning. ACM SIGMOD.

PDF

(2021). mlinspect: a Data Distribution Debugger for Machine Learning Pipelines. ACM SIGMOD (demo).

PDF

(2020). Lightweight Inspection of Data Preprocessing in Native Machine Learning Pipelines. Conference on Innovative Data Systems Research (CIDR).

PDF

(2019). Differential Data Quality Verification on Partitioned Data. International Conference on Data Engineering (ICDE).

PDF

(2018). Deequ - Data Quality Validation for Machine Learning Pipelines. Machine Learning Systems workshop at the conference on Neural Information Processing Systems (NeurIPS).

PDF