# Sivan Sabato

## Department of Computer Science, Ben Gurion University of the Negev

## sabatos@cs.bgu.ac.il

The following

github project provides the code of the algorithm and experiments described in the following paper:

N. Ben-David,

**S. Sabato**, "A Fast Algorithm for PAC Combinatorial Pure Exploration; AAAI-22.

The following

github project provides the code of the algorithm and experiments described in the following paper:

N. Barak,

**S. Sabato**, "Approximating a Distribution using Weight Queries", ICML 2021.

The following

github project provides the code of the algorithm and experiments described in the following paper:

S. Schnapp,

**S. Sabato**, "Active Feature Selection for the Mutual Information Criterion", AAAI 2021.

The following

github project provides the code of the algorithm and experiments described in the following paper:

**S. Sabato**, E. Yom-Tov, "Bounding the fairness and accuracy of classifiers from population statistics", The 37th International Conference on Machine Learning (ICML), 2020,

The following

github project provides the code of the algorithm and experiments described in the following paper:

T. Hess,

**S. Sabato**, "Sequential no-Substitution k-Median-Clustering", The 23rd International Conference on
Artificial Intelligence and Statistics (AISTATS), 2020.

The following

github project provides the code for the anomaly detection algorithm, as well as the TDA data set, which are described in the following paper:

E. Gutflaish, A. Kontorovich,

**S. Sabato**, O. Biller, O. Sofer, "Temporal anomaly detection: calibrating the surprise", 33rd AAAI Conference on Artificial Intelligence (AAAI 2019),

The following

github project provides the code of the algorithm and experiments described in the following paper:

E. Barash, N. Sal-Man,

**S. Sabato**, M. Ziv-Ukelson, "BacPaCS-Bacterial Pathogenicity Classification via Sparse-SVM", Bioinformatics, 35(12):2001-2008, 2018.

The following Matlab code implements the linear regression algorithms described in

D. Hsu and

**S. Sabato**, "

Heavy-tailed Regression with a Generalized Median-of-means", The 31st International Conference on Machine Learning (ICML), 2014.

Since this algorithm is geared towards low squared loss

**with high probability**, its benefits are best demonstrated by running a large number of experiments with different random samples, and estimating the worst loss achieved in, for instance, 95% of the experiments.

The Matlab code below implements the algorithms described in

**S. Sabato** and A. Kalai, "

Feature Multi-Selection among Subjective Features", The 30th International Conference on Machine Learning (ICML), 2013.

The algorithms perform feature multi-selection for regression, for features that are obtained by noisy measurements.

**Input:** a labeled training set with two measurements for each feature value, and a limit to the total number of feature measurements allowed for an example during test time.
**Output:** a linear predictor, along with counts of the number of times each feature should be measured at prediction time.

Matlab Code: