Data Science

Data Compression

  • Mak, S. and V. Roshan Joseph. (2018) “Support Points”. Annals of Statistics, 46, 2562-2592. R package: support.
  • Mak, S. and V. Roshan Joseph. “Projected Support Points: A New Method for High-Dimensional Data Reduction”. arXiv
  • V. Roshan Joseph and Mak, S. (2021). “Supervised Compression of Big Data”. Statistical Analysis and Data Mining: The ASA Data Science Journal, 14, 217-229. R package: supercompress.

Data Splitting

Data Twinning

Factor Importance

  • Huang, C. and V. Roshan Joseph. “Factor Importance Ranking and Selection using Total Indices”. arXiv.