Skip to content

Bernardo1998/Sampling-and-sketching-methods-for-machine-learning

 
 

Repository files navigation

Sampling-and-sketching-methods-for-machine-learning

Introduction to data science course project

Dataset for coreset problems:

  1. Covtype.binary dataset: https://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets/binary.html
  2. Credit card dataset: https://archive.ics.uci.edu/ml/datasets/default+of+credit+card+clients
  3. Skin dataset: https://archive.ics.uci.edu/ml/datasets/Skin+Segmentation/

Dataset for feature reduction probles:

  1. WebSpam dataset: Refered to in https://arxiv.org/pdf/1105.4385.pdf

Papers referred:

  1. Training Support Vector Machines using Coresets by Baykal et al.
  2. Coresets for data-efficient training of machine learning models by Mirzasoleiman et al.

About

Introduction to data science course project

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%