Data Mining - Big Data Project 2017 - Uncertain Clustering.
U-DBSCAN is a density based algorithm that can cluster uncertain data using as distance measure a probabilistic distance between the random distributions of the points.
Usage:
- Download the U-DBSCAN.jar file
- If it is the first time that you run the algorithm and you want to use an existing dataset, create a new folder in your documents named "bdmpFiles" and inside it create a folder named "input".
If, in your first execution, you don't want to use an existing dataset simply go to 4) point. - Put "dataset.txt" and "datasetInfo.txt" in the input folder
- Execute it with "java -jar U-DBSCAN.jar"
- Output will be saved into "bdmpFiles/output" folder