I have a large amount of data for which I need to learn a classifier model.
Since I don't have labels for my data (i.e. my classes aren't defined), I am planning to use a clustering-based model. Could I use Random Forests or a Gradient Boosting algorithm in an unsupervised way for classification? Are these good approaches for unlabelled data?

Best Answer

  • Unsupervised classification algorithms are summarized in the following link.
    I think a simple k-means clustering algorithm would be a good first step for your data. Try the kmeans function and see whether it reveals any hidden patterns in your data.
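
To give a rough idea of what k-means does, here is a minimal sketch of the standard algorithm (Lloyd's iteration) in plain Python. It is not the kmeans function mentioned above, just an illustration; the toy points, the choice of k=2, and the helper name `kmeans_sketch` are all assumptions made up for this example.

```python
import random


def kmeans_sketch(points, k, n_iter=100, seed=0):
    """Illustrative k-means (Lloyd's algorithm). Returns (centroids, labels)."""
    rng = random.Random(seed)
    # Start from k distinct data points chosen at random.
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest centroid
        # (squared Euclidean distance).
        new_labels = [
            min(
                range(k),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(p, centroids[j])),
            )
            for p in points
        ]
        # Stop once the assignments no longer change.
        if new_labels == labels:
            break
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = [sum(c) / len(members) for c in zip(*members)]
    return centroids, labels


# Hypothetical toy data: two well-separated groups of 2-D points.
points = [(1.0, 1.1), (0.9, 1.0), (1.2, 0.8), (8.0, 8.1), (7.9, 8.3), (8.2, 7.9)]
centroids, labels = kmeans_sketch(points, k=2)
print(labels)
print(centroids)
```

The cluster indices it returns are arbitrary IDs, not class labels: you still have to inspect the clusters to decide whether they correspond to anything meaningful, and you have to pick k yourself.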