Clustering (K-means) feature

Comments

2 comments

  • Jason Arrigo

    Hello Rahul and thanks for your question!

    We use indicator values to allow K-means to represent STRING columns. The number of indicator values is a setting that can be changed in the Advanced tab of the Clustering Wizards dialog box to change the STRINGs into INTEGERS for seamless usage of Clusters.
    You always want to make sure that you are leveraging the right amount of Indicator Values to best represent your data. To do this, examine the cardinality of the columns you wish to use. This can be easily accomplished using the Flipside in Datameer.
    If you find that you have a cardinality below 100 (the max allowable number of Indicator values) you can adjust the setting to match your Included Column cardinality.

    I hope that was able to clear up your question!
    Thanks,
    Jason

    0
    Comment actions Permalink
  • Rahul Dhond

    Thanks !
    regards,
    Rahul

    0
    Comment actions Permalink

Please sign in to leave a comment.