Algorithm behind Column Dependencies feature
Can anyone throw some light on the algorithm behind Column Dependencies feature in Smart Analytics? I know how the feature works in Datameer but I dont know how the dependency numbers are actually calculated and hence the curiosity.
thanks & regards,
Rahul
-
Hello Rahul,
The column dependencies feature uses Mutual Information between each pair of columns: https://en.wikipedia.org/wiki/Mutual_information since it is not limited to real-valued random variables like the correlation coefficient.Hope this helps.
Best,
Jana
Please sign in to leave a comment.
Comments
2 comments