2 questions on Smart Analytics (Bank Data) Use Case from App Market




  • Jason Arrigo

    Hello Rahul and thanks for your careful consideration!

    To answer:
    1) The statement "the duration of the call had the greatest impact on the number of purchased products" is true even though we do not know the number of product that are being purchased! The chart with blue bars in the PDF is showing us that the duration of the call has the highest correlation to whether the customs purchased a product or not. We don't really know whether it increased the number or decreased the number but we know that it affected the number of customers with a PurchasedProduct and therefore the number of PurchasedProducts.

    2) The numbers are off due to the size of the Validation Size and Pruning, but this is expected behavior.

    With pruning enabled the default validation size fraction of 0.25 is used. Pruning requires a performance measurement. The algorithm needs test data for that. So it is required to leave out some data and not to use them when building the tree, resulting in a reduced number of records.

    We are working on improving how the Flipside visualization informs users of this. Rest assured that when the algorithm model is built, ALL records are calculated and the data on the sheet displays the full results. You can easily validate this fact by adding an additional sheet where:

    Column A = GROUPBY(#DecisionTreeSheet1!Prediction)
    Column B = GROUPCOUNT()

    This gives you no=4,155 and yes=366. The total is your original 4,521.


    Comment actions Permalink
  • Rahul Dhond

    Hi Jason,
    Thanks a lot for your detailed answer. I appreciate it.

    Comment actions Permalink

Please sign in to leave a comment.