We are testing/measuring performance on a new Env. Datameer on Hadoop Cluster against Local Mode Datameer.
1) The Workbooks that are using Data Links are quite faster on The Hadoop Cluster which is understandable, but the In memory jobs are quite slower. Can You please advise for a setting(s) change in order to improve this?
2) Smart Execution is not used on the Hadoop Cluster, although Spark and Tez plugins are enabled on Datameer. Also all the jobs are executed as "Standard MR job". How can We enable Smart Execution, what are the prerequisites ? (Maybe this is connected with Issue No.1)
3) The Size of the WBs is quite larger on the Hadoop Cluster. This is not due to replication, and We are using Gzip Compression. Any advice on that (maybe LZO is used as default in Local Mode)?
Looking forward to Your answers and Thanks in advance.
Please sign in to leave a comment.