You might see a change in total data volume towards the license usage depending on how that data is ingested into Datameer. Learn how license usage for a data link is different from that of an import job.
License usage depends on various factors such as source data size, compression used, and whether the data is already in HDFS. The table below illustrates the main difference between how license usage is calculated for a data link and an import job.
|Source Data Size||Data Link Usage Current||Data Link Usage Total||Import Job Usage Current||Import Job Usage Total|
|25 MB||5 MB||5 MB||5 MB||5 MB|
|50 MB||10 MB||10 MB||10 MB||15 MB|
|75 MB||15 MB||15 MB||15 MB||30 MB|
|50 MB||10 MB||15 MB||10 MB||40 MB|
|25 MB||5 MB||15 MB||5 MB||45 MB|
Note that it's the maximum among different runs for a data link and it's cumulative for an import lob.
The above numbers are rounded off for illustration purposes, and reduction in a file size when it's stored in Hadoop is due to the compression used.