I am reading spark mllib documentation and in decision tree documentation it says -
Each partition is chosen greedily by selecting the best split from a set of possible splits, in order to maximize the information gain at a tree node. I am not able to understand -
- the partition that we are talking about, is it spark data partition or feature partition
- Or could it be splits on each data partition?