The problem Givena set of training cases/objects and their attribute values, try to determine the target attribute value of new examples. – Classification – Prediction 3
4.
Key Requirements Attribute-valuedescription: object or case must be expressible in terms of a fixed collection of properties or attributes (e.g., hot, mild, cold). Predefined classes (target values): the target function has discrete output values (bollean or multiclass) Sufficient data: enough training cases should be provided to learn the model. 4
Principled Criterion Choosingthe most useful attribute for classifying examples. Entropy - A measure of homogeneity of the set of examples - If the sample is completely homogeneous the entropy is zero and if the sample is an equally divided it has entropy of one Information Gain - Measures how well a given attribute separates the training examples according to their target classification - This measure is used to select among the candidate attributes at each step while growing the tree 6