I have a large CSV file with data similar to this:

    User ID  Group
    ABC      Group1
    DEF      Group2
    ABC      Group3
    GHI      Group4
    XYZ      Group2
    UVW      Group5
    XYZ      Group1
    ABC      Group1
    DEF      Group2

I need to count how many times each group is repeated for each user ID, so that I get a result like this:
    ABC Group1 -> 2
    ABC Group3 -> 1
    DEF Group2 -> 2
    GHI Group4 -> 1
    UVW Group5 -> 1
    XYZ Group2 -> 1
    XYZ Group1 -> 1

Is there a clustering algorithm that can do this?
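
To make the desired result concrete, here is a minimal sketch of the counting in Python, using only the standard library. It is plain counting of (user ID, group) pairs rather than clustering. It assumes the file is comma-separated with a header row named `User ID,Group`; the inline `SAMPLE` string and the `count_pairs` helper are hypothetical stand-ins for the real file and are not part of my actual setup.

```python
import csv
import io
from collections import Counter

# Hypothetical stand-in for the real CSV file; a comma delimiter and
# these header names are assumptions.
SAMPLE = """User ID,Group
ABC,Group1
DEF,Group2
ABC,Group3
GHI,Group4
XYZ,Group2
UVW,Group5
XYZ,Group1
ABC,Group1
DEF,Group2
"""


def count_pairs(csv_file):
    """Count how often each (user ID, group) pair occurs in the file."""
    reader = csv.DictReader(csv_file)
    # Rows are streamed one at a time, so a large file is not loaded whole.
    return Counter((row["User ID"], row["Group"]) for row in reader)


counts = count_pairs(io.StringIO(SAMPLE))

# Print the pairs in sorted order, one per line.
for (user, group), n in sorted(counts.items()):
    print(f"{user} {group} -> {n}")
```

For a real file, `io.StringIO(SAMPLE)` would presumably be replaced by something like `open(path, newline="")`, where `path` is whatever the file is called.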