I have a dataset with 100 columns and 100k rows. For each row, how can I print the maximum value together with its row and column names, but only if that maximum (e.g. 20.17 for g1) is more than 2 times the median of the remaining values in the row (0.21 and 0.57)? This should be done separately for each row name, and the median must be calculated from the rest of the numbers only, excluding the maximum itself.
FYI: This has been answered before, but only for a small dataset with a few columns and rows.
sample input

name s1 s2 s3
g1 20.17 0.21 0.57
g2 0.19 0.19 94.0
g3 0.15 0.21 0.26
g4 0.09 0.19 0.16
g5 0.019 0.19 0
g7 2.28 0 0

sample output

g1 s1 20.17
g2 s3 94.0
g7 s1 2.28
20.17 to 20? Is that a typo or do you want some sort of transformation? Please edit your question and clarify.
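One possible approach, as a minimal sketch rather than a definitive answer: read the file one line at a time, find the maximum in each row, take the median of the remaining values, and report the row only if the maximum is more than `factor` times that median. The function name `rows_with_dominant_max`, the `factor` parameter, and the `SAMPLE` string are hypothetical names made up for this illustration, and the sketch assumes whitespace-separated columns with a header on the first line. If a row has several equal maxima, the first one is treated as the maximum. Note that under this literal reading of the rule, g5 also qualifies (0.19 against a median of 0.0095 for the other two values), which differs from the sample output in the question, so the intended rule may need clarifying.

```python
import io
from statistics import median

# Hypothetical sample data copied from the question (whitespace-separated).
SAMPLE = """\
name s1 s2 s3
g1 20.17 0.21 0.57
g2 0.19 0.19 94.0
g3 0.15 0.21 0.26
g4 0.09 0.19 0.16
g5 0.019 0.19 0
g7 2.28 0 0
"""


def rows_with_dominant_max(lines, factor=2.0):
    """Yield (row_name, column_name, max_value) for qualifying rows.

    A row qualifies when its maximum is more than `factor` times the
    median of the other values in the same row (the maximum excluded).
    """
    it = iter(lines)
    columns = next(it).split()[1:]  # header: skip the row-name column
    for line in it:
        fields = line.split()
        if not fields:
            continue  # ignore blank lines
        name = fields[0]
        values = [float(x) for x in fields[1:]]
        # Index of the first occurrence of the row maximum.
        max_idx = max(range(len(values)), key=values.__getitem__)
        max_val = values[max_idx]
        # All values except the one at max_idx.
        rest = values[:max_idx] + values[max_idx + 1:]
        if rest and max_val > factor * median(rest):
            yield name, columns[max_idx], max_val


# Rows are processed one at a time, so memory use stays at one row
# regardless of how many rows the input has.
results = list(rows_with_dominant_max(io.StringIO(SAMPLE)))
for name, column, value in results:
    print(name, column, value)
```

For a real file, the same generator can be fed an open file handle instead of `io.StringIO(SAMPLE)`, e.g. `with open(path) as fh: ...`, where `path` is whatever your file is called.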