Imagine I have a time series measurement of the 1 tool. Each measurement is labeled by the ordinal category (3 categories) and measurements are non-overlapping.
What I want to do is to test a significant difference between the groups based on aggregated statistics for each measurement (like standard deviation, kurtosis etc.). For example, I have 1000 measurements, each measurement has 10000 points. So for each measurement I calculate aggregated statistics. Based on these stats I want to test the significant differences between the labels (for each aggregated stats).
What would be the best tests to do? I am mostly interested in the tests between a pair of groups( For example: Standard Deviation of Label 1 vs standard Deviation of Label 2). I was thinking about Mann–Whitney, but I am not sure.
The aggregated statistics data are not normally distributed.