Merkle & Steyvers (2013) write:
To formally define a proper scoring rule, let f$f$ be a probabilistic forecast of a Bernoulli trial d$d$ with true success probability p$p$. Proper scoring rules are metrics whose expected values are minimized if f = p$f = p$.
I get that this is good because we want to encourage forecasters to generating forecasts that honestly reflect their true beliefs, and don't want to give them perverse incentives to do otherwise.
Are there any real-world examples in which it's appropriate to use an improper scoring rule?
Merkle, E. C., & Steyvers, M. (2013). Choosing a strictly proper scoring rule. Decision Analysis, 10(4), 292-304.Reference
Merkle, E. C., & Steyvers, M. (2013). Choosing a strictly proper scoring rule. Decision Analysis, 10(4), 292-304