I'm wanting to identify which 8-bit microprocessor would have the best performance for a multiply-accumulate operation.
By "operation", I mean the minimal implementation for 16-bit operands and 32-bit result in assembly language.
By "8-bit microprocessor", I mean an early microprocessor (pre-1990) having an 8-bit external data bus -- I don't care about ALU or register size.
By "best performance", I mean lowest elapsed wall-clock-time to compute and store result.
Is there an objective answer, probably based on the CPU having a "fast" hardware multiplier, while sporting sufficient scratchpad registers and high clock rate?