Examine how the ISR looks now… There is still the same hardware entry, typically 5 clocks. But now there is no need to push or pop registers, the ISR starts running immediately with no need for saving context on entry and there is no restoring of context on exit. And the hardware side of the return remains at 3 clocks. By setting aside a small block of registers, one can save many clocks over a standard interrupt. While the main line code may experience a small decrease in performance as a result of running off of a smaller register set in this mode, with the RX one has the choice of how one would like to optimize performance for their application.