2 Comments

Really great articles. Unfortunately l am living in embedded world. If you are able, please try to take into account different CPU architectures as well (short pipelines changes everything :)

Expand full comment
author

Thank you for the kind words, Jakub! Function overhead hurts all types of environments, including embedded, so what you're referring to is most likely a general impact of inlining large bodies of instructions on things like i-cache. Platform differences are the reason why I generally avoid prescribing the recipes for improving performance and instead recommend trying and measuring the impact of different approaches.

Expand full comment