The Corbeille of Nefta (La Corbeille de Nefta) is a unique natural and historical site located in the oasis town of Nefta in southern Tunisia, near the Algerian border. The site is known for its stunning beauty, and its natural springs water nearly half a million date palms, one of the largest…
Thoughtful stories for thoughtless times. Longreads has published hundreds of original stories—personal essays, reported features, reading lists, and more—and more than 13,000 editor’s picks. And they’re all funded by readers like you. Become a member today. In today’s…
New to me: a couple decades ago, author Kurt Vonnegut delivered a lecture on the shape of stories. He uses a diagrammatic line chart to illustrate. The y-axis represents a range from ill fortune to good fortune, and the x-axis represents the beginning to the end of a story. Vonnegut identifies four stories…
What It Means: To collaborate is to work with another person or group in order to do or achieve something. Collaborate can also be used disapprovingly to mean "to cooperate with or willingly assist an enemy of one's country and especially an enemy who occupies it during a war." // Several research…
Algorithmica is an open-access web book dedicated to the art and science of computing. It is created by Sergey Slotin and the teachers and students of Tinkoff Generation — a nonprofit educational organization that trains about half of the finalists of the Russian Olympiad in Informatics. The English…
This is an upcoming high-performance computing book titled “Algorithms for Modern Hardware” by Sergey Slotin. Its intended audience is everyone from performance engineers and practical algorithm researchers to undergraduate computer science students who have just finished an advanced algorithms…
If you ever opened a computer science textbook, it probably introduced computational complexity somewhere in the very beginning. Simply put, it is the total count of elementary operations (additions, multiplications, reads, writes…) that are executed during a computation, optionally weighted by…
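A minimal sketch of this way of counting (the example is mine, not from the excerpt): summing an n-element array costs roughly n reads and n additions, so its complexity is linear in n.

```
// Counting elementary operations: summing an n-element array performs
// about n memory reads and n additions, so its complexity is O(n).
int sum(const int *a, int n) {
    int s = 0;
    for (int i = 0; i < n; i++)  // n iterations
        s += a[i];               // 1 read + 1 add per iteration
    return s;
}
```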
The main disadvantage of the supercomputers of the 1960s wasn’t that they were slow — relatively speaking, they weren’t — but that they were giant, complex to use, and so expensive that only the governments of the world superpowers could afford them. Their size was the reason they were so expensive:…
If you are reading this book, then somewhere on your computer science journey you had a moment when you first started to care about the efficiency of your code. Mine was in high school, when I realized that making websites and doing useful programming won’t get you into a university, and entered the…
When I began learning how to optimize programs myself, one big mistake I made was to rely primarily on the empirical approach. Not understanding how computers really worked, I would semi-randomly swap nested loops, rearrange arithmetic, combine branch conditions, inline functions by hand, and follow…
As software engineers, we absolutely love building and using abstractions. Just imagine how much stuff happens when you load a URL. You type something on a keyboard; key presses are somehow detected by the OS and get sent to the browser; the browser parses the URL and asks the OS to make a network…
CPUs are controlled with machine language, which is just a stream of binary-encoded instructions that specify the instruction number (called opcode), what its operands are (if there are any), and where to store the result (if one is produced). A much more human-friendly rendition of machine language,…
Let’s consider a slightly more complex example: a loop that calculates the sum of a 32-bit integer array, just as a simple for loop would. The “body” of the loop is add edx, DWORD PTR [rax]: this instruction loads data from the iterator rax and adds it to the accumulator edx. Next, we move the iterator 4…
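A hedged reconstruction of that loop: the excerpt quotes only add edx, DWORD PTR [rax], so the surrounding instructions in the comments are assumptions pieced together from its description.

```
// The C++ loop the excerpt describes; the x86 assembly in the comments
// is reconstructed from the quoted instruction and the prose around it.
int sum(const int *a, const int *end) {
    int s = 0;                   // xor  edx, edx
    while (a != end) {           // loop:
        s += *a;                 //   add edx, DWORD PTR [rax]  (load + add)
        a++;                     //   add rax, 4                (advance 4 bytes)
    }                            //   cmp rax, rcx / jne loop
    return s;
}
```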
Computer engineers like to mentally split the pipeline of a CPU into two parts: the front-end, where instructions are fetched from memory and decoded, and the back-end, where they are scheduled and finally executed. Typically, the performance is bottlenecked by the execution stage, and for this…
To “call a function” in assembly, you need to jump to its beginning and then jump back. But then two important problems arise: What if the caller stores data in the same registers as the callee? Where is “back”? Both of these concerns can be solved by having a dedicated location in memory where we…
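A small sketch of the convention hinted at here: the stack stores both the return address (pushed by call, popped by ret) and any registers the callee needs to reuse. The assembly in the comments is illustrative and assumes the System V x86-64 ABI.

```
// Illustrating call/ret: `call` pushes the return address onto the stack,
// `ret` pops it and jumps back. (Asm comments are illustrative only.)
int square(int x) {     // x arrives in edi (System V ABI)
    return x * x;       // imul edi, edi / mov eax, edi / ret
}

int caller() {
    int r = square(5);  // call square  -> pushes address of the next instruction
    return r + 1;       // execution resumes here after `ret`
}
```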
During assembly, all labels are converted to addresses (absolute or relative) and then encoded into jump instructions. You can also jump by a non-constant value stored inside a register, which is called a computed jump. This has a few interesting applications related to dynamic languages and…
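A minimal sketch of a computed jump using the GCC/Clang labels-as-values extension, the dispatch technique interpreters of dynamic languages often rely on. The two-opcode “VM” here is hypothetical, invented for illustration.

```
#include <cstdio>

// A toy bytecode interpreter: opcode 0 increments, opcode 1 decrements.
int run(const int *code, int n) {
    static void *dispatch[] = {&&op_inc, &&op_dec};  // jump table of label addresses
    int acc = 0;
    for (int i = 0; i < n; i++) {
        goto *dispatch[code[i]];  // computed jump: the target comes from a register
    op_inc:
        acc++;
        continue;
    op_dec:
        acc--;
        continue;
    }
    return acc;
}

int main() {
    int program[] = {0, 0, 1};        // inc, inc, dec
    printf("%d\n", run(program, 3));  // prints 1
}
```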
When programmers hear the word parallelism, they mostly think about multi-core parallelism, the practice of explicitly splitting a computation into semi-independent threads that work together to solve a common problem. This type of parallelism is mainly about reducing latency and achieving…
Pipelining lets you hide the latencies of instructions by running them concurrently, but also creates some potential obstacles of its own — characteristically called pipeline hazards, that is, situations when the next instruction cannot execute on the following clock cycle. There are multiple ways…
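A compact illustration (example mine) of two classic kinds of hazard: a data hazard, where an instruction needs a result that is not ready yet, and a control hazard, where the CPU does not yet know which instruction to fetch next.

```
int hazards(int a, int b, const int *table) {
    int x = a * b;     // the multiply takes several cycles...
    int y = x + 1;     // ...data hazard: this add must wait for x
    if (y % 2 == 0)    // control hazard: the next instruction to fetch
        return table[0];  // depends on a result that isn't computed yet
    return table[1];
}
```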
When a CPU encounters a conditional jump or any other type of branching, it doesn’t just sit idle until its condition is computed — instead, it immediately starts speculatively executing the branch that seems more likely to be taken. During execution, the CPU computes statistics about branches taken…
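The classic demonstration of this effect, sketched from memory: a branch over random values is hard to predict, while the same branch over sorted data follows a pattern the predictor can learn.

```
#include <algorithm>
#include <vector>

long long sum_big(std::vector<int> &v) {
    // std::sort(v.begin(), v.end());  // uncommenting this typically makes
                                       // the loop below several times faster
    long long s = 0;
    for (int x : v)
        if (x >= 50)  // nearly random outcome on shuffled values in [0, 100)
            s += x;   // but a learnable pattern once the data is sorted
    return s;
}
```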
As we established in the previous section, branches that can’t be effectively predicted by the CPU are expensive, as they may cause a long pipeline stall to fetch new instructions after a branch mispredict. In this section, we discuss the means of removing branches in the first place.
#Predication
We…
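A minimal sketch of predication, assuming a simple threshold filter as the workload: the branch is replaced by arithmetic that always executes, leaving nothing to mispredict. Compilers often emit a conditional move (cmov) for the second version on their own.

```
long long sum_branchy(const int *a, int n) {
    long long s = 0;
    for (int i = 0; i < n; i++)
        if (a[i] >= 50) s += a[i];  // branch: costly if unpredictable
    return s;
}

long long sum_branchless(const int *a, int n) {
    long long s = 0;
    for (int i = 0; i < n; i++)
        s += (a[i] >= 50) * a[i];   // predication: no branch at all
    return s;
}
```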
Interleaving the stages of execution is a general idea in digital electronics, and it is applied not only in the main CPU pipeline, but also on the level of separate instructions and memory. Most execution units have their own little pipelines and can take another instruction just one or two cycles…
Optimizing for latency is usually quite different from optimizing for throughput: When optimizing data structure queries or small one-time or branchy algorithms, you need to look up the latencies of their instructions, mentally construct the execution graph of the computation, and then try to…
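One common throughput technique, sketched with my own example: splitting a single dependency chain into two independent accumulators so the CPU can overlap their additions instead of waiting on each one in turn.

```
// Two independent accumulators hide the latency of the add instruction:
// the two chains don't depend on each other, so they execute in parallel.
int sum_two_acc(const int *a, int n) {
    int s0 = 0, s1 = 0;
    for (int i = 0; i + 1 < n; i += 2) {
        s0 += a[i];
        s1 += a[i + 1];
    }
    if (n % 2) s0 += a[n - 1];  // handle a possible odd element
    return s0 + s1;
}
```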
The main benefit of learning assembly language is not the ability to write programs in it, but the understanding of what is happening during the execution of compiled code and its performance implications. There are rare cases where we really need to switch to handwritten assembly for maximal…
Before jumping straight to compiler optimizations, which is what most of this chapter is about, let’s briefly recap the “big picture” first. Skipping the boring parts, there are 4 stages of turning C programs into executables: Preprocessing expands macros, pulls included source from header files,…
The first step of getting high performance from the compiler is to ask for it, which is done with over a hundred different compiler options, attributes, and pragmas.
#Optimization Levels
There are 4 and a half main levels of optimization for speed in GCC: -O0 is the default one that does no…
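One way to ask for optimization from inside the source rather than on the command line, as the excerpt mentions; the optimize attribute below is GCC-specific.

```
// GCC lets you pin an optimization level to a single function,
// overriding whatever -O level the file is compiled with.
__attribute__((optimize("O3")))
int hot_function(const int *a, int n) {  // compiled at -O3 regardless of flags
    int s = 0;
    for (int i = 0; i < n; i++)
        s += a[i];
    return s;
}
```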
Most compiler optimizations enabled by -O2 and -O3 are guaranteed to either improve or at least not seriously hurt performance. Those that aren’t included in -O3 are either not strictly standard-compliant, or highly circumstantial and require some additional input from the programmer to help decide…
In “safe” languages like Java and Rust, you normally have well-defined behavior for every possible operation and every possible input. There are some things that are under-defined, like the order of keys in a hash table or the growth factor of an std::vector, but these are usually some minor details…
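A small sketch of how undefined behavior enables optimization (example mine): signed overflow is undefined in C++, so the compiler may assume i + 1 > i always holds and treat the loop as simply counted.

```
int count_up(int n) {
    int c = 0;
    for (int i = 0; i < n; i++)  // the compiler assumes i never overflows,
        c++;                     // so the trip count is known to be max(n, 0)
    return c;                    // typically folded to `return n > 0 ? n : 0;`
}
```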
When compilers can infer that a certain variable does not depend on any user-provided data, they can compute its value during compile time and turn it into a constant by embedding it into the generated machine code. This optimization helps performance a lot, but it is not a part of the C++ standard,…
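C++ does, however, offer a standard-guaranteed form of compile-time computation via constexpr; a minimal sketch:

```
constexpr int factorial(int n) {
    return n <= 1 ? 1 : n * factorial(n - 1);
}

// Evaluated entirely at compile time; the binary just contains constants.
static_assert(factorial(10) == 3628800, "computed at compile time");
const int table_size = factorial(5);  // becomes the constant 120
```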
Staring at the source code or its assembly is a popular, but not the most effective way of finding performance issues. When the performance doesn’t meet your expectations, you can identify the root cause much faster using one of the special program analysis tools collectively called profilers. There…
Instrumentation is an overcomplicated term that means inserting timers and other tracking code into programs. The simplest example is using the time utility in Unix-like systems to measure the duration of execution for the whole program. More generally, we want to know which parts of the program…
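A minimal sketch of manual instrumentation with the standard <chrono> clock; work() here is a stand-in workload invented for the example.

```
#include <chrono>
#include <cstdio>

volatile long long sink;  // prevents the compiler from deleting the workload

void work() {             // a stand-in workload, purely for illustration
    long long s = 0;
    for (int i = 0; i < 10000000; i++)
        s += i;
    sink = s;
}

int main() {
    auto start = std::chrono::steady_clock::now();
    work();
    auto end = std::chrono::steady_clock::now();
    double ms = std::chrono::duration<double, std::milli>(end - start).count();
    printf("work() took %.3f ms\n", ms);
}
```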
Instrumentation is a rather tedious way of doing profiling, especially if you are interested in multiple small sections of the program. And even if it can be partially automated by the tooling, it still won’t help you gather some fine-grained statistics because of its inherent overhead. Another,…
The last approach to profiling (or rather a group of them) is not to gather the data by actually running the program but to analyze what should happen by simulating it with specialized tools. There are many subcategories of such profilers, differing in which aspect of computation is simulated. In…
A machine code analyzer is a program that takes a small snippet of assembly code and simulates its execution on a particular microarchitecture using information available to compilers, and outputs the latency and throughput of the whole block, as well as cycle-perfect utilization of various…
Most good software engineering practices in one way or another address the issue of making development cycles faster: you want to compile your software faster (build systems), catch bugs as soon as possible (static analysis, continuous integration), release as soon as the new version is ready…
It is not uncommon for there to be two library algorithm implementations, each maintaining its own benchmarking code, and each claiming to be faster than the other. This confuses everyone involved, especially the users, who have to somehow choose between the two. Situations like these are usually…
As we repeatedly demonstrate throughout this book, knowing darker corners of the instruction set can be very fruitful, especially in the case of CISC platforms like x86, which currently has somewhere between 1000 and 4000 distinct instructions, depending on how you count. Most of these instructions…
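One concrete instance of such a corner (my pick, not necessarily the excerpt's): population count, a single x86 instruction that would otherwise take a loop. The builtin below is GCC/Clang-specific and compiles to one popcnt with suitable -march flags.

```
#include <cstdint>
#include <cstdio>

int main() {
    uint64_t x = 0b10110111;
    printf("%d\n", __builtin_popcountll(x));  // prints 6
}
```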
The users of floating-point arithmetic deserve one of these IQ bell curve memes — because this is how the relationship between it and most people typically proceeds: Beginner programmers use it everywhere as if it were some magic unlimited-precision data type. Then they discover that 0.1 + 0.2 != 0.3…
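The classic surprise, reproduced in a few lines of standard C++: 0.1, 0.2, and 0.3 are not representable exactly in binary, so the sum picks up rounding error.

```
#include <cstdio>

int main() {
    double a = 0.1 + 0.2;
    printf("%d\n", a == 0.3);  // prints 0 (false)
    printf("%.17g\n", a);      // prints 0.30000000000000004
}
```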
When we designed our DIY floating-point type, we omitted quite a lot of important little details: How many bits do we dedicate for the mantissa and the exponent? Does a 0 sign bit mean +, or is it the other way around? How are these bits stored in memory? How do we represent 0? How exactly does rounding…
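For reference, a sketch of the answers IEEE 754 single precision gives: 1 sign bit, 8 exponent bits (biased by 127), and 23 mantissa bits, with an implicit leading 1 that is not stored. The decoding below uses memcpy to avoid undefined behavior.

```
#include <cstdint>
#include <cstdio>
#include <cstring>

int main() {
    float f = -13.25f;  // = -1.10101 (binary) * 2^3
    uint32_t bits;
    memcpy(&bits, &f, sizeof(bits));         // reinterpret the bit pattern
    unsigned sign     = bits >> 31;          // 1 means negative
    unsigned exponent = (bits >> 23) & 0xff; // biased by 127: 130 here
    unsigned mantissa = bits & 0x7fffff;     // fraction bits: 0x540000 here
    printf("sign=%u exponent=%u mantissa=0x%x\n", sign, exponent, mantissa);
}
```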
The way rounding works in hardware floats is remarkably simple: it occurs if and only if the result of the operation is not representable exactly, and by default gets rounded to the nearest representable number (in case of a tie preferring the number that ends with a zero). Consider the following…
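A small worked example of this rule, using the fact that consecutive integers above 2^24 are no longer representable in a float:

```
#include <cstdio>

int main() {
    float x = 16777216.0f;          // 2^24, exactly representable
    printf("%d\n", x + 1.0f == x);  // prints 1: 2^24 + 1 is a tie between
                                    // 2^24 and 2^24 + 2, and the tie goes to
                                    // the mantissa ending in zero, i.e. 2^24
    printf("%d\n", x + 2.0f == x);  // prints 0: 2^24 + 2 is representable
}
```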
Reaching the maximum possible precision is rarely required from a practical algorithm. In real-world data, modeling and measurement errors are usually several orders of magnitude larger than the errors that come from rounding floating-point numbers and such, and we are often perfectly happy with…