When the Python Interpreter Is the Bottleneck

Fri, 08 Oct 2021 14:00:00 +0800

Consider the logistic map, a one-line recurrence, iterated 10 million times. The computation does no I/O and spends almost all its time executing a handful of floating-point operations in a Python loop. Rewriting that loop in C++ can produce a large speedup. Most Python performance problems are not this clean.

Performance-sensitive Python already delegates much of its work to compiled code. NumPy, image codecs, and compression libraries do not execute their inner loops as Python bytecode. Interpreter overhead matters when the hot computation remains in Python and no existing library can absorb it. In that case, cppimport can compile a small C++ extension as part of the import process. First, though, the profiler has to show that the loop is the problem.

Hybrid Programming on Boyang Yue

When the Python Interpreter Is the Bottleneck