Coders Bring Change

Despite a lot of talk about the removal of the GIL in Python, it is actually still the default execution mechanism in Python. But what is it and what does it do?

from concurrent.futures import ThreadPoolExecutor

def work1():
    total = 0  # local total, not shared with work2
    for _ in range(100):
        total -= 1  # new total is made, ref-count update
        print(total)

def work2():
    total = 0  # local total, not shared with work1
    for _ in range(100):
        total += 1  # new total is made, ref-count update
        print(total)

with ThreadPoolExecutor() as executor:
    executor.submit(work1)
    executor.submit(work2)

From looking at this code, you might expect the Python Interpreter to start and run two process threads and the output suggests it does:

But actually, the threads run in a single process, giving control to each other at regular intervals. This works somewhat like asyncio but with two important differences:

CPython automatically locks and releases the GIL when a thread executes Python bytecode. In contrast, asyncio runs on a single thread, so there is no need to explicitly lock or release the GIL.
Thread preemption: a thread can be interrupted by the scheduler to run another thread (preemptive multitasking). Async functions, on the other hand, yield control cooperatively back to the scheduler.

Since the interpreter must lock the GIL when running threads, using multiple threads for CPU-bound Python code does not achieve true parallelism, and might even hurt performance due to the overhead of task switching.

Why must the GIL be locked when running threads?

The GIL protects the interpreter, not your code. When bytecode runs in two threads simultaneously:

Both threads might manipulate global or local variables.
Both threads update Python object reference counts, which are shared across threads.

Without the GIL, memory corruption could occur and memory safety cannot be guaranteed.

The golden rule

If Python bytecode is executing, the GIL must be held. But there are exceptions.

When will the GIL be released immediately?

Threads can release the GIL during blocking I/O operations to allow another thread to run simultaneously. Examples include:

sleep
File I/O
Networking operations

What will change in the future?

When the GIL is removed or made optional, Python will achieve true parallelism for CPU-bound code. To make this possible:

Reference counting must be atomic or implemented using lock-free strategies.
Interpreter state must be protected without relying on a global lock.

Compatibility considerations

Some C extensions may need updates to safely release or acquire locks.
Existing code that relies implicitly on the GIL may need modifications or continue to use the legacy GIL.
Will the GIL be used by default or optional? This is still under discussion, but it may remain enabled by default for compatibility, with an option to disable it for full parallelism.

Just when I thought I was out...

Why must the GIL be locked when running threads?

The golden rule

When will the GIL be released immediately?

What will change in the future?

Compatibility considerations