Fibonacci Heaps — CLRS Chapter 19

Chapter 0: The Problem

You are running Dijkstra's algorithm on a road network — 10 million intersections, 30 million road segments. At each step, the algorithm does two things: extract the nearest unvisited node (EXTRACT-MIN), and for each outgoing edge, possibly lower the tentative distance to a neighbor (DECREASE-KEY).

With a binary min-heap, EXTRACT-MIN costs O(log n) and DECREASE-KEY costs O(log n). Dijkstra performs V extract-min operations and up to E decrease-key operations. The total cost is O((V + E) log V).

For a road network with V = 10 million and E = 30 million, that is roughly 40 million × 23 ≈ 920 million operations. Can we do better?

Where the Bottleneck Hides

Look at the two operations separately. Extract-min happens V times. Decrease-key happens up to E times. For dense graphs where E is much larger than V, decrease-key dominates. A road network has E ≈ 3V (each intersection connects to about 3 roads). But a social network might have E ≈ V². In that regime, the E log V term dwarfs V log V.

What if decrease-key were O(1) instead of O(log n)? Then Dijkstra's total cost would be O(V log V + E). For the road network: 10M × 23 + 30M = 260 million operations instead of 920 million. For a dense graph with E = V²: the savings are enormous — O(V²) instead of O(V² log V).

The core idea. A Fibonacci heap achieves O(1) amortized decrease-key by being lazy. It defers cleanup work — letting the structure get messy — and only pays for it when extract-min forces a consolidation. The key insight: individual decrease-key operations are cheap because they just cut a node and dump it in the root list. The mess gets cleaned up later, amortized over many operations.

Think of it like a messy desk versus a neat desk. A neat-desk person (binary heap) files every paper immediately — each filing takes effort. A messy-desk person (Fibonacci heap) just tosses papers on the desk surface (root list) and only organizes when they need to find something specific (extract-min). If you mostly need to toss papers (decrease-key) and rarely need to find the minimum, the messy approach wins.

Dijkstra Operation Counts

Compare binary heap vs Fibonacci heap for Dijkstra on graphs of varying density. Drag the slider to change E/V ratio.

E/V ratio 10

The table below shows the asymptotic comparison. Every entry in the Fibonacci heap column is at least as good as the binary heap, and decrease-key is dramatically better.

Operation	Binary Heap	Fibonacci Heap
INSERT	O(log n)	O(1)
MINIMUM	O(1)	O(1)
EXTRACT-MIN	O(log n)	O(log n) amortized
UNION	O(n)	O(1)
DECREASE-KEY	O(log n)	O(1) amortized
DELETE	O(log n)	O(log n) amortized

Hand Calculation: The Savings

Let us make this concrete. Suppose V = 1000 and E = 50,000 (a moderately dense graph with average degree 50).

Binary heap Dijkstra: (V + E) log V = (1000 + 50000) × log₂(1000) = 51,000 × 10 = 510,000 operations.

Fibonacci heap Dijkstra: V log V + E = 1000 × 10 + 50,000 = 60,000 operations.

That is an 8.5x improvement. And the gap widens as E/V grows. For E = V² = 1,000,000: binary heap gives 1,001,000 × 10 = 10,010,000 while Fibonacci heap gives 10,000 + 1,000,000 = 1,010,000. A 10x improvement.

The break-even point. Fibonacci heaps have larger constant factors than binary heaps (more pointer manipulation, worse cache behavior). In practice, the crossover point where Fibonacci heaps actually become faster is around E/V > 1000 for most implementations. For sparse graphs (like road networks with E ≈ 3V), binary heaps often win in practice despite worse asymptotics.

Concept check: You are running Dijkstra on a complete graph with V = 1000 vertices (E = V(V-1)/2 ≈ 500,000 edges). Which heap gives a better asymptotic bound?

Binary heap: O((V+E) log V) = O(500,000 × 10) = O(5,000,000) Fibonacci heap: O(V log V + E) = O(10,000 + 500,000) = O(510,000) Both are equivalent on dense graphs

Why answer B? For dense graphs where E dominates V log V, the Fibonacci heap saves a factor of log V on the dominant term. The E term drops from E log V to just E, which is the entire point of the O(1) amortized decrease-key.

The Road Ahead

Ch 1: Structure

A lazy collection of min-heap-ordered trees with a circular root list.

↓

Ch 2: Mergeable Ops

INSERT, MINIMUM, UNION — all O(1) because they do almost nothing.

↓

Ch 3: EXTRACT-MIN

Remove the minimum and consolidate. The one expensive operation.

↓

Ch 4: DECREASE-KEY

Cut, cascade, and the marking trick. O(1) amortized.

↓

Ch 5-6: DELETE & Analysis

The potential function that makes it all work.

↓

Ch 7-9: Degree Bound, Practice, Connections

Why "Fibonacci," practical comparisons, and interview patterns.

Chapter 1: Structure

Forget everything you know about binary heaps. A binary heap is a single, perfectly balanced tree stored in an array. A Fibonacci heap is nothing like that. It is a collection of trees — a messy forest where trees can have any shape, any size, and there can be any number of them.

The Forest of Trees

A Fibonacci heap is a collection of min-heap-ordered trees. "Min-heap-ordered" means that in each tree, every node's key is less than or equal to the keys of its children. But unlike a binary heap, these trees are not necessarily binary — a node can have 0, 1, 2, 3, or more children.

All the roots of these trees are linked together in a circular doubly-linked list called the root list. The heap maintains a single pointer called min that points to the root with the smallest key in the entire heap.

Why circular doubly-linked lists? Because they allow O(1) concatenation. To merge two circular lists, you just splice them: rewire two pointers from each list and you are done. No traversal needed. This is what makes UNION an O(1) operation. A singly-linked list would require finding the tail, which costs O(n). An array would require copying.

Node Structure

Each node in a Fibonacci heap stores more information than a binary heap node:

python
class FibNode:
    def __init__(self, key):
        self.key = key       # the priority value
        self.degree = 0     # number of children
        self.mark = False    # has this node lost a child since
                              # it became a child of its current parent?
        self.parent = None   # pointer to parent
        self.child = None    # pointer to one child (any child)
        self.left = self      # left sibling (circular)
        self.right = self     # right sibling (circular)

The degree field counts how many children a node has. The mark field is the secret weapon — it tracks whether a node has lost a child since it was last made a child of another node. This mark drives the cascading-cut mechanism that keeps the trees from getting too tall. We will cover it in Chapter 4.

The left and right pointers form circular doubly-linked lists at every level. A node's children are linked in a circular list, and the roots are linked in a circular list. The child pointer points to any one of the node's children — you can then traverse the child's left/right pointers to visit all siblings.

The Heap Object

python
class FibHeap:
    def __init__(self):
        self.min = None   # pointer to the minimum root
        self.n = 0        # total number of nodes

That is it. The heap object is just two fields: a pointer to the minimum and a count of total nodes. The entire structure is held together by pointer links between nodes.

A Visual Example

Suppose we have a Fibonacci heap with 14 nodes. It might look like this: three trees in the root list with min pointing to the smallest root. One tree has degree 3 (three children), another has degree 1, and the third is a single node with degree 0.

Fibonacci Heap Structure

A Fibonacci heap with multiple trees. The min pointer always points to the root with the smallest key. Roots are connected in a circular doubly-linked list (shown as curved arrows).

Key Properties

Let us be precise about what a Fibonacci heap guarantees:

1. Min-heap order within each tree. For every non-root node x, x.key ≥ x.parent.key. The minimum of the entire heap is always one of the roots.

2. Roots linked in a circular doubly-linked list. We can splice, insert, and remove from this list in O(1).

3. Children linked in circular doubly-linked lists. Same O(1) splicing for child lists.

4. The min pointer always accurate. H.min points to the root with the smallest key.

What is NOT guaranteed: the number of trees, the shape of trees, the degree of any node, or any balance condition. The structure is deliberately loose — that looseness is what makes insert, union, and decrease-key O(1).

Lazy is not sloppy. The Fibonacci heap is lazy by design, not by accident. It defers work (keeping many small trees, not consolidating after every operation) because that deferred work can later be done in batches during extract-min. This batching is where the amortized savings come from. A binary heap is "eager" — it fixes the structure after every operation. The Fibonacci heap says: "Why fix it now if nobody is asking for the minimum?"

Comparing with Binary Heaps

Property	Binary Heap	Fibonacci Heap
Storage	Contiguous array	Linked nodes (scattered memory)
Shape	One complete binary tree	Forest of arbitrary trees
Navigation	Index arithmetic (2i, 2i+1)	Pointer traversal
Root list	Just A[1]	Circular doubly-linked list
Node degree	Always 0, 1, or 2	Any non-negative integer
Balance	Perfectly balanced (complete)	No balance guarantee
Mark field	Not needed	Critical for cascading cuts

Concept check: In a Fibonacci heap with 100 nodes and 20 trees in the root list, how long does it take to find the minimum element?

O(1) — the min pointer always points to it O(20) — we must scan all 20 roots O(log 100) — just like a binary heap

Why O(1)? The min pointer is updated on every insert and union. It always points to the root with the smallest key. You never need to scan the root list to find the minimum — that is what the pointer is for.

Chapter 2: Mergeable Heap Operations

The first three operations on a Fibonacci heap — INSERT, MINIMUM, and UNION — are almost comically simple. They do the absolute minimum amount of work and defer everything else. This is the essence of the Fibonacci heap philosophy: be lazy now, pay later.

INSERT

FIB-HEAP-INSERT(H, x): Create a new node x as a single-node tree. Add it to the root list. If x.key < H.min.key, update H.min to x. Increment H.n.

python
def insert(self, key):
    node = FibNode(key)
    if self.min is None:
        # empty heap — node becomes the only root
        self.min = node
    else:
        # splice node into root list next to min
        self._add_to_root_list(node)
        if node.key < self.min.key:
            self.min = node
    self.n += 1
    return node  # return handle for decrease-key

That is O(1). No sifting, no rebalancing, no comparisons except the single comparison with the current minimum. The new node just gets tossed onto the root list.

Notice that insert returns the node. This is important — to call decrease-key later, the caller must have a direct pointer to the node. Without this handle, you would need O(n) to find the node first, defeating the purpose of O(1) decrease-key.

MINIMUM

FIB-HEAP-MINIMUM(H): Return H.min.key.

python
def minimum(self):
    return self.min.key  # O(1) — just follow the pointer

Nothing to explain. The pointer is always maintained. O(1).

UNION

FIB-HEAP-UNION(H1, H2): Concatenate the root lists of H1 and H2. Update min to whichever is smaller. Set n = H1.n + H2.n.

python
def union(h1, h2):
    h = FibHeap()
    h.min = h1.min
    # concatenate root lists — splice two circular lists
    if h1.min and h2.min:
        h1_right = h1.min.right
        h2_left = h2.min.left
        h1.min.right = h2.min
        h2.min.left = h1.min
        h1_right.left = h2_left   # was h2_left
        h2_left.right = h1_right
    if h2.min and (h1.min is None or h2.min.key < h1.min.key):
        h.min = h2.min
    h.n = h1.n + h2.n
    return h

This is O(1). Four pointer rewires to splice two circular lists. One comparison to update min. Done.

Contrast with binary heap union. Merging two binary heaps of size n each requires building a new heap from all 2n elements. The fastest way is to concatenate the arrays and run BUILD-HEAP, which takes O(n). There is no way to do it faster with a binary heap because the array structure forces a specific shape. Fibonacci heaps dodge this by having no shape constraint at all — two forests can be merged by just linking their root lists.

Circular List Splicing

The circular list splice is the fundamental operation. Let us trace through it carefully, because it appears everywhere in Fibonacci heap code.

Suppose list A has roots [a1 ↔ a2 ↔ a3] (circular, so a3.right = a1) and list B has roots [b1 ↔ b2]. To splice B into A after a1:

Before: a1 ↔ a2 ↔ a3 ↔ (back to a1) b1 ↔ b2 ↔ (back to b1)

Step 1: Save a1.right (= a2) and b2 (= b1.left)
Step 2: a1.right = b1, b1.left = a1
Step 3: b2.right = a2, a2.left = b2

After: a1 ↔ b1 ↔ b2 ↔ a2 ↔ a3 ↔ (back to a1)

Four pointer changes. Constant time regardless of list sizes. This is why everything that only involves list manipulation is O(1).

Insert & Union

Click Insert to add a random node. Click Union to merge a second heap. Watch the root list grow — no consolidation happens yet.

Hand Calculation: 5 Inserts

Let us trace 5 inserts into an empty Fibonacci heap: INSERT(7), INSERT(3), INSERT(11), INSERT(1), INSERT(9).

After INSERT(7): Root list = [7]. min = 7. n = 1.

After INSERT(3): Root list = [7, 3]. min = 3 (3 < 7). n = 2.

After INSERT(11): Root list = [7, 3, 11]. min = 3 (3 < 11). n = 3.

After INSERT(1): Root list = [7, 3, 11, 1]. min = 1 (1 < 3). n = 4.

After INSERT(9): Root list = [7, 3, 11, 1, 9]. min = 1 (1 < 9). n = 5.

Five inserts produced five single-node trees in the root list. No tree has any children. The minimum is correctly 1. Total work: 5 comparisons. That is it.

The mess grows. After k inserts, we have k trees in the root list. This is fine for insert, minimum, and union — but it would make extract-min expensive if we had to scan all k roots. That is why extract-min includes a consolidation step (Chapter 3) that merges roots until each degree is unique. The mess is allowed to grow during inserts because it will be cleaned up later.

Concept check: After inserting keys 15, 8, 23, 4, 42 into an empty Fibonacci heap, how many trees are in the root list and what is H.min?

1 tree (they get merged), min = 4 5 trees (no merging), min = 4 3 trees (some get merged), min = 4

Chapter 3: EXTRACT-MIN

Here is where the Fibonacci heap pays for its laziness. EXTRACT-MIN is the one operation that actually does significant work — it removes the minimum, promotes its children to roots, and then consolidates the root list so that no two roots have the same degree.

The Algorithm

FIB-HEAP-EXTRACT-MIN(H):

Step 1. Take the min node z = H.min. Add all of z's children to the root list (remove their parent pointers). This takes O(degree(z)) time.

Step 2. Remove z from the root list.

Step 3. If z was the only node (z.right == z and z had no children), set H.min = NIL.

Step 4. Otherwise, set H.min = z.right (temporarily — this might not be the actual minimum). Then call CONSOLIDATE.

Step 5. Decrement H.n. Return z.

CONSOLIDATE — The Heart of Extract-Min

CONSOLIDATE is where the real work happens. The goal: merge trees in the root list until no two roots have the same degree. This is like radix bucketing — we use an array A[0 .. D(n)] indexed by degree.

python
def _consolidate(self):
    # max possible degree: floor(log_phi(n))
    max_degree = int(math.log(self.n, (1+math.sqrt(5))/2)) + 1
    A = [None] * (max_degree + 1)

    # collect all roots into a list (can't modify while iterating circular list)
    roots = []
    x = self.min
    if x:
        curr = x
        while True:
            roots.append(curr)
            curr = curr.right
            if curr == x: break

    for w in roots:
        x = w
        d = x.degree
        while A[d] is not None:
            y = A[d]          # another root with same degree
            if x.key > y.key:
                x, y = y, x  # x is the smaller one
            self._link(y, x)   # make y a child of x
            A[d] = None
            d += 1             # x now has one more child
        A[d] = x

    # rebuild root list from A
    self.min = None
    for node in A:
        if node is not None:
            node.left = node.right = node
            if self.min is None:
                self.min = node
            else:
                self._add_to_root_list(node)
                if node.key < self.min.key:
                    self.min = node

The LINK Operation

FIB-HEAP-LINK(y, x): Make y a child of x. Remove y from the root list. Add y to x's child list. Increment x.degree. Unmark y (y.mark = False).

python
def _link(self, y, x):
    # remove y from root list
    y.left.right = y.right
    y.right.left = y.left
    # make y a child of x
    y.parent = x
    if x.child is None:
        x.child = y
        y.left = y.right = y
    else:
        # splice y into x's child list
        y.right = x.child.right
        y.left = x.child
        x.child.right.left = y
        x.child.right = y
    x.degree += 1
    y.mark = False

Worked Example: Extract-Min with Consolidation

Suppose we have a Fibonacci heap with 7 nodes all in the root list (after 7 inserts, no extractions): keys [3, 7, 11, 1, 9, 5, 13]. H.min points to 1.

Step 1: Remove 1 from root list. Node 1 has no children (it was a single-node tree). Root list is now [3, 7, 11, 9, 5, 13] — six degree-0 trees.

Step 2: Consolidate. We walk through the roots. Array A is indexed by degree.

Process 3 (deg 0): A[0] = empty → A[0] = 3
Process 7 (deg 0): A[0] = 3, conflict! Link: 3 < 7, so 7 becomes child of 3.
  Now 3 has deg 1. A[0] = empty. A[1] = empty → A[1] = 3(7)
Process 11 (deg 0): A[0] = empty → A[0] = 11
Process 9 (deg 0): A[0] = 11, conflict! Link: 9 < 11, so 11 becomes child of 9.
  Now 9 has deg 1. A[1] = 3(7), conflict! Link: 3 < 9, so 9(11) becomes child of 3.
  Now 3 has deg 2. A[1] = empty. A[2] = empty → A[2] = 3(7,9(11))
Process 5 (deg 0): A[0] = empty → A[0] = 5
Process 13 (deg 0): A[0] = 5, conflict! Link: 5 < 13, so 13 becomes child of 5.
  Now 5 has deg 1. A[1] = empty → A[1] = 5(13)

Final: A[1] = 5(13), A[2] = 3(7, 9(11))
Root list: [3, 5]. Two trees, different degrees. min = 3.

We started with 6 single-node trees and ended with 2 trees after consolidation. The degree array ensured no two roots share a degree.

EXTRACT-MIN with Consolidation

Click "Insert" several times to build up the root list, then click "Extract-Min" to see consolidation in action. Step through to watch trees merge by degree.

Amortized Cost of Extract-Min

The actual cost of extract-min is O(D(n) + t) where D(n) is the maximum degree of any node (we will prove D(n) = O(log n) in Chapter 7) and t is the number of trees in the root list before consolidation. After consolidation, at most D(n) + 1 trees remain (one per degree). The amortized cost is O(D(n)) = O(log n) — the potential drop from reducing t trees to at most D(n) + 1 trees pays for the extra work.

Concept check: During CONSOLIDATE, two roots both have degree 2. What happens?

The one with the smaller key becomes the parent; the other becomes its child. The parent's degree becomes 3. They are concatenated side by side with a pointer linking them. The one with larger key is removed from the heap.

Why linking increases degree by exactly 1. When we link y under x, y becomes one additional child of x. x already had d children (degree d), now it has d+1. This is why the while loop in consolidate increments d after each link — we need to check A[d+1] for a conflict too.

Chapter 4: DECREASE-KEY

This is the star of the show. DECREASE-KEY is the operation that makes Fibonacci heaps worth the complexity — O(1) amortized time to lower a node's key. It achieves this through a mechanism called cascading cuts.

The Basic Idea

When we decrease a node's key, the min-heap property might be violated — the node might now be smaller than its parent. In a binary heap, we would "bubble up" the node, which costs O(log n) in the worst case. A Fibonacci heap does something more aggressive: it cuts the node from its parent and moves it to the root list. This is O(1).

But there is a catch. If we keep cutting children from the same parent, that parent's tree becomes too sparse — it has a high degree but few descendants. This would break the degree bound we need for extract-min to be efficient. The solution: the mark mechanism and cascading cuts.

The Mark Mechanism

Every non-root node has a boolean mark field. The rules are simple:

1. When a node is first made a child of another node (during consolidation or linking), it is unmarked.

2. When a node loses its first child (due to a cut), it gets marked. This is a warning: "I have already lost one child."

3. When a marked node loses a second child, we cut it from its parent too (and unmark it). This is the cascading cut.

The analogy. Think of marks as "one strike" warnings. Every non-root node gets one free pass — it can lose one child without consequence (but it gets marked). If it loses a second child, that is strike two: it gets cut from its own parent and moved to the root list. This prevents any node from losing too many children, which keeps trees from becoming too sparse.

The Algorithm

python
def decrease_key(self, x, new_key):
    assert new_key <= x.key, "new key must be smaller"
    x.key = new_key
    y = x.parent
    if y is not None and x.key < y.key:
        self._cut(x, y)
        self._cascading_cut(y)
    if x.key < self.min.key:
        self.min = x

def _cut(self, x, y):
    # remove x from y's child list
    if x.right == x:
        y.child = None
    else:
        x.left.right = x.right
        x.right.left = x.left
        if y.child == x:
            y.child = x.right
    y.degree -= 1
    # add x to root list
    self._add_to_root_list(x)
    x.parent = None
    x.mark = False

def _cascading_cut(self, y):
    z = y.parent
    if z is not None:     # y is not a root
        if not y.mark:
            y.mark = True    # first child lost — just mark
        else:
            self._cut(y, z) # second child lost — cut!
            self._cascading_cut(z) # check grandparent

Worked Example: Decrease-Key with Cascading Cut

Consider a tree rooted at 3 with the following structure (marks shown as *):

3
/ | \
7 9* 15
/ /
18 11

Node 9 is marked (it lost a child previously). Now we call DECREASE-KEY(18, 2).

Step 1: Set 18.key = 2. Parent is 7. 2 < 7, so we need to cut.

Step 2: CUT(18, 7). Remove 18 from 7's child list. Add 18 to root list. Unmark 18 (it was not marked anyway). 7's degree drops from 1 to 0.

Step 3: CASCADING-CUT(7). 7's parent is 3. Is 7 marked? No. So just mark 7. Done.

Step 4: 2 < 3 (current min), so update H.min = 2.

Result: Root list: ... ↔ 3 ↔ ... ↔ 2 ↔ ...

Tree rooted at 3: Node 2 (new root):
3 2
/ | \
7* 9* 15
/
11

Now suppose we call DECREASE-KEY(11, 1).

Step 1: Set 11.key = 1. Parent is 9. 1 < 9, so cut.

Step 2: CUT(11, 9). Remove 11 from 9's child list. Add 11 to root list. 9's degree drops to 0.

Step 3: CASCADING-CUT(9). 9's parent is 3. Is 9 marked? YES. So cut 9 from 3!

Step 3a: CUT(9, 3). Remove 9 from 3's child list. Add 9 to root list. Unmark 9. 3's degree drops.

Step 3b: CASCADING-CUT(3). 3 is a root (no parent). Stop.

Result: Root list: ... ↔ 3 ↔ ... ↔ 2 ↔ 9 ↔ 1 ↔ ...

Tree rooted at 3: min = 1
3
/ \
7* 15

The cascading cut propagated up one level because 9 was marked. If 3 had also been marked, the cascade would have continued further. In the worst case, a cascade can climb all the way to the root — but the amortized cost is still O(1) because each cascading cut decreases the number of marked nodes.

DECREASE-KEY with Cascading Cuts

Build a tree by inserting and extracting, then click a node and decrease its key. Watch cuts and cascading cuts happen. Marked nodes shown with a red ring.

Click a node to select it, then decrease its key.

Why Cascading Cuts are Necessary

Without cascading cuts, a node of degree k could have had arbitrarily many children cut away. It would look like it should have a big subtree (high degree) but actually have very few descendants. This breaks the degree bound D(n) = O(log n) that we need for extract-min to be efficient.

The cascading cut ensures that no non-root node can lose more than one child. If a node has degree k, it still has at least k-1 of its original children (one may have been cut, leaving it marked). This keeps the subtree sizes large enough to maintain the logarithmic degree bound.

The invariant. Cascading cuts maintain this invariant: a node of degree k has at least F_k+2 descendants (where F is the Fibonacci sequence). This is why the data structure is called a Fibonacci heap — the Fibonacci numbers appear in the minimum subtree size bounds. We will prove this in Chapter 7.

Concept check: A non-root node x is marked (x.mark = True). We decrease the key of one of x's children, causing a cut. What happens to x?

x gets marked (it already was, so nothing changes) x gets cut from its parent and moved to the root list (cascading cut), then unmarked x's key is decreased to match its child's new key

Chapter 5: DELETE

DELETE is the simplest operation to describe, because it is just a combination of the two operations we already know.

The Algorithm

FIB-HEAP-DELETE(H, x): Decrease x's key to negative infinity, then extract the minimum. Since x now has the smallest possible key, it will be the minimum that gets extracted.

python
def delete(self, x):
    self.decrease_key(x, float('-inf'))
    self.extract_min()

Two lines. That is the entire implementation.

Why This Works

Step 1: decrease_key(x, -infinity). This cuts x from its parent (if it has one) and adds it to the root list. The cascading cut runs on x's parent. Since -infinity is smaller than any other key, H.min is updated to point to x.

Step 2: extract_min(). Since x is now the minimum, extract_min removes x, promotes its children to roots, and consolidates. All in O(log n) amortized.

Amortized Cost

DECREASE-KEY is O(1) amortized. EXTRACT-MIN is O(log n) amortized. Therefore DELETE is O(log n) amortized. This matches the binary heap's O(log n) delete (which is also typically implemented as decrease-key followed by extract).

Worked Example

Consider a Fibonacci heap where we want to delete node 15:

Root list: [3, 8]

3 8
/ | \ / \
5 7 12 10 15*
/ | / \
9 14 17 20

Step 1: decrease_key(15, -∞). 15 is marked (shown as *). Its key becomes -∞. -∞ < 8 (parent), so CUT(15, 8). Add 15 (now -∞) to root list. 8 was not marked, so mark 8. H.min = -∞ node.

Root list: [3, 8*, -∞]

3 8* -∞
/ | \ / / \
5 7 12 10 17 20
/ |
9 14

Step 2: extract_min(). Remove -∞ node. Its children (17, 20) become roots. Consolidate the root list [3, 8, 17, 20]. The result depends on degrees during consolidation, but the -∞ node is gone and the heap is valid.

DELETE Operation

Build a heap, then select a node and delete it. Watch the two-phase process: decrease-key to -infinity, then extract-min.

Click a node to select, then delete it.

All Operations Summary

Operation	Worst Case	Amortized
INSERT	O(1)	O(1)
MINIMUM	O(1)	O(1)
UNION	O(1)	O(1)
EXTRACT-MIN	O(n)	O(log n)
DECREASE-KEY	O(n)	O(1)
DELETE	O(n)	O(log n)

Notice that the worst-case for extract-min and decrease-key is O(n) — if all n nodes are in the root list, consolidation touches all of them. The amortized bounds are what matter: over a sequence of operations, the average cost per operation matches the amortized column.

Concept check: Why can't we implement DELETE as just "remove the node and rewire pointers" in O(1)?

Because the node's children need to become roots, and the root list may need consolidation to maintain the degree bound Because we cannot find the node in O(1) Because the circular list does not support deletion

The pointer concern. We already have a direct pointer to the node (returned by insert), so finding it is free. The issue is structural: if we just remove the node, its children become orphaned. We must place them somewhere, and if we put them in the root list, we may accumulate too many roots (hurting future extract-min). The decrease-key + extract-min approach handles this cleanly through consolidation.

Chapter 6: Amortized Analysis

We have claimed that insert is O(1) amortized, extract-min is O(log n) amortized, and decrease-key is O(1) amortized. Now we prove it. The tool is the potential method.

The Potential Function

Define the potential of a Fibonacci heap H as:

Φ(H) = t(H) + 2 · m(H)

where t(H) is the number of trees in the root list and m(H) is the number of marked nodes in the entire heap.

The amortized cost of an operation is: amortized = actual + ΔΦ = actual + (Φ_after - Φ_before).

If an operation increases the potential, it "stores credit" for future operations. If it decreases the potential, it "spends saved credit." The key insight: operations that seem expensive (long cascading cuts) actually decrease the potential by so much that their amortized cost is cheap.

INSERT Analysis

Actual cost: O(1) — add a node to the root list, maybe update min.

Potential change: We add one tree to the root list, and the new node is not marked. So ΔΦ = +1 + 2(0) = +1.

Amortized cost: O(1) + 1 = O(1).

Sanity check. Insert stores 1 unit of potential with the new tree. This credit will pay for that tree being processed during a future consolidation.

EXTRACT-MIN Analysis

Actual cost: O(D(n) + t) where D(n) is the max degree and t is the number of trees before consolidation. We add D(n) children to the root list and process t + D(n) trees during consolidation.

Potential change: Before: t trees. After consolidation: at most D(n) + 1 trees (one per degree). No nodes get marked or unmarked in this process. So ΔΦ = (D(n) + 1) - t.

Amortized cost: O(D(n) + t) + (D(n) + 1 - t) = O(D(n) + t) + D(n) + 1 - t = O(D(n)) because the +t actual and -t potential cancel. Since D(n) = O(log n), the amortized cost is O(log n).

The beautiful cancellation. The root list might have 1000 trees before consolidation. The actual work is O(1000). But the potential drops by roughly 1000 (from t trees down to D(n) + 1). So the amortized cost is only O(D(n)). All those trees were "pre-paid" by the inserts that created them — each insert deposited 1 unit of potential. Extract-min spends it.

DECREASE-KEY Analysis

Suppose decrease-key triggers c cascading cuts (including the initial cut of the node itself).

Actual cost: O(c) — one cut operation per cascade level.

Potential change:

Trees: we add c new trees to the root list (each cut creates a new root). Δt = +c.

Marks: the initial cut unmarks the node (but it might not have been marked). Each cascading cut unmarks one marked node. The final step either marks one previously unmarked node or does nothing (if we reached a root). In the worst case, c-1 marked nodes get cut and unmarked, and 1 node gets newly marked. Δm ≤ -(c - 1) + 1 = -(c - 2).

ΔΦ = c + 2(-(c - 2)) = c - 2c + 4 = -c + 4.

Amortized cost: O(c) + (-c + 4) = O(1). The c actual work is exactly cancelled by the c potential drop.

Why the factor of 2 in the potential. The coefficient 2 on the mark count is not arbitrary. Each cascading cut reduces marks by 1 but increases trees by 1. With coefficient 2 on marks, each cascade step contributes -2 + 1 = -1 to the potential change, exactly offsetting the O(1) actual cost of one cut. With coefficient 1, the cancellation would fail. With coefficient 3, we would be "overpaying." The factor 2 is the exact equilibrium.

The Big Interactive Simulation

Interactive Fibonacci Heap with Potential Tracking

Build a Fibonacci heap with inserts and extract-mins. The potential function Φ = t + 2m updates in real time. The bar chart shows actual vs amortized cost for each operation.

Summary Table

Operation	Actual	ΔΦ	Amortized
INSERT	O(1)	+1	O(1)
EXTRACT-MIN	O(D(n) + t)	D(n) + 1 - t	O(D(n))
DECREASE-KEY	O(c)	4 - c	O(1)
DELETE	O(D(n) + c)	D(n) + 5 - t - c	O(D(n))

Concept check: We insert 100 nodes into an empty Fibonacci heap (no extract-mins). What is the potential Φ?

Φ = 100 (100 trees, 0 marks) Φ = 200 (100 trees, 100 marks) Φ = 0 (no extract-min has happened)

100 inserts, no extractions. Each insert creates one tree in the root list. No nodes are marked (marks only happen during cascading cuts, which only happen during decrease-key on non-root nodes, which cannot exist if we have only done inserts). So t = 100, m = 0, Φ = 100 + 0 = 100.

Chapter 7: The Maximum Degree Bound

The entire amortized analysis depends on D(n) = O(log n). But why is this true? Why can't a root node accumulate an enormous number of children?

The Key Lemma

Let x be any node in a Fibonacci heap, and let x have degree k. Let y₁, y₂, ..., y_k be the children of x in the order they were linked (oldest first). Then:

degree(y_i) ≥ i - 2 for i = 1, 2, ..., k

Why? When y_i was linked to x, x already had children y₁, ..., y_i-1, so x had degree at least i-1. Since CONSOLIDATE only links trees of equal degree, y_i also had degree at least i-1 at the time it was linked. Since then, y_i could have lost at most one child (if it lost two, cascading cut would have removed it). So y_i.degree ≥ (i-1) - 1 = i - 2.

Fibonacci Numbers Appear

Let size(x) denote the number of nodes in the subtree rooted at x (including x). We want to show that size(x) grows exponentially with degree(x).

Let s_k be the minimum possible size of a subtree rooted at a node of degree k. From the lemma above, a degree-k node has children with degrees at least 0, 0, 1, 2, 3, ..., k-2. So:

s_k ≥ 2 + ∑_i=0^k-2 s_i

The 2 accounts for the node itself and the first child (degree ≥ 0, so at least 1 node). The sum accounts for children 2 through k.

Computing the first few values:

s₀ = 1
s₁ = 2
s₂ = 2 + s₀ = 2 + 1 = 3
s₃ = 2 + s₀ + s₁ = 2 + 1 + 2 = 5
s₄ = 2 + s₀ + s₁ + s₂ = 2 + 1 + 2 + 3 = 8
s₅ = 2 + s₀ + s₁ + s₂ + s₃ = 2 + 1 + 2 + 3 + 5 = 13

The sequence 1, 2, 3, 5, 8, 13, ... These are the Fibonacci numbers! Specifically, s_k = F_k+2 where F is the standard Fibonacci sequence (F₀ = 0, F₁ = 1, F₂ = 1, F₃ = 2, F₄ = 3, F₅ = 5, ...). One can prove by induction that s_k ≥ F_k+2.

This is why it is called a Fibonacci heap! The minimum number of nodes in a degree-k subtree follows the Fibonacci sequence. The Fibonacci numbers are not just a cute coincidence — they are the structural constraint that makes the entire data structure work. The cascading-cut mechanism (lose at most one child) is precisely what forces the Fibonacci lower bound on subtree sizes.

The Golden Ratio Connection

The Fibonacci numbers satisfy F_k ≥ φ^k-2 where φ = (1 + √5) / 2 ≈ 1.618 is the golden ratio.

Since size(x) ≥ s_k ≥ F_k+2 ≥ φ^k, and size(x) ≤ n, we get:

φ^k ≤ n ⇒ k ≤ log_φ(n) = ln(n) / ln(φ) ≈ 1.4404 · log₂(n)

Therefore the maximum degree D(n) ≤ ⌊log_φ(n)⌋. For n = 1,000,000: D(n) ≤ log_φ(1,000,000) ≈ 1.4404 × 20 ≈ 29. Compare with log₂(1,000,000) ≈ 20 for a binary heap. The Fibonacci heap's max degree is about 44% higher, but still O(log n).

Degree Bound vs Heap Size

Drag the slider to see how the maximum degree D(n) grows with n. Compare log_φ(n) with log₂(n).

n 1000

The Minimum Tree Sizes

Degree k	Min nodes s_k	Fibonacci F_k+2	φ^k
0	1	1	1.00
1	2	2	1.62
2	3	3	2.62
3	5	5	4.24
4	8	8	6.85
5	13	13	11.09
6	21	21	17.94
7	34	34	29.03
8	55	55	46.98

Notice that s_k = F_k+2 exactly. The Fibonacci numbers are not just a lower bound — they are the exact minimum. You can construct trees that achieve this minimum: each child has lost exactly one child, creating the sparsest possible tree.

Concept check: A Fibonacci heap has 1000 nodes. What is the maximum possible degree of any single node?

10 (log₂(1000)) 999 (n - 1) 14 (log_φ(1000) ≈ 14.3)

Why not log₂(n)? A binary heap's height is log₂(n) because each node has exactly 2 children and none are ever removed. In a Fibonacci heap, children can be removed (via cuts), so subtrees can be sparser. A degree-k node has at least F_k+2 ≈ φ^k descendants instead of 2^k. Since φ < 2, the degree grows faster relative to n — hence log_φ(n) > log₂(n).

Chapter 8: Fibonacci Heaps in Practice

The theoretical superiority of Fibonacci heaps is undeniable — O(1) decrease-key is unbeatable asymptotically. But asymptotic notation hides constant factors, and those constants matter enormously in practice.

Why Fibonacci Heaps Often Lose

1. Pointer overhead. Each node in a Fibonacci heap stores 5 pointers (parent, child, left, right) plus degree and mark. In a binary heap, each node is just a key in an array — no pointers at all. For 64-bit keys, a Fibonacci heap node is about 56 bytes (7 fields × 8 bytes). A binary heap element is 8 bytes. That is 7x memory overhead.

2. Cache behavior. A binary heap is stored in a contiguous array. Traversing it hits the CPU cache beautifully — adjacent elements are in the same cache line. A Fibonacci heap scatters nodes across the heap (memory allocator sense), with pointer chasing at every step. Each pointer dereference is a potential cache miss (100+ cycles on modern hardware).

3. Allocation cost. Each insert into a Fibonacci heap allocates a new node on the heap. Each extract-min potentially frees a node. Dynamic allocation is expensive — a single malloc/free pair can cost 50-100 nanoseconds. Binary heap insert is just an array write.

4. Constant factors in consolidation. The consolidation loop involves many pointer rewires per merge. Each merge in a binary heap is a single compare-and-swap on array elements.

The bottom line. For graphs with fewer than ~10,000 vertices, a binary heap almost always beats a Fibonacci heap in wall-clock time. For moderate graphs (10K-100K vertices), the Fibonacci heap starts winning only when the graph is very dense (E > 100V). For very large dense graphs (millions of vertices, E = Θ(V²)), the Fibonacci heap's asymptotic advantage finally dominates.

Practical Alternatives

Pairing heaps. Invented by Fredman, Sedgewick, Sleator, and Tarjan (1986). Much simpler than Fibonacci heaps — no mark field, no cascading cuts. The analysis is more complex (the exact amortized bound for decrease-key is still an open problem!), but empirically they perform nearly as well as Fibonacci heaps and are much faster in practice due to simpler code and fewer pointer operations.

Brodal queues. Achieve O(1) worst-case (not amortized) decrease-key. This is the theoretical optimum. But the implementation is so complex that no one uses them in practice. They are mainly of theoretical interest.

d-ary heaps. A generalization of binary heaps where each node has d children instead of 2. With d = 4 or d = 8, decrease-key costs O(log_d n) which is smaller than O(log₂ n), and the cache behavior is excellent because the wider tree has fewer levels. For Dijkstra, a 4-ary heap often beats all other implementations for sparse to moderate graphs.

Rank-pairing heaps. A recent (2009) data structure that achieves the same amortized bounds as Fibonacci heaps but with simpler code and fewer structural constraints. They use ranks instead of degrees and a simplified cascade mechanism.

Heap Type	Insert	Extract-Min	Decrease-Key	Merge	Practical Speed
Binary	O(log n)	O(log n)	O(log n)	O(n)	Excellent
d-ary	O(log_d n)	O(d log_d n)	O(log_d n)	O(n)	Very good
Binomial	O(log n)	O(log n)	O(log n)	O(log n)	Good
Fibonacci	O(1)	O(log n)*	O(1)*	O(1)	Poor-to-fair
Pairing	O(1)	O(log n)*	O(log n)*?	O(1)	Good
Brodal	O(1)	O(log n)	O(1)	O(1)	Terrible

* = amortized. ? = exact bound still open.

When to Actually Use a Fibonacci Heap

Use a Fibonacci heap (or pairing heap) when ALL of the following hold:

1. You need decrease-key (not just insert + extract-min).

2. The number of decrease-key operations greatly exceeds the number of extract-min operations (high E/V ratio in Dijkstra).

3. The graph is large enough that the asymptotic advantage overcomes the constant-factor overhead.

4. You are willing to accept the implementation complexity (or you use a library).

In interviews. You will almost never implement a Fibonacci heap in an interview. What you need to know: (1) it exists, (2) it gives O(1) amortized decrease-key, (3) this improves Dijkstra from O((V+E) log V) to O(V log V + E), (4) Prim's MST also benefits similarly, (5) in practice, simpler heaps often win. Knowing the high-level mechanism (lazy consolidation + cascading cuts) is sufficient.

Heap Comparison: Dijkstra Runtime

Compare estimated operation counts for Dijkstra with different heap implementations across graph densities.

Vertices (V) 10000

E/V ratio 10

Concept check: You are implementing Dijkstra on a sparse road network (V = 1M, E = 3M, E/V = 3). Which heap should you use?

A 4-ary heap — excellent cache behavior, and the log factor is small for sparse graphs A Fibonacci heap — always use the best asymptotic bound An unsorted array — sparse graphs have few edges

Why the 4-ary heap? For E/V = 3, the dominant term is E log V = 3M × 20 = 60M for both binary and Fibonacci heaps (since 3M and 1M × 20 are comparable). The Fibonacci heap saves nothing meaningful here because E ≈ 3V, so the E term and V log V term are similar in magnitude. The 4-ary heap wins on constants: better cache utilization, no allocation overhead, simpler code.

Chapter 9: Interview Arsenal & Connections

The Five-Dimensional Cheat Sheet

Here is everything you need to know about Fibonacci heaps, organized by interview dimension:

CONCEPT: A Fibonacci heap is a collection of min-heap-ordered trees in a circular root list. It achieves O(1) amortized insert, union, and decrease-key by deferring structural cleanup to extract-min. The mark field and cascading-cut mechanism ensure that no node loses more than one child, maintaining the D(n) = O(log n) degree bound.

DESIGN: Use Fibonacci heaps when your algorithm does many more decrease-key operations than extract-min operations (dense graph algorithms). The lazy consolidation strategy trades individual operation efficiency for aggregate amortized efficiency. In practice, pairing heaps or d-ary heaps are almost always better choices.

CODE: You will not implement a Fibonacci heap in an interview. But you should be able to explain: (1) insert = add to root list, (2) extract-min = remove min + consolidate, (3) decrease-key = cut + cascade, (4) the potential function Φ = t + 2m.

DEBUG: Common Fibonacci heap bugs: forgetting to update min pointer after consolidation, not handling the circular list correctly during splice/remove, forgetting to unmark nodes when they become roots, incorrectly updating degree during link/cut.

FRONTIER: Strict Fibonacci heaps (Brodal et al., 2012) achieve all the Fibonacci heap bounds in worst-case time, not just amortized. Hollow heaps (2015) simplify the decrease-key mechanism while maintaining the same bounds. The exact amortized complexity of pairing heaps' decrease-key remains an open problem in theoretical CS.

Complete Heap Comparison

Heap	Insert	Find-Min	Extract-Min	Decrease-Key	Merge	Delete
Binary	O(log n)	O(1)	O(log n)	O(log n)	O(n)	O(log n)
Binomial	O(1)*	O(1)	O(log n)	O(log n)	O(log n)	O(log n)
Fibonacci	O(1)	O(1)	O(log n)*	O(1)*	O(1)	O(log n)*
Pairing	O(1)	O(1)	O(log n)*	O(log n)*	O(1)	O(log n)*
Brodal	O(1)	O(1)	O(log n)	O(1)	O(1)	O(log n)
Strict Fib	O(1)	O(1)	O(log n)	O(1)	O(1)	O(log n)

* = amortized

Graph Algorithm Impact

Algorithm	With Binary Heap	With Fibonacci Heap	Improvement
Dijkstra	O((V+E) log V)	O(V log V + E)	factor log V on E
Prim's MST	O((V+E) log V)	O(V log V + E)	factor log V on E
Johnson's APSP	O(VE log V)	O(V² log V + VE)	significant for dense

Key Takeaways for Interviews

Know the bounds

Insert O(1), Extract-Min O(log n)*, Decrease-Key O(1)*, Union O(1). All amortized.

↓

Know the mechanism

Lazy consolidation + cascading cuts + mark bits. Potential function Φ = t + 2m.

↓

Know the application

Dijkstra O(V log V + E), Prim O(V log V + E). The decrease-key improvement.

↓

Know the practice

Binary/d-ary heaps win on sparse graphs. Fibonacci heaps only win for large dense graphs.

↓

Know the name

D(n) = O(log_φ n). Min subtree size = F_k+2 for degree k. Golden ratio. Fibonacci sequence.

Related Lessons

Chapter 6: Heapsort & Priority Queues — Binary heaps from the ground up. The foundation for understanding why Fibonacci heaps improve on them.

Chapter 24: Single-Source Shortest Paths — Dijkstra's algorithm, the primary client of Fibonacci heaps.

Chapter 23: Minimum Spanning Trees — Prim's algorithm, the other major client.

Chapter 17: Amortized Analysis — The potential method used to analyze Fibonacci heap operations.

A closing thought. Fibonacci heaps are one of the most beautiful data structures in computer science — not because they are practical (they often are not), but because they demonstrate a profound idea: laziness as a design principle. By deliberately deferring work, they achieve amortized bounds that no eager data structure can match. The mark-and-cascade mechanism is an elegant solution to the tension between laziness and structural integrity. Even if you never implement one, understanding how they work deepens your intuition about amortized analysis, potential functions, and the art of algorithmic design.

Final check: Which statement about Fibonacci heaps is FALSE?

The mark field prevents nodes from losing more than one child The potential function is Φ = t + 2m where t = trees, m = marked nodes EXTRACT-MIN is O(1) amortized because we just remove the min pointer

Answer C is false. EXTRACT-MIN is O(log n) amortized, not O(1). It involves promoting children to roots and then running CONSOLIDATE, which merges trees until all roots have distinct degrees. The "easy O(1)" operations are insert, minimum, union, and decrease-key.