Skip to content

Path.rglob performance issues in deeply nested directories compared to glob.glob(recursive=True) #102613

Closed
@ionite34

Description

@ionite34

Bug report

Pathlib.rglob can be orders of magnitudes slower than glob.glob(recursive=True)

With a 1000-deep nested directory, glob.glob and Path.glob both took under 1 second. Path.rglob took close to 1.5 minutes.

import glob import os from pathlib import Path x = "" for _ in range(1000): x += "a/" os.mkdir(x) # ~ 0.5s print(glob.glob("**/*", recursive=True)) # ~ 87s print(list(Path(".").rglob("**/*")))

Linked PRs

Metadata

Metadata

Assignees

No one assigned

    Labels

    performancePerformance or resource usagetopic-pathlibtype-bugAn unexpected behavior, bug, or error

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions