spprof

A high-performance sampling profiler for Python with Speedscope and FlameGraph output.

Features

Low overhead — <1% CPU at 10ms sampling, suitable for production
Mixed-mode profiling — Capture Python and C extension frames together
Multi-threaded — Automatic profiling of all Python threads
Memory-efficient — Stack aggregation for long-running profiles
Cross-platform — Linux, macOS, Windows
Python 3.9–3.14 — Including free-threaded builds (Linux & macOS)
Zero dependencies — No runtime requirements
Container-friendly — No SYS_PTRACE capability required

Installation

pip install spprof

Build from source:

pip install git+https://github.com/Periecle/spprof.git

Quick Start

import spprof

spprof.start()
# ... your code ...
profile = spprof.stop()
profile.save("profile.json")

# View at https://www.speedscope.app

Context Manager

with spprof.Profiler(interval_ms=5) as p:
    expensive_computation()

p.profile.save("profile.json")

Decorator

@spprof.profile(output_path="func.json")
def heavy_work():
    ...

API

Core Functions

Function	Description
`start(interval_ms=10)`	Begin profiling
`stop()`	Stop and return `Profile`
`is_active()`	Check if profiler is running
`stats()`	Get live statistics

Thread Management

# Option 1: Explicit registration (recommended for Linux)
def worker():
    spprof.register_thread()
    try:
        do_work()
    finally:
        spprof.unregister_thread()

# Option 2: Context manager
def worker():
    with spprof.ThreadProfiler():
        do_work()

Native Unwinding

Capture C/C++ frames alongside Python for debugging extensions:

if spprof.native_unwinding_available():
    spprof.set_native_unwinding(True)

spprof.start()

Memory-Efficient Aggregation

For long-running profiles with repetitive call patterns:

profile = spprof.stop()
aggregated = profile.aggregate()

print(f"Compression: {aggregated.compression_ratio:.1f}x")
aggregated.save("profile.json")

Output Formats

Speedscope (default)

profile.save("profile.json")

Open at speedscope.app for interactive flame graphs.

FlameGraph

profile.save("profile.collapsed", format="collapsed")

Generate SVG with FlameGraph:

flamegraph.pl profile.collapsed > profile.svg

Configuration

Parameter	Default	Description
`interval_ms`	10	Sampling interval (1–1000ms)
`memory_limit_mb`	100	Maximum buffer size
`output_path`	None	Auto-save on stop

Interval Guidelines

Interval	Overhead	Use Case
1ms	~5%	Short scripts, benchmarks
10ms	<1%	Development (default)
100ms	<0.1%	Production monitoring

Platform Details

Platform	Mechanism	Thread Sampling	Free-Threading
Linux	`timer_create` + SIGPROF	Per-thread CPU time	✅ Supported
macOS	Mach thread suspension	All threads automatic	✅ Supported
Windows	Timer queue + GIL	All threads automatic	—

Linux: Per-thread CPU time sampling with explicit thread registration. Free-threaded Python 3.13+ is supported via speculative capture with validation (~0.0005% sample drop rate).

macOS: Full support for free-threaded Python (--disable-gil) via Mach-based thread suspension sampling.

Development

git clone https://github.com/Periecle/spprof.git
cd spprof
pip install -e ".[dev]"

Testing

pytest                           # Run tests
pytest --cov=spprof              # With coverage

Code Quality

ruff check src/ tests/           # Lint
ruff format src/ tests/          # Format
mypy src/spprof                  # Type check

Troubleshooting

No Samples Captured

If your workload completes too quickly, you may see zero samples. Ensure your workload runs at least 10x the sampling interval:

# For fast functions (< 100ms), use aggressive sampling
spprof.start(interval_ms=1)
fast_function()  # Must run > 10ms to capture samples
profile = spprof.stop()

High Dropped Sample Count

If profile.dropped_count is high, samples are being lost due to buffer overflow:

# Increase memory limit for long-running profiles
spprof.start(interval_ms=10, memory_limit_mb=200)

# Or reduce sampling frequency
spprof.start(interval_ms=100)  # For production/long profiles

Container Permission Issues

In containers with restricted syscalls (seccomp, cgroups v1), spprof automatically falls back to wall-time sampling. For full CPU-time profiling:

# Docker: Run with extended permissions (development only)
docker run --security-opt seccomp=unconfined myapp

See Troubleshooting Guide for detailed solutions including custom seccomp profiles and Kubernetes configurations.

Documentation

Usage Guide — Detailed API documentation
Architecture — Internal design
Performance Tuning — Optimization guide
Troubleshooting — Common issues and solutions

License

MIT License. See LICENSE for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

spprof

Features

Installation

Quick Start

Context Manager

Decorator

API

Core Functions

Thread Management

Native Unwinding

Memory-Efficient Aggregation

Output Formats

Speedscope (default)

FlameGraph

Configuration

Interval Guidelines

Platform Details

Development

Testing

Code Quality

Troubleshooting

No Samples Captured

High Dropped Sample Count

Container Permission Issues

Documentation

License

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

spprof

Features

Installation

Quick Start

Context Manager

Decorator

API

Core Functions

Thread Management

Native Unwinding

Memory-Efficient Aggregation

Output Formats

Speedscope (default)

FlameGraph

Configuration

Interval Guidelines

Platform Details

Development

Testing

Code Quality

Troubleshooting

No Samples Captured

High Dropped Sample Count

Container Permission Issues

Documentation

License