Davide Marchi - August 2025
Distributed out-of-core MergeSort in C++20 using mmap. The project implements a common I/O pipeline (index build → index sort → rewrite) across four variants (see the sketch after the list):
- Sequential (baseline)
- OpenMP (task-based mergesort)
- FastFlow (farm: emitter + workers)
- MPI + OpenMP (one rank per node; pairwise distributed merges)
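All four variants share the same three mmap-based phases: scan the input once to build an index of (key, offset, size) entries, sort only the index, then copy the records into the output file in index order. Below is a minimal sketch of that pipeline. The record layout (a 64-bit key, a 32-bit payload length, then the payload bytes) and the lack of error handling are assumptions for illustration; the project's actual format and helpers live in utils.hpp and the *_mmap.cpp sources and may differ.

```cpp
// Minimal sketch of the shared pipeline: index build -> index sort -> rewrite,
// all through mmap. The record layout here is an assumption (u64 key, u32
// payload length, payload bytes); the project's real format may differ.
#include <algorithm>
#include <cstdint>
#include <cstring>
#include <vector>
#include <fcntl.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

struct IndexEntry {
    uint64_t key;     // sort key from the record header
    size_t   offset;  // byte offset of the record in the input file
    size_t   size;    // total record size (header + payload)
};

int main(int argc, char** argv) {
    if (argc < 3) return 1;                       // usage: sketch <in> <out>

    // Map the unsorted input read-only.
    int in_fd = open(argv[1], O_RDONLY);
    if (in_fd < 0) return 1;
    struct stat st{};
    fstat(in_fd, &st);
    const size_t in_size = static_cast<size_t>(st.st_size);
    auto* in = static_cast<const unsigned char*>(
        mmap(nullptr, in_size, PROT_READ, MAP_PRIVATE, in_fd, 0));

    // Phase 1: index build -- one sequential scan, payloads are not copied.
    std::vector<IndexEntry> index;
    for (size_t off = 0; off + 12 <= in_size; ) {
        uint64_t key; uint32_t len;
        std::memcpy(&key, in + off, sizeof key);
        std::memcpy(&len, in + off + 8, sizeof len);
        const size_t rec = 12 + len;
        if (off + rec > in_size) break;
        index.push_back({key, off, rec});
        off += rec;
    }

    // Phase 2: index sort -- only the small (key, offset, size) entries move.
    std::sort(index.begin(), index.end(),
              [](const IndexEntry& a, const IndexEntry& b) { return a.key < b.key; });

    // Phase 3: rewrite -- copy records into a same-sized output mapping in order.
    int out_fd = open(argv[2], O_RDWR | O_CREAT | O_TRUNC, 0644);
    if (out_fd < 0 || ftruncate(out_fd, static_cast<off_t>(in_size)) != 0) return 1;
    auto* out = static_cast<unsigned char*>(
        mmap(nullptr, in_size, PROT_READ | PROT_WRITE, MAP_SHARED, out_fd, 0));
    size_t pos = 0;
    for (const IndexEntry& e : index) {
        std::memcpy(out + pos, in + e.offset, e.size);
        pos += e.size;
    }
    msync(out, in_size, MS_SYNC);                 // flush the sorted file to disk

    munmap(const_cast<unsigned char*>(in), in_size);
    munmap(out, in_size);
    close(in_fd);
    close(out_fd);
    return 0;
}
```

Because only the index entries move during the sort, the payload bytes are touched once on the scan and once on the rewrite, which keeps the out-of-core case practical.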
The repo includes runnable scripts for experiments, a Jupyter notebook for plotting, and a PDF report. Results and logs are saved to folders so runs are reproducible.
```
.
├── analysis/
│   ├── seq_omp_ff_composites.ipynb   # Notebook for plots/analysis
│   └── plots_seq_omp_ff/             # Plots saved by the notebook
├── report/
│   ├── report.pdf                    # Compiled report
│   └── imgs/                         # Images used in the report
├── scripts/                          # Helper scripts for sweeps
│   ├── run_array_any.sh              # Single-node: seq / omp / ff
│   └── run_array_mpi.sh              # Multi-node: MPI+OMP
├── results/                          # CSV output (created at run time)
├── logs/                             # Logs from runs (created at run time)
├── fastflow/
│   └── ff/                           # FastFlow
│       └── mapping_string.sh
├── bin/                              # Executables (created by make)
├── Makefile
├── utils.hpp
├── sequential_seq_mmap.cpp
├── omp_mmap.cpp
├── ff_mmap.cpp
├── mpi_omp_mmap.cpp
└── README.md
```

Requirements:

- GCC or Clang with C++20 and OpenMP
- MPI toolchain (Open MPI or MPICH) for the MPI target
- Linux with mmap
- FastFlow headers (cloned locally as shown below)
Clone the official FastFlow repository in this project directory so it sits alongside the sources:
```bash
git clone https://github.com/fastflow/fastflow.git fastflow
```

Run the CPU mapping helper from inside the ff/ subfolder. This script directly updates FastFlow's internal mapping configuration:
```bash
cd fastflow/ff
bash mapping_string.sh
```

From the project root, build everything:

```bash
make            # builds all targets into ./bin
```

Targets produced:
- bin/sequential_seq_mmap
- bin/omp_mmap
- bin/ff_mmap
- bin/mpi_omp_mmap
```bash
make clean      # remove objects
make distclean  # also remove ./bin and build artifacts
```

All binaries print help with -h. Example:
```
bin/omp_mmap -h
Usage: bin/omp_mmap [options]
  -n, --records N   number of records (default 1e6)
  -p, --payload B   maximum payload size in bytes (default 256)
  -t, --threads T   threads to use (0 = hw concurrency)
  -c, --cutoff N    task cutoff size (default 10000)
  -h, --help        show this help
```

(The same flags apply to the other executables. For the MPI+OMP binary, -t selects threads per rank.)
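For the OpenMP variant, -c bounds task granularity in the recursive mergesort: ranges smaller than the cutoff are sorted sequentially instead of spawning further tasks. The sketch below shows that pattern on a plain vector of keys; the names and the sequential fallback are illustrative, not the project's exact code.

```cpp
// Minimal sketch of a task-based mergesort with a cutoff, as selected by -c:
// subranges smaller than `cutoff` are sorted sequentially instead of spawning
// more tasks. Names and types are illustrative only.
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

static void merge_sort(std::vector<uint64_t>& v, std::vector<uint64_t>& tmp,
                       size_t lo, size_t hi, size_t cutoff) {
    if (hi - lo <= cutoff) {                 // small range: no more tasks
        std::sort(v.begin() + lo, v.begin() + hi);
        return;
    }
    size_t mid = lo + (hi - lo) / 2;
    // Sort the two halves as independent tasks.
    #pragma omp task shared(v, tmp)
    merge_sort(v, tmp, lo, mid, cutoff);
    #pragma omp task shared(v, tmp)
    merge_sort(v, tmp, mid, hi, cutoff);
    #pragma omp taskwait                     // both halves done before merging
    std::merge(v.begin() + lo, v.begin() + mid,
               v.begin() + mid, v.begin() + hi, tmp.begin() + lo);
    std::copy(tmp.begin() + lo, tmp.begin() + hi, v.begin() + lo);
}

int main() {
    std::vector<uint64_t> keys(1'000'000);
    for (size_t i = 0; i < keys.size(); ++i) keys[i] = keys.size() - i;
    std::vector<uint64_t> tmp(keys.size());

    #pragma omp parallel        // team of threads...
    #pragma omp single          // ...but only one thread seeds the task tree
    merge_sort(keys, tmp, 0, keys.size(), /*cutoff=*/10000);

    return std::is_sorted(keys.begin(), keys.end()) ? 0 : 1;
}
```

A larger cutoff creates fewer, coarser tasks and less scheduling overhead; a smaller one exposes more parallelism.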
Create output folders once:
```bash
mkdir -p results logs
```

Example single-node runs:

```bash
# Sequential baseline
srun -n 1 bin/sequential_seq_mmap -n 100000000 -p 32

# OpenMP on 16 threads
srun -n 1 --cpus-per-task=16 bin/omp_mmap -n 100000000 -p 32 -t 16 -c 10000

# FastFlow on 16 threads (1 emitter + 15 workers)
# Run ff/mapping_string.sh beforehand to get a good CPU mapping.
srun -n 1 --cpus-per-task=16 bin/ff_mmap -n 100000000 -p 32 -t 16 -c 10000
```

One rank per node (example: 4 nodes, 16 threads per rank):
```bash
srun --mpi=pmix -N 4 -n 4 --cpus-per-task=16 bin/mpi_omp_mmap -n 100000000 -p 32 -t 16 -c 10000
```

(Alternatively: -N 4 --ntasks-per-node=1.)
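Inside the MPI+OMP binary, each rank sorts its local share (one rank per node, OpenMP threads within it) and the ranks then combine results through pairwise merges. One common way to organize that is a tree of rounds in which half of the still-active ranks ship their sorted block to a partner each round. The sketch below illustrates that idea on in-memory keys with MPI point-to-point calls; it is an assumption about the structure, not the project's exact protocol, which exchanges mmapped records.

```cpp
// Minimal sketch of pairwise distributed merging after a local sort: in each
// round, half of the still-active ranks send their sorted keys to a partner,
// which merges them with its own block. Illustrative only.
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>
#include <mpi.h>

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int rank, size;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    // Each rank starts with a locally sorted block (here: synthetic keys).
    std::vector<uint64_t> keys(1 << 20);
    for (size_t i = 0; i < keys.size(); ++i)
        keys[i] = (uint64_t)rank + i * (uint64_t)size;
    std::sort(keys.begin(), keys.end());

    // Pairwise merge rounds: senders ship their block, then drop out.
    for (int step = 1; step < size; step *= 2) {
        if (rank % (2 * step) == step) {
            // Sender: transmit block size, then the sorted keys.
            uint64_t n = keys.size();
            MPI_Send(&n, 1, MPI_UINT64_T, rank - step, 0, MPI_COMM_WORLD);
            MPI_Send(keys.data(), (int)n, MPI_UINT64_T, rank - step, 1, MPI_COMM_WORLD);
            break;
        } else if (rank % (2 * step) == 0 && rank + step < size) {
            // Receiver: merge the partner's sorted block with the local one.
            uint64_t n = 0;
            MPI_Recv(&n, 1, MPI_UINT64_T, rank + step, 0, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            std::vector<uint64_t> incoming(n);
            MPI_Recv(incoming.data(), (int)n, MPI_UINT64_T, rank + step, 1,
                     MPI_COMM_WORLD, MPI_STATUS_IGNORE);
            std::vector<uint64_t> merged(keys.size() + incoming.size());
            std::merge(keys.begin(), keys.end(), incoming.begin(), incoming.end(),
                       merged.begin());
            keys.swap(merged);
        }
    }
    // After the last round, rank 0 holds the fully merged sequence.

    MPI_Finalize();
    return 0;
}
```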
Run the sweep scripts with just the binary path; outputs go to results/ and logs to logs/ by default.
```bash
# Single-node sweeps (seq / omp / ff)
bash scripts/run_array_any.sh --bin bin/omp_mmap

# Multi-node sweeps (MPI + OMP)
bash scripts/run_array_mpi.sh --bin bin/mpi_omp_mmap
```

Each script defines default grids for records, payloads, threads, and trials. Override them by passing the appropriate flags/environment variables (see the script header).
The plotting notebook is at analysis/seq_omp_ff_composites.ipynb. At the top, a boolean variable toggles whether to include write-time in the figures. Running the notebook saves composite figures under analysis/plots_seq_omp_ff/.