Add `to_batch_edge_index` utility function #10485

DhyeyMavani2003 · 2025-10-04T09:01:42Z

Description

Implements to_batch_edge_index as the inverse operation of unbatch_edge_index. This function merges a list of edge_index tensors into a single batched edge_index tensor and returns the corresponding batch vector.

Closes #6099

Motivation

Currently, PyG provides unbatch_edge_index to split a batched edge_index into individual graphs, but lacks the inverse operation to merge multiple edge_index tensors into a batch. This function completes the API by providing the batching counterpart, enabling users to:

Manually construct batched graphs from individual edge_index tensors
Implement custom batching logic
Work with dynamic graph batching scenarios

Changes

Core Implementation

Added to_batch_edge_index() function in torch_geometric/utils/_unbatch.py
- Takes a list of edge_index tensors as input
- Returns a tuple of (batched_edge_index, batch_vector)
- Properly offsets node indices for each graph
- Handles edge cases: empty lists, empty graphs, mixed scenarios

API Updates

Exported to_batch_edge_index in torch_geometric/utils/__init__.py

Testing

Added 6 comprehensive test cases in test/utils/test_unbatch.py:
- Basic functionality
- Empty list handling
- Single graph handling
- Mixed empty/non-empty graphs
- Roundtrip verification (proves it's the inverse of unbatch_edge_index)
- Different sized graphs

Usage Example

import torch from torch_geometric.utils import to_batch_edge_index, unbatch_edge_index # Create individual edge_index tensors edge_index_list = [ torch.tensor([[0, 1, 1, 2, 2, 3], [1, 0, 2, 1, 3, 2]]), torch.tensor([[0, 1, 1, 2], [1, 0, 2, 1]]), ] # Batch them together edge_index, batch = to_batch_edge_index(edge_index_list) print(edge_index) # tensor([[0, 1, 1, 2, 2, 3, 4, 5, 5, 6], # [1, 0, 2, 1, 3, 2, 5, 4, 6, 5]]) print(batch) # tensor([0, 0, 0, 0, 1, 1, 1]) # Verify roundtrip unbatched = unbatch_edge_index(edge_index, batch) assert all(torch.equal(a, b) for a, b in zip(edge_index_list, unbatched))

Testing

All tests pass:

pytest test/utils/test_unbatch.py -v # 8 passed in 0.04s

Pre-commit hooks pass:

pre-commit run --all-files # All checks passed

Checklist

Implementation follows PyG conventions
Comprehensive test coverage added
Documentation with examples included
Type hints provided
Pre-commit hooks pass (yapf, flake8, ruff)
Roundtrip test verifies correctness
Edge cases handled (empty lists, empty graphs)

Related Issues

Closes #6099

Implements to_batch_edge_index as the inverse of unbatch_edge_index. This function merges a list of edge_index tensors into a single batched edge_index tensor and returns the corresponding batch vector. Features: - Handles empty lists and empty graphs - Properly offsets node indices for each graph - Comprehensive test coverage including roundtrip tests - Follows PyG conventions and code style Closes pyg-team#6099 Co-authored-by: Ona <no-reply@ona.com>

DhyeyMavani2003 requested review from akihironitta, rusty1s and wsad1 as code owners October 4, 2025 09:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add `to_batch_edge_index` utility function #10485

Add `to_batch_edge_index` utility function #10485

Uh oh!

DhyeyMavani2003 commented Oct 4, 2025

Labels

1 participant

Add to_batch_edge_index utility function #10485

Are you sure you want to change the base?

Add to_batch_edge_index utility function #10485

Uh oh!

Conversation

DhyeyMavani2003 commented Oct 4, 2025

Description

Motivation

Changes

Core Implementation

API Updates

Testing

Usage Example

Testing

Checklist

Related Issues

Labels

1 participant

Add `to_batch_edge_index` utility function #10485

Add `to_batch_edge_index` utility function #10485