Summary / Motivation

This pull request introduces COFT (COarse-to-Fine highlighTing) to torch_geometric.llm: a modular component designed to reduce hallucinations in retrieval-augmented and knowledge-grounded LLM workflows.

While PyG already provides various utilities for LLM integration, there is currently no built-in mechanism for context selection, entity-driven scoring, or highlight-based grounding. COFT fills this gap by offering a plug-and-play module that:

  • identifies key entities using a graph-aware recall step,
  • scores them via contextual weighting,
  • and highlights important spans in the reference text at different granularities (paragraph, sentence, word).

This follows the methodology proposed in the COFT paper and enables more accurate downstream LLM reasoning.
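
For intuition, here is a minimal, self-contained toy version of this coarse-to-fine flow (purely illustrative; the actual recaller, scorer, and selector in this PR are far more capable, and none of the names below are part of the module's API):

# Toy coarse-to-fine highlighting: recall entities, score sentences, select spans.
def highlight(query: str, reference: str, alias: dict[str, list[str]]) -> str:
    # 1) Recall: keep entities whose canonical name or aliases appear in the
    #    query or the reference text.
    hits = {e for e, names in alias.items()
            if any(a in query or a in reference for a in [e, *names])}
    # 2) Score: weight each sentence by how many recalled entities it mentions.
    sentences = reference.split('. ')
    scores = [sum(s.count(e) for e in hits) for s in sentences]
    # 3) Select: highlight sentences scoring above the mean (a crude stand-in
    #    for the dynamic threshold used by the real selector).
    thr = sum(scores) / max(len(scores), 1)
    return '. '.join(f'**{s}**' if sc > thr else s
                     for s, sc in zip(sentences, scores))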


What This PR Adds

New modules

  • COFT: main highlighting pipeline with recaller, scorer, and selector
  • Graph-based “recaller” using entity alias dictionaries
  • Contextual weight “scorer” integrating TF-ISF with language-model self-information (a rough sketch follows this list)
  • Dynamic threshold “selector” supporting paragraph, sentence, and word-level highlighting
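
As a rough illustration of the two scoring signals (not the PR's implementation; the function names and smoothing choices below are assumptions):

import math

# TF-ISF (term frequency-inverse sentence frequency): a term that is frequent
# within a sentence but rare across sentences gets a high weight, analogous to
# TF-IDF at sentence granularity.
def tf_isf(term: str, sentence: list[str], sentences: list[list[str]]) -> float:
    tf = sentence.count(term) / max(len(sentence), 1)
    n_containing = sum(1 for s in sentences if term in s)
    isf = math.log(len(sentences) / (1 + n_containing)) + 1.0
    return tf * isf

# Self-information of a token under a language model: rarer (lower-probability)
# tokens carry more information and thus contribute more weight.
def self_information(token_prob: float) -> float:
    return -math.log(max(token_prob, 1e-12))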

New example script

examples/coft.py

Demonstrates full end-to-end usage with torch_geometric.llm.LLM.

New unit tests

Located under test/llm/models/test_coft.py, covering the cases below (a sketch of one such test follows the list):

  • candidate recall
  • word/sentence/paragraph-level highlighting
  • consistent behavior across the three granularities
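
A hypothetical shape for one of these tests, reusing the public API from the Example Usage section below (the fixtures and assertions in the actual test file may differ):

import pytest
from torch_geometric.llm.models import COFT, Granularity, LLM

@pytest.mark.parametrize('granularity', [
    Granularity.WORD, Granularity.SENTENCE, Granularity.PARAGRAPH,
])
def test_highlighting_granularities(granularity):
    # Hypothetical fixtures; the real tests may construct these differently.
    llm = LLM('Qwen/Qwen2.5-0.5B-Instruct')
    triplets = [('apple', 'is_rich_in', 'vitamin C')]
    entity_alias = {'apple': ['apples']}
    coft = COFT(llm, triplets, entity_alias)
    out = coft(query='What nutrients do apples provide?',
               reference='Apples provide vitamin C and fiber.',
               granularity=granularity)
    assert out  # highlighted text should be returned at every granularity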

Why This Is Useful

LLMs reasoning over long contexts can hallucinate when irrelevant text overwhelms the model.

COFT significantly improves robustness by:

  • reducing distractions in long contexts
  • grounding LLM reasoning to graph-derived key entities
  • improving interpretability (highlighted spans act as an attention bottleneck)
  • supporting CPU-friendly scoring models when running lightweight LLMs

This module complements PyG’s direction toward graph-assisted LLMs and aligns well with existing efforts such as RAG, graph prompting, and KG-augmented workflows.


Breaking Changes

No breaking changes introduced.

COFT is self-contained and does not modify existing LLM APIs.


Example Usage

from torch_geometric.llm.models import LLM, Granularity, COFT

llm = LLM("Qwen/Qwen2.5-0.5B-Instruct")
coft = COFT(llm, triplets, entity_alias)

highlighted = coft(
    query="What nutrients do apples provide?",
    reference=text,
    granularity=Granularity.SENTENCE,
)
print(highlighted)
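
Here, triplets, entity_alias, and text are assumed to have roughly the following shapes (hypothetical values for illustration; see examples/coft.py for the actual construction):

# Hypothetical inputs; the example script builds these from a knowledge graph.
triplets = [("apple", "is_rich_in", "vitamin C"), ("apple", "contains", "fiber")]
entity_alias = {"apple": ["apples", "Malus domestica"]}
text = "Apples are rich in vitamin C, fiber, and various antioxidants."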

Test Plan

All tests pass:

pytest test/llm/models/test_coft.py -v 

Manual validation:

python examples/coft.py 

Both example results and unit tests confirm consistent highlighting behavior.
