
Beyond "Not Novel Enough": Enriching Scholarly Critique with LLM-Assisted Feedback

Novelty assessment is a central yet understudied aspect of peer review, particularly in high-volume fields like NLP where reviewer capacity is strained. We present a structured approach for automated novelty evaluation that models expert reviewer behavior through three stages: content extraction from submissions, retrieval and synthesis of related work, and structured comparison for evidence-based assessment. Informed by a large-scale analysis of human-written novelty reviews, our method captures key patterns such as independent claim verification and contextual reasoning. Evaluated on 182 ICLR 2025 submissions with human-annotated reviewer novelty assessments, it achieves 86.5% alignment with human reasoning and 75.3% agreement on novelty conclusions, substantially outperforming existing LLM-based baselines. The method produces detailed, literature-aware analyses, improves consistency over ad hoc judgments, and demonstrates the potential of structured LLM-assisted approaches to support more rigorous and transparent peer review without displacing human expertise. Data and code are made available.

This repo contains the code and data used to run the experiments reported in the paper.

Contact person: Osama Mohammed Afzal

UKP Lab | TU Darmstadt

Getting Started

python -m venv .venv
source .venv/bin/activate
pip install .
pip install -r requirements.txt

Prerequisites

  1. API Keys: Copy .env.example to .env and add your API keys:

    • OPENAI_API_KEY: Required for structured extraction and novelty assessment
    • SEMANTIC_SCHOLAR_API_KEY: Optional, improves rate limits for paper retrieval
  2. GROBID Setup: The pipeline requires GROBID for extracting structured metadata from PDFs:

    • Installation: Follow the installation instructions in the GROBID repository
    • Usage: Process your PDF papers to generate TEI XML files:
      # Example GROBID processing (refer to GROBID docs for detailed instructions)
      curl -X POST -F "input=@paper.pdf" localhost:8070/api/processFulltextDocument
    • Expected Output: The pipeline expects GROBID TEI XML files in this structure:
      data/{submission_id}/{submission_id}.grobid.tei.xml
      
  3. OCR Processing: The pipeline requires OCR processing of PDF papers to extract introductions. You can use either Nougat or MinerU (see step 6 under Usage); a quick check of the expected layout is sketched after this list.

    The pipeline expects OCR output in these specific directory structures:

    For Main Paper (any one of these):

    data/{submission_id}/ocr_output/{submission_id}/auto/{submission_id}.md
    data/{submission_id}/nougat_output/{submission_id}.mmd
    data/{submission_id}/mineru_output/{submission_id}.md
    

    For Related Papers (any one of these):

    data/{submission_id}/related_work_data/ocr_output/{paper_id}/auto/{paper_id}.md
    data/{submission_id}/related_work_data/nougat_output/{paper_id}.mmd
    

    Generated Introduction Files (created by pipeline):

    data/{submission_id}/ours/{submission_id}_intro.txt              # main paper
    data/{submission_id}/ours/related_papers/{paper_id}_intro.txt    # related papers
    
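Before running the pipeline, it is worth checking that the inputs are where the pipeline expects them. The snippet below is a minimal sanity-check sketch based on the layout above; the submission ID is a placeholder for your own.

# Minimal input check (sketch); replace the placeholder submission ID
SUBMISSION_ID=your_submission_id
test -f data/${SUBMISSION_ID}/${SUBMISSION_ID}.grobid.tei.xml || echo "Missing GROBID TEI XML (Prerequisite 2)"
# Lists whichever main-paper OCR output already exists (any one is sufficient)
ls data/${SUBMISSION_ID}/ocr_output/${SUBMISSION_ID}/auto/${SUBMISSION_ID}.md \
   data/${SUBMISSION_ID}/nougat_output/${SUBMISSION_ID}.mmd \
   data/${SUBMISSION_ID}/mineru_output/${SUBMISSION_ID}.md 2>/dev/null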

Usage

Complete Pipeline

The novelty assessment pipeline consists of several stages. For a single submission:

  1. GROBID Processing: Process submission PDF with GROBID (external step)

    # Start GROBID service (refer to GROBID documentation)
    # Process PDF to generate TEI XML file
    # Save as: data/{submission_id}/{submission_id}.grobid.tei.xml
  2. Preprocess: Extract metadata from GROBID TEI XML

    cd src/preprocess
    python extract_metadata.py --data-dir /path/to/data --submission-id SUBMISSION_ID
  3. Enrich Citations: Add Semantic Scholar data to citations

    cd src/retrieval
    python match_papers_to_s2.py --input /path/to/data --submission-id SUBMISSION_ID
  4. Retrieve Related Papers: Find and rank related papers

    cd src/retrieval
    python retrieval.py --input /path/to/data --submission-id SUBMISSION_ID
  5. Download PDFs: Download PDFs of ranked papers

    cd src/retrieval
    python get_cited_pdfs.py --data-dir /path/to/data --submission-id SUBMISSION_ID
  6. OCR Processing: Process PDFs with Nougat or MinerU (external step)

    # Process main paper PDF and related paper PDFs with OCR tool of choice
    # Save outputs in expected directory structure (see Prerequisites)
  7. Extract Introductions: Extract introductions from OCR output

    cd src/retrieval
    python extract_introductions.py --data-dir /path/to/data --submission-id SUBMISSION_ID
  8. Run Analysis: Complete novelty assessment pipeline

    cd src/novelty_assessment
    python pipeline.py --data-dir /path/to/data --submission-id SUBMISSION_ID
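For convenience, the scripted stages (everything except the external GROBID and OCR steps) can be chained in a small shell script. This is only a sketch mirroring the commands above; adjust the data directory and submission ID, and run your OCR tool where indicated.

#!/usr/bin/env bash
# Sketch: run the scripted pipeline stages for one submission.
# Assumes the GROBID TEI XML (step 1) is already in place.
set -e
DATA_DIR=/path/to/data
SUBMISSION_ID=your_submission_id

(cd src/preprocess && python extract_metadata.py --data-dir "$DATA_DIR" --submission-id "$SUBMISSION_ID")
(cd src/retrieval && python match_papers_to_s2.py --input "$DATA_DIR" --submission-id "$SUBMISSION_ID")
(cd src/retrieval && python retrieval.py --input "$DATA_DIR" --submission-id "$SUBMISSION_ID")
(cd src/retrieval && python get_cited_pdfs.py --data-dir "$DATA_DIR" --submission-id "$SUBMISSION_ID")

# Run your OCR tool here (step 6), then continue:
(cd src/retrieval && python extract_introductions.py --data-dir "$DATA_DIR" --submission-id "$SUBMISSION_ID")
(cd src/novelty_assessment && python pipeline.py --data-dir "$DATA_DIR" --submission-id "$SUBMISSION_ID")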

Individual Components

Each stage can also be run independently using the CLI interfaces provided:

  • extract_metadata.py: Extracts structured metadata (title, abstract, citations, citation contexts) from GROBID TEI XML files
  • match_papers_to_s2.py: Enriches citations with Semantic Scholar data (abstracts, paper IDs, publication info)
  • retrieval.py: Generates search keywords, queries Semantic Scholar, ranks papers using SPECTER2 embeddings and RankGPT
  • get_cited_pdfs.py: Downloads PDFs of ranked papers from ArXiv, ACL Anthology, and Semantic Scholar
  • extract_introductions.py: Extracts introduction sections from OCR-processed papers using pattern matching
  • structured_extraction.py: Uses LLMs to extract structured information (methods, problems, datasets, results, novelty claims)
  • research_landscape.py: Analyzes the research landscape and identifies methodological clusters and relationships
  • novelty_assessment.py: Performs detailed novelty analysis comparing submission against related work
  • generate_summary.py: Generates final reviewer guidance summarizing the novelty assessment
  • pipeline.py: Orchestrates the complete analysis pipeline with dependency checking
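For example, to re-run only the structured extraction stage once the earlier outputs exist, an invocation along these lines should work (the flags are assumed to match the Key Parameters section below; check the script's --help for the options it actually supports):

cd src/novelty_assessment
python structured_extraction.py --data-dir /path/to/data --submission-id SUBMISSION_ID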

Expected Results

After running the complete pipeline, you should expect the following output structure and files:

data/{submission_id}/
├── {submission_id}.grobid.tei.xml           # Input: GROBID TEI XML file
├── ours/
│   ├── {submission_id}.json                 # Extracted metadata (title, abstract, citations)
│   ├── {submission_id}_intro.txt            # Main paper introduction
│   ├── related_papers/                      # Related papers introductions
│   │   ├── {paper_id}_intro.txt
│   │   └── ...
│   ├── s2_enriched_{submission_id}.json     # Citations enriched with Semantic Scholar data
│   ├── related_work_{submission_id}.json    # Retrieved and ranked related papers
│   ├── structured_extraction_{submission_id}.json  # LLM-extracted structured information
│   ├── research_landscape_{submission_id}.json     # Research landscape analysis
│   ├── novelty_assessment_{submission_id}.json     # Detailed novelty assessment
│   └── summary_{submission_id}.json         # **Final summary for reviewers**
└── related_work_data/
    ├── pdfs/                                # Downloaded related paper PDFs
    └── ocr_output/                          # OCR processed papers

Key Output Files:

  1. summary_{submission_id}.json - The main output containing:

    • Executive Summary: High-level novelty assessment and recommendations
    • Detailed Analysis: Evidence-based comparison with related work
    • Reviewer Guidance: Structured feedback for peer reviewers
    • Supporting Evidence: Citations and specific comparisons
  2. novelty_assessment_{submission_id}.json - Detailed technical analysis including:

    • Methodological comparisons with related work
    • Innovation assessment across different dimensions
    • Evidence-based novelty scoring
    • Specific technical differentiators
  3. structured_extraction_{submission_id}.json - Structured information extracted from paper:

    • Methods and approaches used
    • Problems addressed and datasets
    • Key results and claims
    • Novelty assertions by authors

The pipeline produces comprehensive, literature-aware analyses that help reviewers assess novelty systematically rather than making ad hoc judgments. The final summary provides actionable guidance for peer review decisions.
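Once the run completes, the final summary can be inspected quickly with Python's standard-library json.tool module (the submission ID below is a placeholder):

# Pretty-print the reviewer-facing summary
python -m json.tool data/your_submission_id/ours/summary_your_submission_id.json | less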

Key Parameters

The pipeline components accept these main parameters:

  • --data-dir: Base directory containing submission data (required for all components)
  • --submission-id: Unique identifier for the submission being processed (required for pipeline mode)
  • --input: Input directory path (used by some retrieval components)
  • --verbose, -v: Enable detailed logging output (optional)
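Putting these together, a typical full-pipeline invocation looks like this (the data directory and submission ID are placeholders):

cd src/novelty_assessment
python pipeline.py --data-dir /path/to/data --submission-id SUBMISSION_ID --verbose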

Development

For development work:

  1. Install dependencies:

    pip install -r requirements.txt
  2. The codebase is organized into three main stages:

    • src/preprocess/: Metadata extraction from GROBID TEI files
    • src/retrieval/: Paper retrieval, PDF download, and introduction extraction
    • src/novelty_assessment/: LLM-based analysis and summary generation
  3. Each component includes both a CLI interface and pipeline integration methods for flexible usage.

Cite

Please use the following citation:

@misc{afzal2025notnovelenoughenriching,
      title={Beyond "Not Novel Enough": Enriching Scholarly Critique with LLM-Assisted Feedback},
      author={Osama Mohammed Afzal and Preslav Nakov and Tom Hope and Iryna Gurevych},
      year={2025},
      eprint={2508.10795},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2508.10795},
}

Disclaimer

This repository contains experimental software and is published for the sole purpose of giving additional background details on the respective publication.
