Rouge: A package for automatic evaluation of summaries

2004

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Beyond Pixel Diffs: Benchmarking Image Change Captioning for Web UI Visual Regression Testing

cs.CV · 2026-07-02 · conditional · novelty 7.0

Proposes WUICC task and WUICC-bench dataset, then evaluates 11 image difference captioning methods plus 2 LLMs on web UI changes.

From Prompts to Pavement Through Time: Temporal Grounding in Agentic Scene-to-Plan Reasoning

cs.AI · 2026-05-19 · unverdicted · novelty 5.0

Temporal conditioning in three LLM-based planner architectures for AV scene-to-plan reasoning yields no statistically significant gains on NLP correctness metrics but enables predictive hazard reasoning and stable corrections on BDD-X subsets.

citing papers explorer

Showing 2 of 2 citing papers.

Beyond Pixel Diffs: Benchmarking Image Change Captioning for Web UI Visual Regression Testing

cs.CV · 2026-07-02 · conditional · none · ref 44

Proposes WUICC task and WUICC-bench dataset, then evaluates 11 image difference captioning methods plus 2 LLMs on web UI changes.

From Prompts to Pavement Through Time: Temporal Grounding in Agentic Scene-to-Plan Reasoning

cs.AI · 2026-05-19 · unverdicted · none · ref 21

Temporal conditioning in three LLM-based planner architectures for AV scene-to-plan reasoning yields no statistically significant gains on NLP correctness metrics but enables predictive hazard reasoning and stable corrections on BDD-X subsets.
