Visualize: Figure1 & 2, Table1.
Description: " A multiple-choice question from the ARC dataset
with the correct answer in bold, followed by justification sentences selected by our approach (ROCC) vs. sentences selected by a strong IR baseline (BM25). ROCC justification
sentences fully cover the five key terms in the question (shown
in italic),"