Confidence and Stability of Global and Pairwise Scores in NLP Evaluation Paper • 2507.01633 • Published Jul 2
IMDB-WIKI-SbS: An Evaluation Dataset for Crowdsourced Pairwise Comparisons Paper • 2110.14990 • Published Oct 28, 2021
Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction Paper • 1808.06696 • Published Aug 20, 2018
Best Prompts for Text-to-Image Models and How to Find Them Paper • 2209.11711 • Published Sep 23, 2022 • 3
CrowdSpeech and VoxDIY: Benchmark Datasets for Crowdsourced Audio Transcription Paper • 2107.01091 • Published Jul 2, 2021