SHORT: Can citations tell us about a paper’s reproducibility? A case study of machine learning papers

Rochana R. Obadage, Sarah M. Rajtmajer, Jian Wu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The iterative character of work in machine learning (ML) and artificial intelligence (AI) and reliance on comparisons against benchmark datasets emphasize the importance of reproducibility in that literature. Yet, resource constraints and inadequate documentation can make running replications particularly challenging. Our work explores the potential of using downstream citation contexts as a signal of reproducibility. We introduce a sentiment analysis framework applied to citation contexts from papers involved in Machine Learning Reproducibility Challenges in order to interpret the positive or negative outcomes of reproduction attempts. Our contributions include training classifiers for reproducibility-related contexts and sentiment analysis, and exploring correlations between citation context sentiment and reproducibility scores. Study data, software, and an artifact appendix are publicly available at https://github.com/lamps-lab/ccair-ai-reproducibility.

Original languageEnglish (US)
Title of host publicationProceedings of the 2nd ACM Conference on Reproducibility and Replicability, REP 2024
PublisherAssociation for Computing Machinery, Inc
Pages96-100
Number of pages5
ISBN (Electronic)9798400705304
DOIs
StatePublished - Jun 18 2024
Event2nd ACM Conference on Reproducibility and Replicability, REP 2024 - Rennes, France
Duration: Jun 18 2024Jun 20 2024

Publication series

NameProceedings of the 2nd ACM Conference on Reproducibility and Replicability, REP 2024

Conference

Conference2nd ACM Conference on Reproducibility and Replicability, REP 2024
Country/TerritoryFrance
CityRennes
Period6/18/246/20/24

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Hardware and Architecture
  • Software

Fingerprint

Dive into the research topics of 'SHORT: Can citations tell us about a paper’s reproducibility? A case study of machine learning papers'. Together they form a unique fingerprint.

Cite this