Skip to main navigation Skip to search Skip to main content

Human or Machine? A Survey on Machine-Generated Text Detection

  • Zainab Ahmad
  • , Miguel Torres-Ruiz
  • , Ahmad Mahmood
  • , Rolando Quintero
  • , Iqra Ameer
  • , Necva Bölücü

Research output: Contribution to journalReview articlepeer-review

Abstract

As generative AI advances rapidly across education, research, medicine, and journalism, machine-generated text (MGT) raises questions about authenticity, ethics, and social impact. To ground this discussion, we conducted a linguistic analysis, covering phonology, morphology, syntax, semantics, lexicon, and pragmatics, which uncovers robust MGT signatures, such as lower perplexity and simpler morphology. We then consolidate the state-of-the-art by reviewing 30 benchmark corpora, totaling over 4 million text samples, and 44 empirical studies, including outcomes from six major shared tasks. Detection approaches are grouped into five broad classes: classical machine learning, deep learning, transformer-based architectures, commercial AI detectors, and statistical tools. Transformer models achieve near-perfect accuracy (≈ 100%), while human evaluators peak at ≈ 77 % accuracy. This survey also highlights common evaluation setups and the core performance measures used to assess model effectiveness. Future MGT detectors can become truly fair, scalable, and effective across languages and domains by expanding corpus diversity and innovating resource-efficient, adversarially robust methods.

Original languageEnglish (US)
Pages (from-to)34113-34136
Number of pages24
JournalIEEE Access
Volume14
DOIs
StatePublished - 2026

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

All Science Journal Classification (ASJC) codes

  • General Computer Science
  • General Materials Science
  • General Engineering

Fingerprint

Dive into the research topics of 'Human or Machine? A Survey on Machine-Generated Text Detection'. Together they form a unique fingerprint.

Cite this