Teacher or professor laughing with students

(© Robert Kneschke - stock.adobe.com)

Students Don’t Trust AI Feedback. Their Teachers May Have Even More Reason to Worry.

In A Nutshell

  • A new paper argues that relying on AI to generate feedback could gradually erode the professional judgment educators build through years of reading and responding to student work.
  • Students in one cited survey rated educator feedback as far more trustworthy than AI-generated feedback, and ranked it higher on quality, helpfulness, and clarity.
  • While AI can serve as a low-stakes space for students too intimidated to seek human feedback, it lacks the relational continuity that makes feedback meaningful over time.
  • Researchers warn that AI tools could widen existing achievement gaps, with students who already know how to act on feedback standing to benefit most.

Every time a professor reads a student’s essay and decides what to say, something more than grading is happening. A judgment is being sharpened, a relationship is being built, and a professional skill is being practiced. New research suggests the rise of AI may end up eroding this essential element of academia. As AI tools take over more of that work, educators may gradually lose the very abilities that make their feedback worth having.

A paper published in Assessment & Evaluation in Higher Education argues that the rush to automate academic feedback carries a hidden cost that seldom gets discussed. Seven researchers from universities in the UK, Australia, and Denmark warn that if colleges begin outsourcing feedback to generative AI, they may not be merely changing how students receive information. They may be quietly hollowing out the professional craft of the educators left behind.

It is a concern that cuts against the dominant narrative around AI in education, which tends to focus almost entirely on whether AI feedback is good enough to replace a professor’s comments. In one cited survey of Australian students, nine in ten rated educator feedback as somewhat or very trustworthy. For AI-generated feedback, that figure dropped to just six in ten, with students also ranking human feedback higher on quality, helpfulness, and clarity. But those numbers alone miss a deeper problem on the educators’ side of the exchange.

What AI Feedback Takes Away From Educators

Reading student work and crafting a thoughtful response is not a passive clerical task. One instructor studied by researchers adjusted her comments based on a student’s personal circumstances, prioritizing encouragement over correction. As one ethnographic study of marking practices found, educators’ “readings of texts were tangled up with ‘readings’ of students.” That attentiveness develops over years of practice, and it is exactly the kind of skill that quietly erodes when the work gets handed off.

When AI generates the first draft of feedback and educators simply review and approve it, that process of professional refinement gets cut short. Educators engage less deeply with student work, exercise less independent judgment, and see fewer of the patterns that make their responses genuinely useful. Over time, the authors warn, this produces what they describe as a gradual erosion of feedback expertise, driven not by laziness but by delegation. Educators may eventually start to see student work through AI’s framing rather than their own.

teachers AI
AI is changing how schools handle feedback, but researchers warn it may be quietly eroding teachers’ most important skill. (Credit: Frame Stock Footage on Shutterstock)

The Limits of AI Feedback

Much of what makes a professor’s feedback valuable is precisely what AI cannot access. Good judgment is built from years in a discipline, shaped by knowing individual students, and informed by context no algorithm can retrieve. A student struggling with foundational concepts needs very different feedback than one who is technically proficient but intellectually coasting. An experienced educator reads for both. AI tools cannot fully account for either.

Most commercial AI feedback tools are also built around an outdated model: feedback as a one-way transfer of information from provider to receiver. When a system generates comments without any ongoing relationship with the student, those comments risk becoming what one earlier scholar called “dangling data,” technically present but practically ignored. Even accurate AI feedback may go unused if students lack the trust or connection to act on it.

When AI Feedback Actually Helps

None of this amounts to a blanket case against AI in education. For students who find human feedback intimidating, one research team has described AI as a “pedagogical sandbox,” a lower-stakes space to test ideas and get reactions without the pressure of a professor’s judgment. That is especially relevant for students who feel too intimidated to approach their professors directly, and for whom a safe first step matters.

AI also holds real promise for tracking student development over time, building a sustained, broader view of how skills and thinking evolve across a course of study rather than delivering isolated comments on individual assignments. That kind of ongoing analysis is something most educators lack the bandwidth to provide alone, and it represents a genuinely smarter use of the technology than automating grading.

Still, students who interacted repeatedly with AI tools described something important missing. Each session started from scratch. A focus group study found that students noted “each interaction [with GenAI] feels like a new beginning,” contrasting it with human relationships that build “a continuous timeline that builds gradually over time, allowing for personalized guidance and support.” That continuity matters. It is what turns feedback into an ongoing conversation rather than a series of disconnected drops.

A Warning About Who Gets Left Behind

An equity concern threads through the paper as well. Students who already know how to evaluate and act on feedback will get more from AI tools than those who do not. For students without that foundation, AI feedback may be harder to interpret and easier to dismiss. The authors suggest this dynamic could widen long-standing achievement gaps, echoing what education researchers call a “Matthew Effect,” in which those already ahead pull further ahead.

Protecting against that outcome means treating feedback as a practice worth investing in, not a cost to be minimized. It means giving educators time to read student work closely, respond as full professionals, and develop the judgment that no approval queue can replace.

What is at stake, the paper argues, goes beyond whether AI feedback is accurate enough. It is whether the educators overseeing it will still know how to do better.


Paper Notes

Limitations

This is a theoretical and conceptual paper rather than an empirical study, meaning the authors do not present original data of their own collection. The paper synthesizes existing research literature to build an argument about care, craft, and feedback in higher education. The authors acknowledge that much remains unknown about how students engage with AI feedback in everyday settings and how those practices affect equity and attainment across different student groups. Most existing research has also focused on commercial generative AI systems rather than purpose-built educational chatbots, which may function quite differently in practice.

Funding and Disclosures

No potential conflict of interest was reported by the authors. No external funding source was identified in the paper.

Publication Details

Authors Naomi E. Winstone, Karen Gravett, Margaret Bearman, Christy Noble, Lasse X Jensen, Anna Jones, and Kelli Nicola-Richmond are affiliated with the University of Surrey, Deakin University, the University of Queensland, the University of Copenhagen, and King’s College London. Their paper, “The care-full craft of feedback in an age of generative AI,” was published online on March 18, 2026, in Assessment & Evaluation in Higher Education. DOI: 10.1080/02602938.2026.2643333

About StudyFinds Analysis

Called "brilliant," "fantastic," and "spot on" by scientists and researchers, our acclaimed StudyFinds Analysis articles are created using an exclusive AI-based model with complete human oversight by the StudyFinds Editorial Team. For these articles, we use an unparalleled LLM process across multiple systems to analyze entire journal papers, extract data, and create accurate, accessible content. Our writing and editing team proofreads and polishes each and every article before publishing. With recent studies showing that artificial intelligence can interpret scientific research as well as (or even better) than field experts and specialists, StudyFinds was among the earliest to adopt and test this technology before approving its widespread use on our site. We stand by our practice and continuously update our processes to ensure the very highest level of accuracy. Read our AI Policy (link below) for more information.

Our Editorial Process

StudyFinds publishes digestible, agenda-free, transparent research summaries that are intended to inform the reader as well as stir civil, educated debate. We do not agree nor disagree with any of the studies we post, rather, we encourage our readers to debate the veracity of the findings themselves. All articles published on StudyFinds are vetted by our editors prior to publication and include links back to the source or corresponding journal article, if possible.

Our Editorial Team

Steve Fink

Editor-in-Chief

John Anderer

Associate Editor

Leave a Comment