AI Hiring Bias: Why LLMs Prefer Their Own Resumes

Research shows that LLMs used in recruitment favor resumes generated by the same model, creating a significant 'self-preference bias.' This bias can increase a candidate's shortlisting chances by up to 60% compared to human-written resumes. Fortunately, the study demonstrates that this effect can be mitigated by more than half through simple technical interventions.

Key Points

LLMs demonstrate a systematic self-preference bias, favoring their own generated content over human or alternative AI outputs.
In hiring simulations, candidates using the same AI as the recruiter saw a 23% to 60% increase in shortlisting probability.
The bias is most pronounced in business-related fields like sales and accounting compared to other occupations.
Self-preference bias can be significantly reduced by over 50% using simple interventions that target the model's self-recognition capabilities.
Current AI fairness frameworks are insufficient as they focus on demographics rather than biases emerging from AI-AI interactions.

Sentiment

The community views the phenomenon as largely unsurprising and intuitive — LLMs favoring their own output aligns with common technical understanding. However, there is significant concern about the practical implications for hiring fairness and the self-reinforcing dynamics this creates. The study's methodology drew substantial criticism, with many feeling the experimental design was too narrow to support the broad claims made. Overall sentiment is cautiously concerned about the real-world trend while skeptical of this particular paper's rigor.

In Agreement

LLMs generate text that aligns with their own training biases, so self-preference is a natural and expected consequence of using the same model for both generation and evaluation
Multiple commenters shared real-world experiences of significantly improved job search results after having LLMs rewrite their resumes, supporting the paper's core finding
The self-reinforcing feedback loop — employers using LLMs to filter, candidates using LLMs to write — creates a problematic dynamic where human-written resumes are disadvantaged
Using the same LLM for both code generation and review (marking your own homework) produces biased results, and this principle applies equally to resume screening
AI as a hiring gatekeeper introduces poorly understood biases that could exacerbate inequality, especially if premium model access becomes a factor

Opposed

The study only tested LLM-rewritten executive summaries evaluated in isolation, not full resumes — massively overstating any real-world impact on hiring decisions
The paper's abstract misrepresents its findings by claiming LLMs prefer resumes they generated, when they actually only measured preference for executive summaries
This is analogous to preferring computer-printed resumes over typewritten ones — a temporary friction that will resolve as LLMs become commoditized
The phenomenon is obvious and well-documented in existing LLM-as-judge literature, making this study largely redundant
The real-world prevalence of LLM-based resume screening in HR departments is unclear and possibly overstated — many organizations still rely heavily on human reviewers