The preprint platform will ban authors for a 12 months if “incontrovertible proof” of AI errors is discovered of their work.
gorsh13/iStock/Getty Pictures
A serious scientific repository’s resolution to ban authors whose work accommodates “hallucinated” references written by generative synthetic intelligence has been welcomed by analysis integrity campaigners regardless of considerations about how the coverage might be correctly enforced.

In a landmark transfer, the favored preprint platform arXiv has mentioned it should impose an instantaneous one-year ban if it finds “incontrovertible proof” that submissions include “inappropriate language, plagiarized content material, biased content material, errors, errors, incorrect references, or deceptive content material” written by massive language fashions.
“If a submission accommodates incontrovertible proof that the authors didn’t test the outcomes of LLM era, this implies we will’t belief something within the paper,” defined Thomas Dietterich, who chairs arXiv’s computing part, as he introduced the coverage on the social media platform X.
Examples of incontrovertible proof would come with “hallucinated references” and “meta-comments from the LLM,” continued Dietterich, who gave examples of a researcher failing to delete phrases corresponding to “here’s a 200 phrase abstract; would you want me to make any adjustments?” or “the info on this desk is illustrative, fill it in with the actual numbers out of your experiments.”
The one-year ban from arXiv will likely be “adopted by the requirement that subsequent arXiv submissions should first be accepted at a good peer-reviewed venue,” mentioned Dietterich, an emeritus professor at Oregon State College whose analysis focuses on machine studying and AI.
The brand new coverage is extensively seen as an try to stem the rising variety of AI-assisted submissions to the preprint server for arithmetic, physics and computing, which acquired greater than 30,000 submissions for the primary time in March—greater than double the 15,000 papers acquired in 2020 and 6 instances the 5,000 papers acquired in 2015.
Whereas some students have recommended that the one-year ban for a primary offense is just too harsh, analysis integrity campaigner Anna Abalkina, based mostly on the Free College of Berlin, welcomed the “countermeasure towards paper-mill submissions and low-quality manuscripts.”
“We’ve a number of guidelines but not so many enforcement mechanisms,” mentioned Abalkina on the decisive nature of the ban.
“I welcome this measure as a result of it’s aimed primarily at paper mills somewhat than legit authors. Anyway, authors nonetheless have an opportunity to attraction,” she added.
Abalkina’s personal analysis has proven an “improve in submissions to preprint servers that have been additionally made to extend citations,” she defined, with a latest preprint displaying that many papers submitted to scientific conferences had beforehand been supplied on the market by paper mills.
Reese Richardson, postdoctoral analysis fellow at Northwestern College’s Heart for Science of Science and Innovation who focuses on analysis integrity, mentioned the coverage change “undoubtedly follows” from the latest research by Zhenyue Zhao and others, together with arXiv founder Paul Ginsparg, which discovered that just about 150,000 “hallucinated” references have been current in papers posted on 4 preprints in 2025 alone.
Though he counseled the preprint’s efforts to “disincentivize submitting AI-generated bullshit,” Richardson questioned whether or not these punitive measures would work given the dimensions of doubtful submissions.
“Zhao et al. estimate that 1000’s of manuscripts containing hallucinated references will likely be posted on arXiv yearly. Does arXiv plan to use bans for all of those submissions? Issuing these bans requires arXiv workers to adjudicate every case in addition to reply to appeals, which I think about will wind up being fairly onerous, even when they solely prosecute a small fraction of the offending circumstances,” he mentioned.
“If arXiv solely selectively enforces this coverage, they could save sources on adjudication and enforcement. Nonetheless, if the danger of being punished is just too low even if you’re caught, there’s little incentive towards authors persevering with to submit content material with unverified references,” Richardson added.
There are related considerations about how arXiv workers will crack down on submissions with falsifications, plagiarism and nonsensical content material, mentioned Richardson. “Whereas we’d all wish to see loads much less bullshit, the identical considerations about scalability apply.”
The brand new coverage, nonetheless, additionally raised questions in regards to the evolving position of preprints inside scientific literature, he continued, noting that “arXiv is in a difficult state of affairs” provided that its acknowledged mission is “to share analysis papers and facilitate scientific discovery rapidly and freely” somewhat than test their high quality.
“Its assumed position within the data ecosystem is to host scientific works that haven’t undergone conventional gatekeeping procedures like formal peer overview and editorial acceptance for publication,” he defined.
“Nonetheless, in sure fields corresponding to pc science or AI, arXiv is arguably displacing the position historically occupied by aggressive retailers like journals and conferences, such that having a whole paper on arXiv can itself be a status-conferring accomplishment,” he mentioned.
“Regardless of now being firmly of their fourth decade round, preprint platforms like arXiv stay an ongoing experiment in scholarly communication. As this experiment continues, how verification and gatekeeping ought to issue into this mission will stay the topic of appreciable debate,” mentioned Richardson.
Source link
#Ban #Authors #Submit #Content #Unenforceable


