Skip to main content

ArXiv Cracks Down: One-Year Ban for Fully AI-Written Submissions

ArXiv, the widely utilized open repository for pre-print academic research, is intensifying its efforts to curb the negligent application of large lan

2 min read18 views5 tags
Originally reported bytechcrunch

ArXiv, the widely utilized open repository for pre-print academic research, is intensifying its efforts to curb the negligent application of large language models (LLMs) within scientific papers.

Despite papers being posted to the site prior to peer review, arXiv (pronounced "archive") has evolved into a primary conduit for the dissemination of research in disciplines such as computer science and mathematics. Furthermore, the platform itself has become a valuable source for tracking emerging trends in scientific inquiry.

The organization has already implemented measures to counter the increasing influx of low-quality, AI-generated submissions, including a requirement for first-time posters to secure an endorsement from an established author. Following over two decades under Cornell's stewardship, arXiv is transitioning into an independent nonprofit entity, a move expected to facilitate greater fundraising to tackle challenges like the proliferation of AI-generated content of subpar quality.

In its most recent announcement, Thomas Dietterich, who chairs arXiv's computer science section, posted on Thursday that “if a submission contains incontrovertible evidence that the authors did not check the results of LLM generation, this means we can’t trust anything in the paper.”

Dietterich elaborated that such "incontrovertible evidence" could manifest as "hallucinated references" or direct comments exchanged with an LLM. Should such evidence be uncovered, the authors of the paper will face a severe penalty: “a 1-year ban from arXiv followed by the requirement that subsequent arXiv submissions must first be accepted by a reputable peer-reviewed venue.”

It is crucial to note that this policy does not constitute an outright prohibition on the use of LLMs. Instead, it underscores the imperative for authors to assume “full responsibility” for their content, “irrespective of how the contents are generated,” as articulated by Dietterich. Consequently, researchers remain accountable if they copy-paste "inappropriate language, plagiarized content, biased content, errors, mistakes, incorrect references, or misleading content" directly from an LLM.

Speaking to 404 Media, Dietterich confirmed this would operate as a "one-strike" rule. However, safeguards are in place: moderators must first flag the issue, and section chairs are required to confirm the evidence before any penalty is imposed. Authors will also retain the right to appeal the decision.

This development comes amidst recent peer-reviewed research indicating a rise in fabricated citations within biomedical studies, a phenomenon largely attributed to LLMs. It is worth noting, however, that the issue of AI-generated, made-up citations extends beyond the scientific community.

#AI News#ArXiv#LLMs#AI policy#Author responsibility
ES
Editorial StaffEditor

The Editorial Staff at AIChief is a team of professional content writers with extensive experience in AI and marketing. Founded in 2025, AIChief has quickly grown into the largest free AI resource hub in the industry.

View all posts
Reader feedback

What did you think of this story?

User Comments

Filter:
No comments yet. Be the first to comment!
Continue reading
View all news