Bitget App
Trade smarter
Buy cryptoMarketsTradeFuturesEarnSquareMore
The most reliable resource for identifying AI-generated text is Wikipedia

The most reliable resource for identifying AI-generated text is Wikipedia

Bitget-RWA2025/11/20 18:45
By:Bitget-RWA

Many of us have experienced that nagging feeling that a piece of text was generated by a language model, yet it's surprisingly challenging to confirm. For a period last year, there was a widespread belief that certain words like “delve” or “underscore” were clear indicators of AI authorship, but there’s little solid proof, and as these models have advanced, such obvious clues have become much less apparent.

Interestingly, Wikipedia editors have become quite adept at spotting writing produced by AI, and their publicly available “Signs of AI writing” guide is the most useful tool I’ve encountered for verifying those suspicions. (Thanks to poet Jameson Fitzpatrick for highlighting this resource on X.)

Since 2023, Wikipedia’s editors have been tackling the issue of AI-generated content through an initiative called Project AI Cleanup. With millions of daily edits to review, they have ample examples to study, and true to Wikipedia’s tradition, the team has assembled a comprehensive and evidence-based field guide.

The guide starts by reaffirming what many already suspect: automated detection tools are largely ineffective. Instead, it highlights certain writing patterns and expressions that are uncommon on Wikipedia but frequently found elsewhere online (and thus, prevalent in AI training data). The guide notes that AI-generated entries often go out of their way to stress the importance of a topic, typically using broad phrases like “a pivotal moment” or “a broader movement.” These models also tend to list minor media mentions to make a subject appear more significant—behavior more typical of a personal profile than an impartial source.

One notable pattern the guide points out is the use of trailing clauses with vague assertions of significance. AI models might claim that an event is “emphasizing the significance” of something or “reflecting the continued relevance” of a concept. (Grammar enthusiasts will recognize this as the use of present participles.) While this can be subtle, once you know to look for it, it becomes much easier to spot.

Another common feature is the use of generic, promotional language that’s widespread online. Descriptions are often overly positive—landscapes are always beautiful, views are always stunning, and everything is described as spotless and up-to-date. As the editors describe it, “it reads more like a script from a commercial.”

The entire guide is well worth reading, and I found it quite insightful. Previously, I would have argued that LLM-generated writing was evolving too rapidly to reliably identify. However, the tendencies highlighted here are deeply rooted in how AI models are built and used. While these habits can be masked, eliminating them entirely will be difficult. If the public becomes more skilled at recognizing AI-generated text, it could lead to some fascinating changes.

0

Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.

PoolX: Earn new token airdrops
Lock your assets and earn 10%+ APR
Lock now!

You may also like

Zcash Latest Updates: Privacy-Focused Cryptocurrencies Rise While SEC Weighs Regulation and Technological Progress

- The SEC will host a December roundtable on privacy/financial surveillance, shifting 2026 exam priorities to fiduciary duty, custody, and data privacy. - Zcash (ZEC) surged 125% in 30 days as institutional investor Cypherpunk Technologies added $18M in ZEC, reflecting growing demand for privacy-centric crypto. - Regulatory tensions persist: DOJ jailed Samourai Wallet founder for mixer operations, while Tornado Cash sanctions were overturned, highlighting legal ambiguity. - SEC's focus on privacy aligns wi

Bitget-RWA2025/11/20 22:08
Zcash Latest Updates: Privacy-Focused Cryptocurrencies Rise While SEC Weighs Regulation and Technological Progress