Ask Dave Taylor
  • Facebook
  • Instagram
  • Linkedin
  • Pinterest
  • Twitter
  • YouTube
  • Home
  • YouTube Videos
  • Top Categories
  • Subscribe via Email
  • Ask A Question
  • Meet Dave
  • Home
  • Computer & Internet Basics
  • Can AI-Generated ChatGPT Text Be Accurately Identified?

Can AI-Generated ChatGPT Text Be Accurately Identified?

January 28, 2023 / Dave Taylor / Computer & Internet Basics / No Comments

Hi Dave, I’m a high school teacher and am curious about all of the AI writing tools available now. Is there a way to identify when text is written by a program rather than a person? If so, how accurate is it today?

While machine learning-based AI tools have been around for a few years, the beginning of 2023 has been all about OpenAI and its ChatGPT tool. And no wonder; you can ask it to produce just about any sort of text content and within a few seconds you’re getting something that’s not bad. It’s not brilliant, but how many song lyrics, poems, blog posts, article comments, or student papers are brilliant?

Then again, teachers aren’t looking for that needle in a haystack, we’re just trying to help people learn something new and expand their horizons and expertise. A task that is made quite a bit more difficult if we give them written tasks and instead of writing them, they turn to software or Web sites that can produce the content instead. A click and copy/paste versus the critical thinking required to produce something thoughtful and on-topic? Unfortunately, there’ll always be a couple of lazy students who will seek shortcuts for whatever reason.

DOES USING AI DIFFER FROM PLAGIARIZING?

At some level, this is no different from plagiarism. Prior to the Internet, plagiarism referred to students copying out of a book or a prior student’s assignment. In the digital age, there are dozens of Web sites that offer “only A papers” on any of thousands of topics, from Shakespeare to Organic Chemistry. The duplicated writing is generally identified by testing multi-word phrases. The plagiarism tests from companies like TurnItIn, for example, are pretty solid in this regard. But AI tools like ChatGPT produce unique content every single time they’re invoked, so how can they be detected?

Turns out that the current measure is perplexity. The technical definition of this measure is “a metric that quantifies how uncertain a model is about the predictions it makes” but that doesn’t really clarify what’s being calculated, does it? Here’s another explanation of perplexity: “if a [language] model assigns a high probability to the test set, it means that it is not surprised to see it (it’s not perplexed by it)…”

For our purposes, though, we can consider perplexity as a common language-based analysis tool to try and ascertain whether a specific prose passage is likely produced by a human or an AI. High perplexity means it’s likely AI-generated, low perplexity means it’s likely written by a human. The good news is that there are already online tools that offer just this analysis. Let’s consider both GPT Zero and GPT Radar.

PRODUCING ACADEMIC PROSE WITH CHATGPT

As a benchmark, I’m going to use a discussion prompt from a course I’m teaching at the University of Denver. The class is about media and society and this week’s theme happens to be quite appropriate: Do Machines Control Us?

Here’s the prompt:

chatgpt test for plagiarism ai - college writing assignment prompt

As a test, I’m going to simply copy and paste this into ChatGPT, the interactive front-end to the Open AI “GPT3” language model system. You can try this too, at chat.openai.com (though you might need to sign up for a free account first).

chatgpt test for plagiarism ai - content produced on chatgpt gpt3

Here’s the exact response from GPT3 to the discussion prompt:

It is a complex question and depends on the perspective one takes. From a technological determinism perspective, it could be argued that media technologies have significantly changed the way we think and interact with the world. They have revolutionized the way we communicate, access information, and make decisions. The constant availability of information and the ability to connect with people from all over the globe has reshaped our understanding of time and space.

On the other hand, from a social constructionism perspective, it could be argued that while technology has certainly influenced our behavior, it is ultimately shaped by the social and cultural context in which it is used. It is important to remember that technology is created, used, and interpreted by human beings, and it is human choices that determine how technology is used.

The critical thinking demonstrated here is weak, with a lot of words covering up a lack of actual analysis, but it’s not bad and it’s believable that a student could submit something like this as a response to the discussion prompt.

The question isn’t so much about whether a close read and consideration can detect this AI-produced content, however, but whether there are tools that can do so. You can certainly imagine if this is one response from a class with 50 students, the teacher’s ability to slow down and really contemplate the prose is going to be very limited.

FIRST TEST: GPT ZERO

The first tool to consider is one initially created in a weekend by Edward Tian, a computer science undergrad at Princeton University: GPTZero. It is based on the perplexity measure of language analysis, as discussed earlier. The test is easy to perform, a simple paste from ChatGPT:

chatgpt test for plagiarism ai - gptzero gpt3 test

You can upload files to analyze too – particularly helpful for longer class assignments – but it’s straightforward to copy and paste the modest 136-word passage.

A click on “Get Results” and the verdict is delivered:

chatgpt test for plagiarism ai - gptzero says AI generated

Okay, “Your text is likely to be written entirely by AI”. Case closed? Not so fast.

SECOND TEST: GPT RADAR

Before we conclude the AI prose is easily identified, let’s try another tool that’s been around a bit longer: GPT Radar. It’s a tool that content production teams utilize when delivering blog posts and other sponsored content for clients, but it’s illustrative for our purposes too.

chatgpt test for plagiarism ai - gpt radar

Since perplexity is a mathematical analysis of text, the result should be the same, right? A click on “Analyze” shows otherwise:

chatgpt test for plagiarism ai - gpt radar analysis

GPTZero reports a perplexity score of 18.33, while GPT Radar produces a 6.0. The lower the score, the less “surprised” the algorithm is about word choice in the passage and the more likely it’s written by a human (since we all tend to write in rather similar ways), but as is obvious, it’s not entirely deterministic.

ANALYSIS RESULTS: YES, AND NO

The results demonstrate the complexity of the problem; one tool reports that our stilted, awkwardly written prose is almost certainly written by an AI program, while the other tool insists it’s “likely human generated”. The obvious conclusion is that online tools aren’t quite ready to accurately identify AI produced text. This is concerning for both us as educators and all of us as citizens and consumers of information.

Perhaps more importantly, neither tool offers any analysis of whether the response actually answers the prompts and offers up an intelligent commentary and response. That’s the job of us instructors, and it’s a tough task. With a small class, the teacher can track writing across assignments (if a student has an intro written at a 7th-grade level, but their assignments are grad school level work, that’s an obvious and immediate red flag).  But what if you do have dozens or hundreds of students?

There is no easy solution today. The best advice I can offer is to understand the limitations of these tools and realize that even as they seek to be more accurate, the AI language models will become more sophisticated, causing a technological cat-and-mouse game. Challenge students whose prose seems improbable or surprising.

The real conclusion, however, is that we’re going to have to change our approach to teaching so that in-person, non-technologically-assisted recitation becomes a part of student evaluation and assessment at any grade level.

Have thoughts and ideas on the subject? Please let me know in the comments!

About the Author: Dave Taylor has been involved with the online world since the early days of the Internet. Author of over 20 technical books, he runs the popular AskDaveTaylor.com tech help site. You can also find his gadget reviews on YouTube and chat with him on Twitter as @DaveTaylor.

Let’s Stay In Touch!

Never miss a single article, review or tutorial here on AskDaveTaylor, sign up for my fun weekly newsletter!
Name: 
Your email address:*
Please enter all required fields
Correct invalid entries
No spam, ever. Promise. Powered by FeedBlitz
Please choose a color:
Starbucks coffee cup I do have a lot to say, and questions of my own for that matter, but first I'd like to say thank you, Dave, for all your helpful information by buying you a cup of coffee!
ai writing, chatgpt, gptradar, open ai

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Search

Recent Posts

  • Everything You Need to Know about Apple’s Clean Energy Charging
  • How Can I Watch Free Classic Movies on my Windows PC?
  • How Can I Maximize Online Privacy with a VPN Connection?
  • SCAM: Did I Just Buy A Computer From Amazon? I Demand a Refund!
  • How Can I Stop My AirPods Pro Making Beeps and Noises?

On Our YouTube Channel

iClever Bluetooth 34-Key Number Pad Keyboard -- REVIEW

Google Pixel 7 Pro Android Smartphone - UNBOXING

Categories

  • AdSense, AdWords, and PPC Help (106)
  • Amazon, eBay, and Online Shopping Help (164)
  • Android Help (228)
  • Apple iPad Help (147)
  • Apple Watch Help (53)
  • Articles, Tutorials, and Reviews (346)
  • Auto Tech Help (17)
  • Business Advice (200)
  • ChromeOS Help (33)
  • Computer & Internet Basics (782)
  • d) None of the Above (166)
  • Facebook Help (384)
  • Google, Chrome & Gmail Help (188)
  • HTML & Web Page Design (247)
  • Instagram Help (49)
  • iPhone & iOS Help (625)
  • iPod & MP3 Player Help (173)
  • Kindle & Nook Help (99)
  • LinkedIn Help (88)
  • Linux Help (174)
  • Linux Shell Script Programming (90)
  • Mac & MacOS Help (914)
  • Most Popular (16)
  • Outlook & Office 365 Help (33)
  • PayPal Help (68)
  • Pinterest Help (54)
  • Reddit Help (19)
  • SEO & Marketing (82)
  • Spam, Scams & Security (96)
  • Trade Show News & Updates (23)
  • Twitter Help (222)
  • Video Game Tips (66)
  • Web Site Traffic Tips (62)
  • Windows PC Help (951)
  • Wordpress Help (206)
  • Writing and Publishing (72)
  • YouTube Help (47)
  • YouTube Video Reviews (159)
  • Zoom, Skype & Video Chat Help (62)

Archives

Social Connections:

Ask Dave Taylor


Follow Me on Pinterest
Follow me on Twitter
Follow me on LinkedIn
Follow me on Instagram


AskDaveTaylor on Facebook



microsoft insider mvp


This web site is for the purpose of disseminating information for educational purposes, free of charge, for the benefit of all visitors. We take great care to provide quality information. However, we do not guarantee, and accept no legal liability whatsoever arising from or connected to, the accuracy, reliability, currency or completeness of any material contained on this site or on any linked site. Further, please note that by submitting a question or comment you're agreeing to our terms of service, which are: you relinquish any subsequent rights of ownership to your material by submitting it on this site. Our lawyer says "Thanks for your cooperation."
© 2023 by Dave Taylor. "Ask Dave Taylor®" is a registered trademark of Intuitive Systems, LLC.
Privacy Policy - Terms and Conditions - Accessibility Policy