You are not logged in. Your edit will be placed in a queue until it is peer reviewed.
We welcome edits that make the post easier to understand and more valuable for readers. Because community members review edits, please try to make the post substantially better than how you found it, for example, by fixing grammar or adding additional resources and hyperlinks.
Required fields*
- 14If you want to add an additional level of realism, first, lay them in front of a subject-matter expert (to substitute the flagger) to identify possible use of ChatGPT. Then, give those results to a mod to do a second check. I'd imagine your approach would still lead to vastly more incorrect suspensions than what's actually happening, if I flag for the tags I follow I'm 100% confident it's not a human.Erik A– Erik A2023-06-07 22:02:20 +00:00Commented Jun 7, 2023 at 22:02
- 2I second the non-moderator test. I'm pretty sure there are many users who have seen enough GPT answers to have a fairly low error rate in detecting them. I would be quite curious to see how well I do.Esther– Esther2023-06-07 22:12:17 +00:00Commented Jun 7, 2023 at 22:12
- 4@ErikA To be honest, I'm not a SME in most of the answers I'm identifying as coming from ChatGPT. I've just seen thousands of them personally, and I feel I'm fairly good at pattern recognition (obviously a biased statement, but I'm willing to put it to the test). And when I saw thousands, today I just added my 2,000th answer to my "ChatGPT Output" "Saves" list on SO. I have another 144 on a similar list on Ask Ubuntu. But I absolutely agree that for code answers, you must be a SME to identify the issues.NotTheDr01ds– NotTheDr01ds2023-06-07 22:26:13 +00:00Commented Jun 7, 2023 at 22:26
- 5Even if we may misidentify some posts as AI-generated, the damage of allowing them is much larger than the damage of removing them. It's better to lower the punishment, and start with just a warning, and put the user on a watchlist for future offenses, which times out.user357051– user3570512023-06-08 00:14:51 +00:00Commented Jun 8, 2023 at 0:14
- 1A more realistic scenario would be to have groups of three-ish answers from the same user, some groups from real users and some from ChatGPT. Then ask to differentiate which are which.Ryan M– Ryan M2023-06-08 04:27:24 +00:00Commented Jun 8, 2023 at 4:27
- 1A better test would have a small fraction of the 100 posts be ChstGPT-generated, and the mod should not know how many there actually are.Cris Luengo– Cris Luengo2023-06-08 15:19:04 +00:00Commented Jun 8, 2023 at 15:19
- I proposed a similar test (in moderator-only space), however using pre-GPT answers that score highly on Huggingface (or whatever) vs. GPT-generated answers. This makes the test considerably more difficult of course, but otherwise you have the problem that the vast majority of human answers do not look GPT-generated at all and thus the GPT answers stand out. This wouldn’t reflect a collection of answers flagged for being machine-generated in practical usage.Wrzlprmft– Wrzlprmft2023-06-08 16:16:24 +00:00Commented Jun 8, 2023 at 16:16
- All good scenarios as well. Just wondering why nothing like this seems to have been done since SE claims this is an "evidence-backed" assumption.NotTheDr01ds– NotTheDr01ds2023-06-08 18:20:43 +00:00Commented Jun 8, 2023 at 18:20
- 1Unfortunately, most of these scenarios appear to overlook several of the input metrics that various mods have said they take into account (at least in anything but the most glaringly obvious cases) — things like prior posting history and style, just to pick one example (but by no means the only one).Joel Aelwyn– Joel Aelwyn2023-06-09 01:22:09 +00:00Commented Jun 9, 2023 at 1:22
- ^That. I for one rely heavily on user history to find GPT plagiarists. It starts by smelling something fishy with the post, and ends in confirmation with user history.Passer By– Passer By2023-06-09 09:45:05 +00:00Commented Jun 9, 2023 at 9:45
- 1@JoelAelwyn Absolutely, the chances of us finding GPT usage are even higher when we have more data to go on. The point here is simply that some of us do feel that we can identify at a very high rate just based on the unedited GPT output.NotTheDr01ds– NotTheDr01ds2023-06-09 10:43:37 +00:00Commented Jun 9, 2023 at 10:43
- @NotTheDr01ds Fair enough, just wanted to make sure it wasn't being overlooked.Joel Aelwyn– Joel Aelwyn2023-06-09 16:22:28 +00:00Commented Jun 9, 2023 at 16:22
Add a comment |
How to Edit
- Correct minor typos or mistakes
- Clarify meaning without changing it
- Add related resources or links
- Always respect the author’s intent
- Don’t use edits to reply to the author
How to Format
- create code fences with backticks ` or tildes ~ ```
like so
``` - add language identifier to highlight code ```python
def function(foo):
print(foo)
``` - put returns between paragraphs
- for linebreak add 2 spaces at end
- _italic_ or **bold**
- indent code by 4 spaces
- backtick escapes
`like _so_` - quote by placing > at start of line
- to make links (use https whenever possible) <https://example.com>[example](https://example.com)<a href="https://example.com">example</a>
How to Tag
A tag is a keyword or label that categorizes your question with other, similar questions. Choose one or more (up to 5) tags that will help answerers to find and interpret your question.
- complete the sentence: my question is about...
- use tags that describe things or concepts that are essential, not incidental to your question
- favor using existing popular tags
- read the descriptions that appear below the tag
If your question is primarily about a topic for which you can't find a tag:
- combine multiple words into single-words with hyphens (e.g. stack-overflow), up to a maximum of 35 characters
- creating new tags is a privilege; if you can't yet create a tag you need, then post this question without it, then ask the community to create it for you