The "is this regurgitated?" detection is actually a tractable LLM task if you pose it right. Not "rate quality" (too subjective, LLMs give everything a 7) but: "list the 3 sentences in this post that could have appeared verbatim in 50 other posts this year." If that list is empty, the author thought...