Retrieved January 15, 2023. The human raters are usually not specialists in the topic, so they tend to choose text that looks convincing. They would pick up on many symptoms of hallucination, but not all. The accuracy errors that creep in are difficult to catch. I've tried using ChatGPT for almost everything from perform-ass