Law professors rate AI answers higher than peer responses in study
A new study found that U.S. law professors rated answers from large language models (LLMs) higher than those written by their peers in a blinded evaluation of short-answer tutoring in contracts courses. Sixteen law professors created 40 representative questions, wrote answers, and judged 2,918 anonymized comparisons between human and LLM responses. The professors gave LLMs an average win rate of 75.33%, with models performing similarly to the best instructor. LLM responses were flagged as harmful only 3.53% of the time, compared to 12.06% for professor-written answers. The study, by Alejandro Salinas and colleagues, was reported by Marginal Revolution and cited by Andrew Curran and John Chamberlain. The article also notes a separate study in the Journal of Economic Literature finding that AI tools can mass-produce academic finance papers nearly indistinguishable from human research.
What’s reported
Key figures
Sources: marginalrevolution.com
