We've seen this sort of result (16-page PDF) before in other studies, but it does no harm to reiterate it. "Our results show that human graders in our study can not agree on the grade to give a piece of student work and are often individually inconsistent, suggesting that the idea of a 'gold standard' of human grading might be flawed, and highlights that a shared rubric alone is not enough to ensure consistency." At a certain point, students will demand AI assessment in order to ensure consistency and fairness.