Ofqual's A-level algorithm: why did it fail to make the grade?

There is a lot we can learn from the algebraic symbols used to determine results in England

Pkj = (1-rj)Ckj + rj(Ckj + qkj - pkj)

For such a short string of algebraic symbols, there is a lot we can learn from Ofqual’s grading algorithm (though really it is an equation) – and a lot we can learn about what went wrong.

First and most obviously, the size of the algorithm is an issue. With just four distinct terms – Ckj, qkj, pkj and rj – it shows the sparseness of the inputs. This is not a “big data” solution, gathering every possible piece of information about a student in an attempt to gain a full view of their capability. In fact, it is the opposite: using the smallest possible amount of information, so it can be rapidly gathered and easily standardised.

So what are those terms? The first are three various distributions of grades, k, at schools, j. Ckj is simple enough: it is the historical grade distribution at the school over the last three years, 2017-19.

That tells us already that the history of the school is very important to Ofqual. The grades other pupils got in previous years is a huge determinant to the grades this year’s pupils were given in 2020. The regulator argues this is a plausible assumption but for many students it is also an intrinsically unfair one: the grades they are given are decided by the ability of pupils they may have never met.

qkj is where the pupils’ own ability comes in. That is the predicted grade distribution based on the class’s prior attainment at GCSEs. A class with mostly 9s (the top grade) at GCSE will get a lot of predicted A*s; a class with mostly 1s at GCSEs will get a lot of predicted Us.

pkj is the predicted grade distribution of the previous years, based on their GCSEs. You need to know that because, if previous years were predicted to do poorly and did well, then this year might do the same; and, again, vice versa.

The final term, rj, is different: it is not about grades at all, hence the absence of the k. Instead, it is about how many pupils in the class actually have historical data available. If you can perfectly track down every GCSE result, then it is 1; if you cannot track down any, it is 0.

Finally, we can put the terms together. First, the equation is in two halves, one multiplied by that rj term, and one multiplied by one minus rj, meaning the higher rj is, the lower 1-rj will be. What that says is: “If we don’t know about this group’s GCSE grades, ignore the right half of this equation, and just base everything on last year’s A-levels; to the extent that we do know about their GCSE grades, use that information as well.”

Gavin Williamson, 12 August, to ITV

“[I have] every confidence that the system we have put in place is a robust system, a system that’s fair”

Gavin Williamson, 12 August, to the BBC

“The system, for the overwhelming majority of young people, is going to deliver credible, strong results. It’s a robust system, it’s a fair system, it’s making sure that young people get the grades that they’ve worked so hard towards”

Nick Gibb, 12 August, to Sky News

“Most young people … will get the grade that the teacher sent in to the exam board that they thought they would get.”

Gavin Williamson, 13 August, to Sky News

Q) “Can you give a cast-iron guarantee that you will not be forced into the embarrassing U-turn that John Swinney and Nicola Sturgeon were in Scotland?”

A) "Absolutely"

Boris Johnson, 13 August, to reporters in Northern Ireland

“Let’s be in no doubt about it, the exam results that we’ve got today are robust. They’re good, they’re dependable for employers. It’s very important that for years to come people should be able to look at these grades and think these are robust, these are dependable”

Gavin Williamson, 15 August, interview to the Times

“This is it… No U-turn, no change… [In Scotland] you’ve got a system where there aren’t any controls, you’ve got rampant grade inflation. There’s been no checks and balances in that system; it degrades every single grade as a result and in-baked unfairness” 

The left half, which only gets used if we do not know the GCSE data, is that simple: “Just use the historical A-level results.” And then the right half says: “Use the historical A-level results, but add to them the predictions from this year’s GCSE results, then downgrade them based on how good the last lot of predictions were.” That means a school that regularly gets good A-level results despite having bad GCSEs will get a boost.

Aggregating all those terms together gives us Pkj, the predicted grades for the school.

Even in this short equation, we can see the seeds of a fiasco: prior attainment based exclusively on GCSE results; historical grades stretching back just three years; and a refusal to allow the actual success of the pupil to overrule the situation.

In a better system, perhaps the rest of the process could have ironed out these flaws, but in reality they made them worse.

The decision to give small classes the ability to receive their teachers’ recommended grades is not in the algorithm but led to a boost for elite private schools.

The choice to take the results of the algorithm and further tweak the grade boundaries to prevent overall grade inflation is not in the algorithm but further depressed the larger classes in favour of the smaller.

And the choice to focus, not on determining individual grades, but on determining a distribution for a class which students were then matched to on the basis of their rank in the class, is not an error in the algorithm but a fundamental misunderstanding of what the goal was.

• This article was amended on 24 August 2020. The terms Ckj, qkj, pkj and rj are distinct, but not unique as described in an earlier version; a reference to pkj was corrected to Pkj; and the description of 1-rj in relation to rj was clarified.

Contributor

Alex Hern

The GuardianTramp

Related Content

Article image
'I feel utterly let down': students on Ofqual's review of A-level appeal criteria
Exam regulator suspended its policy for challenging grades hours after it was published

Mattha Busby

16, Aug, 2020 @1:25 PM

Article image
GCSEs and A-levels likely to be partly assessed by cut-down versions of exams
Education secretary to look into use of ‘externally set tasks’ to help teachers in England assess final grades

Richard Adams Education editor

13, Jan, 2021 @4:43 PM

Article image
A-level and GCSE grade inflation ‘inevitable in English system’
School leaders warn this year’s results likely to see inflation similar to last year despite efforts to make system fairer

Sally Weale and Richard Adams

09, May, 2021 @10:31 AM

Article image
England exams row timeline: was Ofqual warned of algorithm bias?
Scrapping followed series of warnings over algorithm’s volatility and fairness

Ben Quinn and Richard Adams

20, Aug, 2020 @6:53 PM

Article image
Ofqual 'blindsided' government by revoking A-level appeals process
Move by regulator could leave thousands of students without university places

Richard Adams and Heather Stewart

16, Aug, 2020 @12:39 PM

Article image
Ofqual chief to face MPs over exams fiasco and botched algorithm grading
Sally Collier to appear before education select committee over A-levels and GCSE results debacle

Heather Stewart, Sally Weale and Kate Proctor

20, Aug, 2020 @8:14 PM

Article image
Gavin Williamson under pressure to resign over A-level results 'fiasco'
Lib Dems say minister is ‘out of excuses’ and Labour calls for results system to be scrapped

Richard Adams and Heather Stewart

14, Aug, 2020 @10:33 AM

Article image
Ofqual ignored exams warning a month ago amid ministers' pressure
Exclusive: Agency said to have been told of risk of erratic outcomes, but was under government pressure on grade inflation

Richard Adams, Jessica Elgot, Heather Stewart and Kate Proctor

19, Aug, 2020 @7:18 PM

Article image
Algorithm method was bound to fail for Ofqual | Letters
Letters: In statistics all predictions should come with a health warning, writes Emeritus Professor Graham Hall, while Dr Peter WH Smith points out that the human factor was missing and Prof John Gray that one cannot use one year’s results to predict the next. Plus letters from John Bailey and Tony Leather

Letters

23, Aug, 2020 @5:12 PM

Article image
Gavin Williamson seeks to blame Ofqual for exams debacle
Education secretary forced into humiliating reversal on A-level and GCSE grades in England

Heather StewartKate Proctor and Sally Weale

17, Aug, 2020 @8:01 PM