The UCAT Situational Judgement subtest is the only part of the admissions battery that does not test reasoning speed or pattern recognition in the abstract. Instead, it places the candidate inside a short clinical or workplace vignette and asks them to rank or choose between actions, judging which response is most appropriate, least appropriate, or somewhere in the middle. The exam is taken on a computer, in the same Pearson VUE digital format as the cognitive subtests, and it is the final of the four subtests candidates usually sit on test day. SJT results are reported as a band from 1 (highest) to 4 (lowest), and the score is used by a large majority of UK medical and dental schools as part of shortlisting, sometimes weighted against the cognitive scaled scores, sometimes used as a tie-breaker. For most candidates reading this, the difficulty is not the ethics itself — most applicants already have a sound moral compass. The difficulty is translating that compass into a four-level hierarchy, under time pressure, on items written deliberately to blur the boundary between adjacent bands. This article walks through the structure of SJT items, the language the test writers use to signal band separation, and the concrete habits that separate a band 1 from a band 2 candidate.
How the UCAT SJT item is actually constructed
An SJT scenario is short on purpose. The test gives you roughly 12 minutes to work through 66 items, which works out at about 10 to 11 seconds per question, and the test designers know that. Each scenario is built from three layers: a setting (a ward, a GP surgery, a tutorial group, a hospital corridor), a triggering event (a colleague makes a mistake, a patient asks an uncomfortable question, a relative pushes for information), and a set of possible responses that vary in appropriateness rather than in truth. You are not asked whether statement A is factually correct. You are asked where it sits on a four-point scale ranging from a very appropriate action to a very inappropriate one.
Two item formats appear. In the multiple-choice format, you are given one scenario and four possible actions, and you must pick the single most appropriate or the single most appropriate action. In the ranking format, you are given one scenario and five possible actions, and you must drag them into order from most to least appropriate. The distinction matters because ranking forces you to discriminate between options that the multiple-choice format would have allowed you to ignore. If you can rank, you can choose; the reverse is not always true. A candidate who has practised only the four-option items will find the five-option ranking items genuinely harder, and the exam contains both formats, so preparation should reflect that mix.
The scenarios themselves are drawn from a defined list of themes: professionalism, patient confidentiality, dealing with mistakes, consent, capacity, working within competence, interactions with colleagues, and the limits of student behaviour on placement. The setting is always plausibly medical, but the ethical substance is rarely about a clinical decision in the technical sense. It is about conduct. In practice this means the content of an SJT item is closer to a medical school fitness-to-practise panel than to a pharmacology viva. A good way to prepare, and the way I usually suggest to candidates in the early weeks, is to read each scenario and ask, before looking at the answer options, what the right action would be in plain English. Then look at the options and see how many of them look like that answer. Almost always, two of the four or five do, and the work is to pick the better of two plausible candidates.
The four bands: what band 1 actually looks like in writing
Many candidates know that the bands run from 1 to 4, but few can articulate the difference between the two highest bands in a way that survives contact with a difficult item. Band 1 represents behaviour that is unambiguously right and within the candidate's role to do, even if the action is not the easiest option. Band 2 represents behaviour that is appropriate but where a stronger or more direct action would have been better, or where the action is appropriate but slightly out of step with the candidate's actual role or competence. Band 3 is the mirror: behaviour that is inappropriate but where a worse response was possible. Band 4 is the worst available action in the context.
The boundary that trips candidates up is between band 1 and band 2. The test does not distinguish them by ethical direction. Both bands describe appropriate actions. The distinction is in the dimension of force. A band 1 response is one that confronts the issue, says what needs to be said, and either acts or escalates decisively. A band 2 response handles the issue by softer means: seeking advice, asking a more senior colleague, deferring the action. Both are right in the sense that neither is harmful, but the band 1 response is what a GMC-registered doctor would do without hesitation, and a band 2 response is what a particularly cautious or junior member of the team might do instead.
Consider a scenario where a junior doctor on your ward has just made a minor prescribing error. The patient is unharmed. What is the most appropriate action? The band 1 answer is to ensure the patient is safe, then speak to the junior doctor and report the error to the consultant in charge. The band 2 answer is to mention it quietly to the junior doctor, perhaps check the patient's notes yourself, but stop short of escalating to the consultant unless asked. The difference is small in real life. On the test, the difference is the boundary between a band 1 and a band 2. The test writers signal the difference with two vocabulary cues. First, band 1 options use verbs of action with a clear patient-impact clause: ensure, inform, escalate. Band 2 options use verbs of deferral: ask, discuss, check. Second, band 1 options name a specific role or destination: the consultant, the ward manager, the patient's named nurse. Band 2 options name a category of person without specifying: a senior colleague, a supervisor, the team. The generalisation is a deliberate test-writer signal that the action is being softened.
There is one more cue that I think is under-appreciated. Band 1 options frequently include a time reference, often implicit, that says the action should happen now. They mention raising the issue immediately, documenting the conversation today, checking on the patient before the end of the shift. Band 2 options frequently mention doing the thing later, at an appropriate moment, in a private setting, after the ward round. None of these cues are deterministic — a single cue can mislead you if the scenario is unusual — but when two or three appear in the same option, the band location is usually clear.
Why empathy can become a blind spot
The ethical content of SJT scenarios is not one-dimensional. The traditional teaching is that SJT is about professionalism, and professionalism is in turn about putting the patient first. Most candidates internalise that message correctly and arrive at the exam over-sensitised to patient welfare, which produces a specific kind of error. The error is what I would call an empathy over-pull: the candidate picks the most patient-centred response in the list without noticing that the response might also breach confidentiality, exceed the candidate's role, or commit the team to a course of action that should not be undertaken without consultation.
A common scenario structure tests exactly this. The patient is a 17-year-old who has just told you, an undergraduate on placement, that they are sexually active and does not want their parents to know. The most patient-centred action appears to be to reassure the patient of confidentiality and to offer advice. The band 1 answer, however, is to acknowledge the patient's concerns, explain the limits of confidentiality in age-appropriate language, and encourage the patient to speak to the GP or a qualified clinician. The candidate's role matters: an undergraduate on a two-week placement is not the right person to give sustained advice, and the test is not testing whether you are kind, it is testing whether you understand the limits of your role while remaining kind.
Here is the practical heuristic that helps. For every SJT item, before you read the options, complete the sentence: the most appropriate action is the one that _________. Fill the blank twice. The first time, write the patient-centred version. The second time, write the role-appropriate version. If the two answers match, you are looking at a band 1 option. If they diverge, the gap between them is exactly where the band 1 / band 2 line sits, and the right answer is whichever option best reconciles them. This is a small adjustment to reading the options, but for most candidates reading this it solves more items than any other technique I have seen.
The same logic applies to scenarios involving colleagues. A junior doctor is rude to a nurse; a senior colleague takes credit for your work; a peer on the course copies from your notes in an exam. The empathy-driven answer in each case is to act in defence of the person who has been wronged. The role-appropriate answer often requires an intermediate step: raising it with the colleague, speaking to a personal tutor, escalating through a formal channel. Both are correct, in spirit. The test wants the one that a doctor would actually do in a registered, employed, accountable role. Almost always, that means going through a channel rather than acting unilaterally.
Reading the stem: who is the actor, what is the relationship
Most candidates focus on the action, but the action only makes sense relative to the actor. The same sentence — "I would speak to the patient privately to discuss what they have told me" — is a different answer in two different scenarios. In one scenario, the actor is a fourth-year medical student. In the other, the actor is a foundation year 2 doctor. The medical student speaking privately with a patient is band 1, assuming the topic is appropriate to the student's role. The FY2 doctor having a private discussion with a patient who has just disclosed a safeguarding concern might be band 3, because the right action is to involve the safeguarding lead, not to have the conversation alone. The wording of the action is identical. The band location is different, and the only way to spot the difference is to read the stem for the actor's role and competence.
Three stem details tend to decide band location. First, the actor's training stage: medical student, foundation doctor, GP trainee, consultant. Second, the relationship between the actor and the other named person: line manager, peer, patient, relative. Third, the setting: in a tutorial, on a ward round, in a corridor, in a tutorial room with no other staff present. Each of these is a deliberate choice by the test writer, and each narrows the set of band 1 responses. The single most common error I see in classroom marking is candidates applying a generic "be professional" rule to a stem where the actor is in a tutorial with a peer, when the specific rule should have been "be collegial but recognise the limits of friendship". The test is granular because the test is reading your ability to handle granularity.
For ranking items, this matters even more. With five options to order, you cannot afford to treat any two as equivalent. If two options look identical in patient-centredness, look at the role language. The one that names a specific, in-role person is higher; the one that names a vague senior or external body is lower. If two options still look identical, look at the time cue. The action taken immediately is higher; the action deferred to a private or appropriate moment is lower, unless the deferral is itself what the scenario is testing for (in which case the cue flips). The system is consistent. It just needs to be read in order.
Time budgeting on the 12-minute subtest
The 66 items in 12 minutes figure sounds unworkable until you realise that almost half the items in any SJT are multiple-choice rather than ranking, and the multiple-choice items are quick. The structure is roughly 30 to 35 multiple-choice items and 30 to 35 ranking items, in a roughly alternating order. The multiple-choice items take 6 to 8 seconds each; the ranking items take 15 to 25 seconds each, depending on the spread of the options. The aggregate works out to about 11 seconds per item, which is the test designer's target.