The IELTS Speaking Part 2 long turn is the segment of the test where candidates receive a written cue card and are given exactly one minute to prepare before speaking for up to two minutes. Unlike other sections of the speaking test, this window provides candidates with a unique opportunity: the chance to make decisions about content, structure, and vocabulary before a single word is spoken aloud. Yet many candidates squander this preparation minute by writing fragments, rehearsing individual sentences, or simply staring at the card. Understanding how to use this preparation time strategically is one of the most underappreciated skills in IELTS Speaking preparation, and it has a direct bearing on performance across all four assessment criteria: fluency and coherence, lexical resource, grammatical range, and pronunciation.
What the one-minute preparation window actually offers
The IELTS Speaking test format allocates sixty seconds between the examiner handing over the cue card and the candidate beginning the long turn. During this interval, candidates may make written notes on a paper notepad provided by the examiner. The cue card itself presents a broad topic and typically four sub-points that the candidate is expected to address. The examiner does not interrupt or offer guidance during this phase; the clock simply runs. This design reflects a deliberate assessment philosophy: the test values the ability to organise thoughts under mild time pressure, a skill that mirrors real-world academic and professional communication scenarios.
The preparation window is not merely a courtesy. It is a structured component of the speaking assessment, and the decisions made during these sixty seconds shape the quality of the talk that follows. Candidates who approach the minute systematically consistently outperform those who approach it reactively. The distinction lies not in intelligence or language ability, but in strategy and rehearsal discipline.
The four-cue-point framework: understanding what the card is really asking
Every IELTS Speaking Part 2 cue card presents a general topic and four bullet points. These four points are not equal in importance, and treating them as a simple checklist to be ticked off sequentially is one of the most common strategic errors candidates make. A closer analysis reveals that the four points typically fall into two functional categories: mandatory content points and expandable content points.
The mandatory content points are the ones the examiner expects every candidate to address. These usually include the core elements of the description — what, who, when, where, or why — depending on the topic. Candidates who omit a mandatory point will be penalised under fluency and coherence, because the talk will appear incomplete or unstructured. The expandable points are the ones where the candidate has the greatest latitude to demonstrate lexical range and grammatical sophistication. These are the territories where Band 7 and above performance is built.
Before writing a single note, high-scoring candidates spend the first ten to fifteen seconds reading the card twice. The first reading identifies the mandatory points. The second reading identifies the expandable points and begins to generate personal vocabulary relevant to the topic. This cognitive triage — deciding which points need basic coverage and which points deserve elaboration — is the foundational decision of the entire preparation window.
Note-taking systems that support rather than constrain
The written notes taken during the one-minute preparation phase serve a specific purpose: they act as a cognitive scaffold during the talk, not as a script to be read from. Candidates who attempt to write full sentences or detailed paragraphs during the prep minute typically find themselves unable to finish their notes, unable to read their handwriting under pressure, or unable to maintain eye contact with the examiner. The note-taking system must therefore be fast, minimal, and personally meaningful.
One effective approach is the keyword-and-symbol system. Instead of writing full words, candidates note single keywords that trigger longer utterances. For example, if the topic is describing a memorable journey, the notes might read: "graduation trip → Japan → 14 days → temple Kyoto → street food → independence → lost map → asked local → laughed together → photo album still." Each keyword functions as a mental anchor that unlocks a sentence or short paragraph during the talk itself.
Another effective technique is the bullet-point expansion mapping. Under each of the four cue points, the candidate writes one or two keywords that correspond to the specific detail they will describe. A blank space is then allocated next to each bullet to represent the portion of the two minutes to be devoted to that point. This visual allocation — even if only approximate — prevents the common problem of spending two-thirds of the talk on the first point and running out of time for the remaining three.
A third approach, particularly useful for candidates targeting Band 7 and above, is the vocabulary pre-loading technique. During the second reading of the card, the candidate identifies two or three sophisticated lexical items relevant to the topic and jots them down. For instance, if the topic involves describing a person who inspired a career choice, the candidate might pre-load "catalyst for my professional trajectory" or "galvanised my ambition." This technique directly supports the lexical resource criterion by ensuring that advanced vocabulary is deployed consciously rather than discovered under pressure.
Structuring the two-minute talk: the introduction-body-conclusion model
Although the IELTS Speaking Part 2 does not require a formal essay structure, the most coherent and highly scored talks follow a recognisable three-part architecture: a brief orienting introduction, a developed body that addresses the cue points, and a natural closing statement. This structure is not an artificial imposition; it is a reflection of how natural spoken discourse functions in academic and professional contexts.
The orienting introduction typically lasts ten to fifteen seconds and serves a single purpose: to signal to the examiner that the topic has been understood and the talk is about to begin. A simple pattern works best: paraphrase the main topic and state the scope. For example: "I'd like to describe a book that has had a significant influence on my thinking. It was a novel I first encountered during my final year of secondary education, and it changed the way I approached ethical questions." This brief framing costs very little time but provides a strong fluency signal.
The body of the talk addresses the four cue points in sequence, with the expandable points receiving proportionally more time. Candidates should aim to spend approximately twenty to thirty seconds on each mandatory point and thirty to forty-five seconds on each expandable point, though these allocations are flexible. The key principle is this: every point must be addressed at least briefly, and at least two points must be developed with specific detail, examples, and extended sentences.
The closing statement is frequently neglected by candidates who run out of time or who are so absorbed in describing content that they forget to wrap up. A natural conclusion does not require a formal summary; it simply requires the candidate to signal that the description is complete. Phrases such as "That's essentially what I wanted to say about that topic," or "In short, this experience remains one of the most formative of my life" serve this function efficiently and provide a clean ending that the examiner can recognise as a full stop.
Triage decisions: which points to expand and how much
One of the most consequential strategic decisions in the preparation window is choosing which cue points to develop in depth. The cue card typically presents four points, but not all points are equally productive for demonstrating language ability. Candidates who try to give equal time to all four points often produce a shallow, list-like talk that lacks depth and does not allow for extended discourse — a key feature of Band 7 and above performance.