IELTS Speaking Part 2, also known as the long turn, requires candidates to speak uninterrupted for up to two minutes on a given topic, supported by a cue card that provides prompts and structure. Unlike IELTS Speaking Part 1, which follows a conversational question-and-answer format, and IELTS Speaking Part 3, which involves abstract discussion with the examiner, Part 2 places the candidate in the role of a speaker with full control over content organisation and discourse flow. Understanding how the cue card functions as a structural scaffold—rather than merely a list of things to mention—is the key difference between candidates who produce organised, coherent talks and those who ramble or lose coherence partway through. This article examines the Part 2 cue card in detail, analyses how its four cue types guide discourse architecture, explains how to use the one-minute preparation window strategically, and provides a framework for building a consistently high-scoring 2-minute response.
What IELTS Speaking Part 2 actually requires: task demands and assessment criteria
The IELTS Speaking Part 2 long turn begins when the examiner hands the candidate a topic card and asks them to speak for one to two minutes. The candidate receives exactly one minute to prepare, during which they may make notes on a piece of paper provided by the examiner. When the minute ends, the examiner signals the candidate to begin speaking, and the candidate must continue until they have finished their response or until the examiner interrupts after the two-minute mark.
The task is designed to assess several dimensions of spoken performance simultaneously. Fluency and coherence account for the logical flow of ideas, the use of cohesive devices, and the ability to sustain a extended turn without unnatural hesitation. Lexical resource evaluates the range and accuracy of vocabulary deployed, including less common words and appropriate collocations. Grammatical range and accuracy measure the complexity and accuracy of sentence structures produced. Finally, pronunciation is assessed in terms of clarity, stress patterns, intonation, and the extent to which the candidate's speech is comprehensible to an average listener.
The cue card is not merely a prompt—it is a deliberate assessment instrument. Its structure reflects the criteria that examiners use to evaluate performance. A candidate who ignores the cue card's guidance and speaks off-topic will score poorly on fluency and coherence, even if their language is otherwise proficient. Conversely, a candidate who follows the cue card's implied structure and delivers a well-organised talk demonstrates discourse management skills that map directly onto the higher band descriptors. Understanding this relationship between the cue card's architecture and the assessment criteria is the first step towards strategic preparation.
Most Part 2 topics follow a consistent pattern: a general subject statement followed by three to four specific sub-points that the candidate is expected to address. The topics themselves range widely—describing a person, place, event, activity, object, or experience—but the underlying task structure remains constant across all variants. This consistency means that the skills required for success in Part 2 are transferable across all topic types, provided the candidate learns to read the cue card as a structural blueprint rather than a checklist.
Anatomy of the Part 2 cue card: identifying the four cue types
Every IELTS Speaking Part 2 cue card contains four distinct elements, each of which serves a different function in guiding the candidate's response. Skilled candidates learn to distinguish between these elements during the one-minute preparation window and to allocate speaking time proportionally to each.
The first element is the topic statement, which appears at the top of the card in a phrase such as "Describe a book that you have recently read" or "Talk about a skill you would like to learn." This statement defines the broad subject area and sets the boundaries of what is relevant. The topic statement alone is insufficient for a full two-minute talk, but it provides the thematic anchor that keeps the response focused.
The second element comprises the content cues—the bullet points or numbered prompts that expand the topic statement into specific aspects. These typically ask the candidate to address what, why, how, or describe characteristics. Common formulations include "Explain why it was important to you," "Describe how you learned about it," or "Talk about how it affected you." These cues are the primary structural pillars of the talk, and each one corresponds roughly to a paragraph or major section of the response.
The third element is the audience framing, which is sometimes implicit rather than explicit. The cue card implicitly positions the candidate as someone speaking to the examiner as if to a moderately informed listener. This framing has implications for the register and level of explanation required. The candidate does not need to define basic concepts, but they should provide sufficient context and reasoning for an intelligent listener who is unfamiliar with the specific subject.
The fourth element is the time boundary, which in Part 2 is set at one to two minutes. This boundary has two implications for response design. First, it imposes a hard limit on the amount of content that can be covered. Second, it rewards economy of expression—candidates who ramble or repeat themselves waste time that could be spent developing richer ideas.
The three-cue versus four-cue card: does the number of prompts matter?
Most Part 2 cue cards contain three or four content cues, though some have fewer. The number of cues does not change the fundamental approach to response design, but it does affect time allocation. A three-cue card allows approximately 40 seconds per cue, while a four-cue card allows approximately 30 seconds per cue. In both cases, the candidate must prioritise depth over breadth—covering all the cues in surface detail is less effective than developing two or three cues with examples and elaboration. The mark scheme rewards substance and coherence over mere completeness of cue coverage.
| Cue card element | Function in the response | Approximate time allocation |
|---|---|---|
| Topic statement | Sets thematic scope; frames the subject | 5–10 seconds (opening) |
| Content cue 1 | First structural pillar; first body paragraph | 25–40 seconds |
| Content cue 2 | Second structural pillar; second body paragraph | 25–40 seconds |
| Content cue 3 | Third structural pillar; third body paragraph or conclusion | 25–40 seconds |
| Closing cue (if present) | Summary or reflection; signals completion | 10–15 seconds |
The 1-minute preparation window: converting reading time into structural planning
The one-minute preparation period before the long turn is unique within the IELTS Speaking test. In Parts 1 and 3, candidates must respond immediately, with no advance preparation. In Part 2, the test format deliberately grants a brief window for planning. This window is not a courtesy—it is a tool, and candidates who use it strategically gain a measurable advantage.
During this minute, the candidate's primary goal should be to design the structure of the response, not to write a script. Attempting to write out full sentences or detailed notes is counterproductive because the time pressure makes it impossible to produce polished language, and the cognitive load of reading from notes during the talk disrupts fluency and eye contact. Instead, the candidate should use the minute to accomplish three concrete tasks.
First, the candidate should analyse the cue card prompts and identify the logical sequence in which they will be addressed. The cue card's prompts may not be presented in the order most suitable for a compelling talk. For example, a cue card might list "what it is," "when you first did it," and "why you enjoyed it." The candidate might choose to reorder these into a narrative arc that begins with the experience and then explains its significance—a structure that creates a stronger sense of progression than a simple catalogue.
Second, the candidate should allocate time across the cues based on which aspects of the topic they feel most confident discussing and which offer the richest language opportunities. If one cue invites the use of descriptive vocabulary—describing a place, for instance—it may warrant more time than a cue that simply requires a factual statement. The candidate should make a rough mental note of how many seconds each section will receive.
Third, the candidate should write brief keywords or short phrases as memory triggers for each section. These notes serve only as a safety net to prevent the candidate from losing their place mid-talk. A few words per cue point are sufficient. The notes should be legible and scannable at a glance, not a dense paragraph that the candidate will struggle to read while maintaining eye contact with the examiner.
What the candidate should avoid during the preparation minute is equally important. Spending time rehearsing exact phrases in the head creates anxiety without improving the quality of spontaneous speech. PANicking about the topic or worrying about lexical gaps wastes time that should be spent on structural planning. And beginning to speak before the examiner signals the start—some candidates cannot resist the temptation—disrupts the test protocol and may count against them.
Building the 2-minute response: discourse architecture for IELTS Speaking Part 2
A high-scoring IELTS Speaking Part 2 response is not simply a collection of true statements about the topic. It is a piece of structured discourse that demonstrates the candidate's ability to organise ideas, control complexity, and maintain coherence across an extended turn. The cue card provides the raw material; the candidate must construct the architecture.
The opening of the response should accomplish two things within approximately five to ten seconds. It should establish the subject clearly and signal the overall direction of the talk. A simple and effective opening formula is to restate the topic statement in the candidate's own words and add a preview of what will follow: "I'd like to talk about a book I read recently—The Remains of the Day—and explain why it made such a deep impression on me, what the story is about, and how it changed the way I think about certain themes." This kind of opening demonstrates fluency, provides orientation for the examiner, and creates a sense of control that carries through the rest of the response.
The body of the response is where the content cues are addressed in turn. Each cue should correspond to a distinct section of the talk, separated from the others by a brief transitional phrase or a pause. The candidate should avoid a common mistake, which is to blend all the cues together into a single undifferentiated stream of speech. A well-structured response has visible paragraphs, even though it is delivered orally. Each paragraph should develop a single aspect of the topic, using examples, reasons, or descriptive details to add substance. The language should become progressively more complex as the response develops—starting with simpler sentences and gradually introducing subordinate clauses, complex tenses, and less common vocabulary as confidence builds.
Cohesive devices play an essential role in maintaining the fluency and coherence scores. Within each paragraph, the candidate should use linking words and phrases to connect ideas: "The reason I want to talk about this is that...", "What struck me most was...", "In addition to this, I should mention...", "When I think about it now, I realise that..." These devices signal to the examiner that the candidate is actively managing discourse structure, not merely producing a string of unrelated statements. At the paragraph level, a transition sentence that bridges one cue to the next reinforces the logical flow: "Having described the setting, I'd now like to move on to what happened as a result."