TOEFL iBT Speaking Task 3 is an integrated academic task that requires candidates to synthesise information from a reading passage with details from a lecture. The examinee reads a short academic text—typically a campus-related announcement or a brief theoretical passage—then listens to a conversation or lecture in which the speaker either supports, refutes, or illustrates the reading material. Candidates then have 30 seconds to prepare and 60 seconds to deliver a response that integrates both sources. This task assesses the ability to comprehend academic discourse, identify the relationship between spoken and written information, and articulate that synthesis in coherent, time-pressured spoken English. Success depends not merely on language proficiency but on a structured approach to reading selectively, listening analytically, and organising a response under examination conditions.
The structure and timing of TOEFL iBT Speaking Task 3
Before examining strategies for individual components, candidates must internalise the temporal architecture of the task. The sequence unfolds across four distinct phases: reading, listening, preparation, and speaking. The reading passage appears on screen for approximately 45 seconds, though candidates control the pace and may reread as needed during this window. The listening segment follows immediately and typically runs between 60 and 90 seconds. After the audio concludes, the screen displays a preparation prompt, and a countdown timer provides exactly 30 seconds for note organisation and mental outlining. The response window is capped at 60 seconds of spoken output. Understanding this sequence allows candidates to allocate cognitive resources appropriately at each stage rather than improvising under pressure.
The passage itself is invariably drawn from an academic or campus context: a proposal to change library hours, a new university policy regarding dining services, a brief explanation of a biological or social phenomenon. The lecture or conversation that follows typically presents a speaker's reaction—often a student or professor commenting on the proposal or illustrating the concept from the reading. The key pedagogical skill assessed here mirrors genuine academic demands: the capacity to engage with written material and then extend, critique, or apply that material in response to spoken discourse.
- Reading phase: approximately 45 seconds, screen-displayed, rereadable
- Listening phase: 60–90 seconds of audio, single opportunity
- Preparation phase: exactly 30 seconds, countdown timer visible
- Response phase: maximum 60 seconds of spoken output
The academic reading component: what to extract and what to ignore
One of the most frequent inefficiencies among TOEFL iBT candidates is attempting to read the passage with equal attention to every sentence. The reading segment in Speaking Task 3 is not a comprehension test; it is a source document. Candidates should adopt a selective extraction strategy that identifies and mentally bookmark three elements: the main proposal or concept, the supporting reasons or evidence presented, and the structure of the argument. The goal is not to retain every detail but to hold a functional mental schema that can be activated when the lecture provides elaboration, counterargument, or illustration.
In practice, this means reading the first sentence of each paragraph with full attention and skimming the remainder for specific details that appear useful. If the passage argues that the university should extend library opening hours, the reader identifies the two or three stated reasons immediately. These reasons become the mental filing system into which lecture information will later be sorted. The candidate who enters the listening phase with a clear sense of what the reading claimed is far better positioned to recognise when the speaker agrees, disagrees, or introduces a related example than the candidate who processed the text at uniform depth throughout.
Note-taking during the reading phase is optional but advisable for candidates who struggle with short-term retention. A few abbreviated phrases—three to five words per key point—are sufficient. These notes serve as an anchor during the listening phase and prevent the common error of losing the thread of the reading when the audio begins.
Identifying signal phrases in academic passages
Academic passages in TOEFL iBT Speaking Task 3 consistently employ signal phrases that indicate structure. Phrases such as "the university proposes," "according to the announcement," or "the passage explains" establish the main claim. Supporting reasons are typically introduced by "for this reason," "the administration argues that," or "the proposal cites evidence that." Train yourself to recognise these patterns so that your extraction becomes rapid and automatic rather than effortful.
Listening strategically: capturing lecture details that matter
The listening segment presents a single audio opportunity—no replay, no pause, no speed control. This asymmetry with the reading phase demands a disciplined listening strategy. Candidates should approach the lecture with an active hypothesis about what they expect to hear, based on the reading: the speaker will likely either support or contradict the reading, and the response quality depends on accurately characterising which of these occurs and with what specificity.
The speaker in the listening segment typically takes one of three positions relative to the reading. First, the speaker may provide specific examples that illustrate or reinforce the reading's claims. Second, the speaker may present reasons why the proposal or theory in the reading is problematic, flawed, or unlikely to achieve its aims. Third, the speaker may describe a personal experience or hypothetical scenario that complicates or extends the reading. Each of these patterns requires a slightly different response structure, so rapid identification of the speaker's stance is among the most valuable skills for this task.
Note-taking during listening should prioritise three categories of information: the speaker's overall attitude (agree, disagree, neutral illustration), the specific reasons or examples provided, and any direct references to the reading. A practical abbreviation system—such as using arrows to indicate cause-effect relationships or circling stance markers—helps candidates reconstruct the lecture's logic during the 30-second preparation window. Candidates who attempt to transcribe the lecture verbatim almost invariably miss the larger argumentative structure; those who capture the architecture of the argument in abbreviated notes are better equipped to produce a coherent response.
The signal detection challenge: when speakers are subtle
Not all TOEFL iBT speakers in Task 3 announce their stance with explicit language. Some speakers express qualified agreement or nuanced disagreement, using phrases such as "I see why they think that, but in practice..." or "the idea makes sense on paper, but." Candidates must listen for these qualifying structures, which signal that the speaker's overall position diverges from the reading even when the opening language appears to agree. Practising with diverse lecture recordings—such as university podcast episodes or online lecture excerpts—builds the ear for these subtle stance markers.
Response structure: the 60-second blueprint for TOEFL iBT Speaking Task 3
Sixty seconds of spoken English is both generous and constrained. It is sufficient to deliver a complete, well-structured response that covers all required elements, but it is not sufficient for digression, repetition, or unfocused narration. The optimal response follows a three-part structure that can be rehearsed and internalised as a default template: introduction and stance identification, first point with lecture support, second point with lecture support. This structure ensures completeness, logical progression, and adherence to the rubric's emphasis on relevant content and clear organisation.
The introduction should accomplish two things in no more than two sentences: identify the topic of the reading and state the speaker's position. A model opening might be: "The reading proposes that the university extend library hours until midnight, arguing that this would benefit students and reduce crowding. The speaker in the lecture expresses scepticism about this proposal, questioning both the demand and the logistics." This framing immediately orients the examiner and establishes the relationship between sources.
Each body paragraph should focus on a single point of connection between the reading and the lecture. For each of the reading's supporting reasons, the candidate should identify the speaker's corresponding reaction—whether confirmation, refutation, or illustration—and articulate both elements concisely. The candidate who says "The reading argues that extended hours would reduce crowding in the library during peak times. However, the speaker points out that the data from other universities show that students tend to cluster in the early evening regardless of opening times" has demonstrated the integrated comprehension the task requires.
Transitional phrases such as "in addition," "however," "consequently," and "the speaker also notes that" signal logical relationships and contribute to the organisation score. Candidates should avoid introducing entirely new information in the final seconds—any point not fully explained within the time limit is better omitted than mentioned incompletely.
Managing the 30-second preparation window
The preparation interval is best used not for writing a script but for three rapid activities: verifying the accuracy of key notes taken during listening, mentally selecting which two or three points to prioritises, and silently rehearsing the opening sentence. Writing out full sentences during this window is counterproductive—it produces awkward eye contact with notes during the response and rarely survives contact with the actual speaking pace. The candidate who internalises the blueprint structure during preparation will find that the first words of the response emerge naturally, freeing cognitive capacity for fluid delivery.