TOEFL Speaking Task 3 requires candidates to synthesise information from a reading passage and a lecture within a strict time envelope. Unlike discrete question types where individual items can absorb overruns, Speaking Task 3 is a linear, unforgiving sequence: every second spent on one element is a second subtracted from another. Misjudging how to distribute time across the reading, the lecture notes, and the 60-second oral response separates mid-band scores from high-band performance. This article provides a phase-by-phase breakdown of the Task 3 time budget, concrete benchmarks for what to accomplish in each segment, and the structural decisions that determine whether a response feels complete or rushed.
Understanding the TOEFL Speaking Task 3 time sequence
Speaking Task 3 operates as a single integrated exercise with four sequential phases. The reading occupies approximately 30 seconds to one minute, the listening segment runs to about 90 seconds, and the candidate is then granted 30 seconds to prepare before delivering a 60-second response. Most preparation materials present these numbers correctly, but few help candidates translate them into actionable targets within each phase. The result is that many candidates enter the response phase under-prepared: they have understood the content but have not decided what to say, in what order, and within how many words. Without this pre-response planning, even strong comprehension fails to translate into a coherent, complete response.
The 30-second preparation period is frequently underestimated. Candidates who have just finished listening to a lecture often feel that they should begin speaking immediately, as if delay signals indecision. In reality, the preparation period exists precisely because the task demands a structured synthesis. The candidates who score highest use those 30 seconds to map their response mentally, not to re-read their notes or listen a second time.
- Reading phase: identify the theory or model and its two supporting reasons
- Listening phase: distinguish the examples or counterexamples the professor introduces
- Preparation phase: outline the three-sentence response structure
- Response phase: deliver the synthesis within a 60-second speaking window
Phase 1 — The reading: what to extract in under 45 seconds
The reading passage in Speaking Task 3 is almost always structured around a theoretical claim, a proposed change, or a contested principle, accompanied by two reasons or pieces of evidence. The passage is brief — typically three to four paragraphs — and its function is to establish the position you must later explain in relation to the lecture. Candidates who attempt to read every sentence closely during this phase frequently run out of time for the listening phase, or find themselves unable to remember which details came from the reading when they later review their notes.
The effective strategy is to treat the reading as a source of two elements only: the central claim and its two supporting reasons. A common passage structure places the main argument in the opening paragraph and the two supporting points in the subsequent body paragraphs. By skimming for these three elements during the reading phase, candidates free cognitive bandwidth for the more demanding lecture phase. Notes taken during the reading should be brief — a single phrase for the thesis and one keyword per supporting reason is sufficient. Attempting to write complete sentences during the reading wastes time and produces notes that will be difficult to integrate with lecture material later.
Reading extraction checklist
- Main claim: what does the passage argue is true or should be done?
- Reason 1: what is the first evidence or argument the passage offers?
- Reason 2: what is the second evidence or argument the passage offers?
- Ignore: elaboration, examples, rhetorical framing during the initial read
Phase 2 — The lecture: mapping the professor's response to the passage
The lecture segment in Speaking Task 3 is always a direct response to the reading passage. The professor either provides examples that support the passage's claims or introduces evidence that complicates or contradicts them. This binary relationship — support or contradict — is the structural backbone of the response, and it determines how the response is framed. Understanding this relationship during the lecture, rather than after it, is one of the most significant time-saving habits a candidate can develop.
While listening, candidates should note the lecture's stance first. If the professor offers examples that illustrate or validate the passage's two reasons, the response framing becomes: the passage argues X and Y; the professor illustrates both points with specific examples. If the professor presents counterexamples, the framing shifts to: the passage argues X, but the professor's examples suggest otherwise. Marking this distinction in notes during the lecture — using a simple label such as "supports" or "contradicts" — eliminates the need to re-evaluate the relationship during the preparation phase.
The lecture typically introduces two concrete examples, one for each of the passage's two reasons. These examples are the primary content of the response. Candidates should record enough detail to paraphrase each example in one or two sentences: the subject, the relevant action or outcome, and the connection to the passage's reason. Abbreviations and single-word triggers are appropriate here; the notes are a memory prompt, not a transcript.
Phase 3 — The 30-second preparation window: building the response skeleton
The preparation phase is the most underused segment of Speaking Task 3. Candidates who spend this time reviewing their notes in a general, unfocused way gain little advantage. The preparation phase should be used to make three specific decisions: what the opening sentence will say, what the two body sentences will cover, and how the response will close. This is not a rehearsal — it is a structural plan.
A reliable three-sentence template for the 60-second response works as follows. Sentence one introduces the passage's thesis and the lecture's stance: "The passage argues that [thesis], but the professor's lecture suggests otherwise by providing two examples." Sentence two addresses the first reason and its corresponding lecture example: "First, the passage claims that [reason one], and the professor illustrates this with [paraphrased example one]." Sentence three addresses the second reason and its corresponding lecture example: "Second, the passage states that [reason two], and the professor supports this with [paraphrased example two]." This template is flexible enough to accommodate both support and contradict scenarios by adjusting the connecting language. The preparation phase allows candidates to populate each slot with specific details from their notes, converting raw shorthand into speakable phrases.
The 30-second window is also the moment to decide on the opening verb tense and phrase structure. Switching between structures mid-response — beginning in present tense and then shifting to past — creates a disfluency that evaluators interpret as uncertainty. Confirming the grammatical frame during preparation eliminates this class of error.
Phase 4 — The 60-second response: what to accomplish and how to pace it
The 60-second speaking window is not long. At a normal conversational pace, 60 seconds accommodates approximately 120 to 150 words in English. This word budget must cover an introduction, two body segments with specific examples, and optionally a brief conclusion. Efficient candidates treat the response as a constrained writing exercise: every word must earn its place.