The TOEFL iBT Speaking section uses a single, unified rubric to evaluate every response you give, whether you are discussing your hometown in an Independent task or summarising a psychology lecture in an Integrated one. That rubric has three dimensions: Delivery, Language Use, and Topic Development. Each dimension carries its own weight, and each has a specific threshold that separates a score of 3 from a score of 4 — the difference between a competitive and an uncompetitive application for most universities. This article unpacks what each dimension actually measures, how raters apply these criteria across the six task types you will encounter, and the concrete adjustments that move your responses into the top band.
Why the same rubric governs all six speaking tasks
One of the most persistent misconceptions about the TOEFL Speaking section is that Integrated tasks and Independent tasks are graded differently. They are not. The TOEFL Educational Testing Service (ETS) applies the same 0–4 scoring scale to every task, with raters trained to evaluate Delivery, Language Use, and Topic Development on each response regardless of whether you are talking about a personal experience or summarising an academic lecture. Understanding this uniformity is the first strategic advantage you can gain. Rather than preparing separate strategies for different task types, you can build a single set of habits that satisfies all three rubric dimensions simultaneously. The result is a more efficient preparation programme and a more consistent performance on test day.
The four Independent/Integrated task pairs in detail
The Speaking section contains four scored tasks that appear in a fixed order. Tasks 1 and 2 are Independent tasks: Task 1 asks you to describe a situation, preference, or personal experience, and you have 15 seconds to prepare and 45 seconds to speak. Task 2 requires you to state and defend a personal opinion on a general topic, also with 15 seconds of preparation and 45 seconds of response time. Tasks 3 and 4 are Integrated tasks: Task 3 involves reading a short passage and listening to a conversation or lecture, after which you must explain how the spoken example relates to the reading. Task 4 asks you to summarise key points from an academic lecture and typically requires you to provide supporting details. Both Integrated tasks give you 30 seconds to prepare and 60 seconds to respond. The rubric applies to every one of these identically.
Delivery: the dimension most candidates underestimate
Delivery encompasses clarity of speech, pace, intonation, and overall fluency. Raters are not evaluating your accent — this point cannot be stated too firmly. The rubric does not penalise non-native English accents provided your pronunciation allows the raters to parse your meaning without conscious effort. What the rubric does penalise is speech that forces the rater to work to decode your words. A delivery score of 4 requires that your response sounds natural and mostly effortless: your pacing stays within a comfortable range (roughly 120–150 words per 45-second response, or 160–200 words for a 60-second response), your intonation signals that you are finishing complete thoughts rather than trailing off, and you avoid long, noticeable pauses. A score of 3 typically means the delivery is intelligible but shows intermittent strain — a handful of unclear pronunciations, one or two awkward silences, or a pace that accelerates into a rushed finish.
The 5-second rule and what it costs you
In practice, the most common Delivery problem I see with test candidates is what I call the rushed opening — the instinct to begin speaking immediately after the preparation timer expires, without taking one deliberate breath first. This costs you roughly 3–5 seconds of a response that could otherwise be used to establish a clear, well-paced start. Most candidates who lose the 4 in Delivery do so not because of a strong accent but because they begin mid-thought, fumble the first phrase, and then spend the rest of the response recovering. A simple pre-speaking habit — a single measured breath and the opening phrase already formed mentally — eliminates this entirely. It takes two weeks of deliberate practice to automate this habit, which is a modest investment for a dimension that represents one-third of your Speaking score.
Language Use: grammar and vocabulary at the sentence level
Language Use evaluates the accuracy and appropriate range of your grammatical structures and vocabulary at the sentence and discourse level. A score of 4 requires mostly error-free grammar with appropriate vocabulary selection — not native-level sophistication, but functional accuracy that does not impede comprehension. Raters are trained to distinguish between errors that are minor slips (a forgotten article, a singular/plural inconsistency) and errors that represent a pattern that disrupts meaning. A single dropped article in an otherwise clean response will not prevent a 4. A string of errors — subject-verb disagreement in consecutive sentences, repeated misuse of prepositions, or consistent tense inconsistency — will pull a response down to 3.
Which grammar structures matter most in 45–60 seconds
Because the Speaking tasks are time-constrained, the rubric cannot expect you to produce the full range of complex structures you might demonstrate in written work. However, certain structures carry disproportionate weight in the Language Use dimension. Sentence variety — the ability to produce both simple and complex sentences without repetitive patterns — signals a score of 4. If every sentence in your 45-second response follows an identical subject-verb-object pattern, the rater will note this as a limitation in range. Target two or three sentence-type variations: a compound sentence using a conjunction, a complex sentence with a dependent clause, and a simple declarative sentence. Vocabulary choice matters at the word level: avoid repeating the same high-frequency words (like "good" or "important") when more precise alternatives exist. In an Integrated task about a university policy, for instance, "campus" and "facility" demonstrate better range than repeating "the university" in every clause.