The PTE Academic Speaking section presents candidates with two task types that superficially appear similar—both require spoken output under time pressure—yet they draw on fundamentally different cognitive operations and are evaluated against distinct scoring priorities. Repeat Sentence tests your ability to capture, retain, and reproduce audio input with high fidelity, while Describe Image demands that you parse visual information, construct a coherent narrative, and deliver it with structural fluency. Understanding how the three core scoring dimensions—Content, Oral Fluency, and Pronunciation—behave differently across these two tasks is essential for allocating your preparation time efficiently and maximising your speaking band score.
This article analyses the scoring mechanics of Repeat Sentence and Describe Image in isolation and side by side, identifies which dimension carries the greatest leverage in each task, and provides targeted training approaches so you can calibrate your practice programme accordingly.
The three scoring dimensions of PTE Speaking
Before examining how each dimension functions across task types, it is worth establishing precisely what the PTE Academic scoring rubric measures under each dimension.
Content refers to the relevance, completeness, and accuracy of what you say. In Repeat Sentence, Content is scored based on how many of the recorded words you successfully reproduce. In Describe Image, Content is assessed against the extent to which you cover the key elements of the image and articulate a logical conclusion.
Oral Fluency measures the smoothness, rhythm, and natural cadence of your speech. The scorer evaluates whether your output sounds like that of a fluent speaker—steady pacing, appropriate grouping of ideas, and minimal hesitation or false starts.
Pronunciation concerns the clarity and intelligibility of individual phonemes and the overall prosodic features of your output, including stress patterns, vowel quality, and consonant precision.
Each dimension is scored independently on a 0–90 scale, and all three contribute to your final PTE Academic Speaking score. However, the weight each task places on each dimension varies considerably.
Repeat Sentence: the memory-first task
In Repeat Sentence, you hear a recording of between 3 and 9 seconds and must repeat it verbatim (or near-verbatim) immediately afterwards. The scoring for this item type is direct: points are awarded proportionally based on the number of words correctly recalled and reproduced. One or two minor errors are tolerated, but each omitted or incorrectly produced word reduces your Content score.
How the scoring dimensions interact in Repeat Sentence
Because the source material is fixed and you cannot add or omit information meaningfully (your goal is fidelity to the original), Content is the dominant dimension in Repeat Sentence. If you miss a key word or introduce a word that was not in the original, your Content score suffers. The scoring algorithm counts word-level matches against the original recording.
Oral Fluency in Repeat Sentence is scored with a specific constraint: the response must be delivered in a single, fluid utterance. Stopping mid-sentence, inserting filler words, or producing noticeable hesitation disrupts the Oral Fluency score. However, since your content is externally defined, the Oral Fluency dimension here primarily rewards uninterrupted, confident delivery rather than sophisticated discourse planning.
Pronunciation in Repeat Sentence is evaluated with particular attention to vowel length and consonant precision, since the scorer can compare your output against the original recording's phoneme sequence. Subtle pronunciation deviations that might pass unnoticed in free speech can reduce your Pronunciation score in Repeat Sentence.
Key preparation strategies for Repeat Sentence
- Practise active listening with shadowing exercises—listen to a sentence and repeat it immediately, mimicking rhythm and intonation as closely as possible.
- Train your short-term auditory memory by gradually increasing sentence length from 3–5 words to 10–12 words before attempting full-length items.
- Work on phonetic precision, especially with commonly mispronounced word endings (final consonants in words like "stopped," "jumped," "handed").
- Avoid filler words such as "um" and "uh"—even a single hesitation can reduce your Oral Fluency score on a short response.
- Develop the habit of starting immediately after the audio ends; any pause before beginning your repetition costs you points on Oral Fluency.
Describe Image: the structure-first task
In Describe Image, you are shown a still image—a graph, chart, diagram, photograph, or map—and given 25 seconds to prepare a spoken response of approximately 40 seconds. Unlike Repeat Sentence, there is no externally defined correct answer. Your score depends entirely on how effectively you structure your description and how well you cover the image's key elements.
How the scoring dimensions interact in Describe Image
In Describe Image, Content becomes a matter of strategic completeness rather than verbatim reproduction. The scorer awards points for covering main trends, key data points, comparisons, and a logical conclusion. If you mention only two of five visible trends in a complex graph, your Content score will be limited regardless of how fluently you speak.
Oral Fluency in Describe Image is arguably more demanding than in Repeat Sentence because you are generating original discourse. The scorer looks for steady pacing, natural phrasing groups, and minimal hesitation or self-correction. Building a reliable template structure is the single most effective way to protect your Oral Fluency score in this item type, because it reduces the cognitive load of content generation and allows you to focus on delivery.
Pronunciation in Describe Image is evaluated in the context of spontaneous, continuous speech. The scorer listens for overall intelligibility and consistent phoneme production. This means that even if your individual phonemes are clear, a tendency to rush through numbers or mumbled clause transitions can reduce your Pronunciation score.
Key preparation strategies for Describe Image
- Develop and rehearse a consistent template structure: identify the type of image, describe the main features in order of visual prominence, highlight key data points or comparisons, and conclude with a summary statement.
- Categorise images into types (line graph, bar chart, pie chart, process diagram, map, photograph) and build a slightly different template for each category to reduce preparation time within the 25-second window.
- Practise speaking for exactly 35–40 seconds—responses shorter than 25 seconds receive no score for Content, while responses exceeding 50 seconds are automatically cut off by the system.
- Train your eye to scan images quickly for titles, axis labels, legends, and anomalous data points within the 25-second preparation window.
- Work on transitional phrases ("as shown in the graph," "the second notable trend is") that allow you to move between ideas without hesitating.
Comparative analysis: scoring dimension weight by task type
The table below summarises the relative importance of each scoring dimension in Repeat Sentence and Describe Image. Ratings indicate how severely an error in that dimension affects your overall score for the item.