How to read a GMAT Focus Data Sufficiency stem in under 30…

Data Sufficiency is the most structurally distinctive question family on the GMAT Focus Quant section, and the one where a small number of repeatable habits separate a 75th-percentile score from a 90th-percentile one. Each item presents a question stem, two labelled statements, and a fixed set of five answer choices whose verbs (sufficient, not sufficient, both, either, neither) carry every ounce of the scoring weight. The arithmetic is rarely the obstacle; the obstacle is keeping your reasoning organised under a 62-minute clock while the test asks, again and again, whether a single piece of information is enough to settle the matter.

This article builds a working strategy for Data Sufficiency on the GMAT Focus Edition. The approach rests on three habits: read the stem for its question type before you touch Statement 1, treat each statement as a self-contained world you can enter or leave, and resist the urge to do the final arithmetic once you already know sufficiency. Each habit is mechanical, trainable, and easy to drop the moment pressure builds, which is why the rest of this piece is dedicated to the concrete moves that keep the habits alive when the timer is running.

Reading the stem in under 30 seconds: identify the question type and the unknown

The single most common failure I see in tutoring sessions is not a miscalculation but a misread stem. Candidates read a Data Sufficiency prompt the way they read a regular Quant problem, which is to say they start hunting for the numbers and only later discover what the question was actually asking. On the GMAT Focus, that delay is expensive. The five answer choices are completely insensitive to the value of the answer, so any arithmetic you perform before you understand the unknown is, in scoring terms, a gift to the clock.

Your first 30 seconds on a Data Sufficiency item should answer two questions, in this order: what kind of value does the question ask for, and what would count as sufficient evidence. Is the question asking for a unique numerical value, a range, an integer count, a yes/no decision, or a relationship such as "x greater than y"? Each of those answer types has a different sufficiency signature. A unique numerical value requires that the data pin down exactly one number. A yes/no question only requires that the data force the same answer every time, regardless of the specific values. A relationship such as "is x positive?" only requires that the data force a single polarity.

Once the question type is clear, look for hidden traps. Some stems contain a quantifier that flips the entire logical load: "What is the value of x?" behaves nothing like "What is one possible value of x?". The first demands a unique answer; the second only demands that the statements jointly restrict x to some set you can name. Other stems bury a constraint inside the wording, such as "x is a positive integer" or "x and y are distinct". Those three words change the sufficiency calculus completely because they restrict the universe the statements can choose from. Underline them in your mind, or write them on your scratch pad. A stem that says "What is the value of x?" with no further constraint is a much harder prompt than the same stem with "x is a positive integer" attached, because the integer condition collapses many near-miss cases into a single verdict.

A useful 30-second drill is to restate the stem out loud, in plain English, as if you were explaining the question to a friend. If your restatement contains the word "exactly", "unique", or "the value of", you are looking for a single answer. If it contains "must be", "always", or "is it true that", you are looking for a forced verdict. If it contains "could be" or "is it possible that", the test is asking whether the statements can produce the named outcome, which is a much softer target. The restatement step costs you ten seconds and saves you the most common error category on the exam: answering a different question than the one that was asked.

The five answer-choice verbs and what they really demand

The Data Sufficiency answer key is fixed across the entire GMAT Focus, and that fixed key is the single biggest strategic gift the format offers. You will never see a sixth option, and you will never see a verb that is not one of five. Memorising the structure of those five choices, including the conditions under which each becomes available, lets you work backwards from the choice set to the logic. The structure looks like this in plain form:

Statement (1) ALONE is sufficient — Statement 2 is neither needed nor able to change the verdict.
Statement (2) ALONE is sufficient — Statement 1 is neither needed nor able to change the verdict.
BOTH statements TOGETHER are sufficient, but NEITHER alone is sufficient — the classic "1+2=3, 1≠2" case.
EACH statement ALONE is sufficient — Statement 1 settles it; Statement 2 also settles it; no need to combine.
NEITHER statement NOR both together is sufficient — the test is essentially undecidable on the evidence provided.

The trap that catches most candidates is to read these five choices as ordinal, that is, to assume the test is moving from "least information" to "most information". It is not. The five choices are logical, not ordinal. Choice (D), the "each alone" option, is no more or less powerful than Choice (B); both are simply scenarios that fit the data. What unifies the five is the work you do on each statement, not the order in which they are listed.

Two tactical points flow from this. First, never eliminate a choice because it "feels too generous". If Statement 1 truly settles the question, then (A) is a legitimate verdict, even when the problem looks like it ought to require both. Second, treat the four "sufficient" outcomes as a single category and the one "insufficient" outcome as its opposite. The whole exam is, in scoring terms, asking: is there enough information, or is there not. The verb that the answer choice uses is just a routing label.

For most candidates I tutor, the highest-leverage move is to translate the answer key into a decision tree. Decide first whether Statement 1 is sufficient. If yes, ask whether Statement 2 is also sufficient. If yes, the answer is (D). If no, the answer is (A). If Statement 1 is not sufficient, move to Statement 2. If Statement 2 is sufficient on its own, the answer is (B). If Statement 2 is also not sufficient, ask whether the two together are sufficient. If yes, the answer is (C). If no, the answer is (E). This is the only logic the answer key actually requires. If you can hold that tree in your head, the verb in front of the answer choice becomes almost decorative.

Testing Statement 1: the art of finding a counterexample

Once you have isolated the question the stem is asking, the work on Statement 1 is binary: is it sufficient, or is it not? Sufficiency in the GMAT Focus sense is a stronger claim than "the statement is true". Sufficiency means that the statement, taken on its own, removes all ambiguity about the unknown. The fastest way to test that claim is to attempt the opposite: try to construct a counterexample. A single valid counterexample is enough to prove that the statement is not sufficient. Two valid examples that yield different answers are also enough. You only need one consistent example to prove that the statement is sufficient.

Counterexample construction is the skill that separates experienced Data Sufficiency solvers from novices. It requires you to treat the statement as a constraint on a variable, and then ask: under this constraint alone, is the unknown forced? If the constraint is "x is a positive integer", the question is whether the stem's question is answered uniquely for every positive integer x. The answer is no for most prompts, yes for a few. If the constraint is "x = 7", the answer is yes for almost any stem. The skill is to read the constraint, sketch the resulting domain, and ask whether the domain collapses to a single value, a single polarity, a single yes/no answer.

In practice, three categories of counterexample appear again and again. The first is the symmetric case, where swapping two variables preserves the constraints. If x and y satisfy a relation and the question asks "what is the value of x?", a symmetric counterexample gives you a different x that also works, which is enough to kill sufficiency. The second is the boundary case, where the unknown sits at the edge of the constraint. If a statement says "x is greater than 5" and the stem asks "is x greater than 10?", the boundary value x = 7 is a counterexample to sufficiency, because the statement does not rule out x = 7. The third is the family case, where the statement pins down a shape but not a scale. "x and y are in a ratio of 2:1" is the canonical example: the ratio is fixed, but the absolute values are not, so any stem asking for a specific value of x is not answered by that statement alone.

One of the most expensive mistakes on Statement 1 is the "partial sufficiency" error, where a candidate finds a single answer under the statement's constraint and concludes that the statement is sufficient. A single answer is a necessary but not sufficient condition for sufficiency. The statement is sufficient only when every value consistent with the statement yields the same answer to the stem's question. A counterexample kills sufficiency; a single example never proves it on its own. Train yourself to ask, after you find a working value: "is this the only value the statement allows?" If you cannot answer yes with confidence, the statement is not sufficient.

Testing Statement 2 and the temptation to combine

Statement 2 testing is structurally identical to Statement 1 testing, and that symmetry is the second biggest strategic gift of the format. The same counterexample logic applies. The same boundary, family, and symmetric cases apply. The only difference is the constraint, and the constraint is the only thing you should be looking at. Candidates often forget this symmetry and start treating Statement 2 as a continuation of Statement 1, which leads them to combine the two statements before they have completed the single-statement analysis. Combining early is the second-most-expensive mistake on Data Sufficiency, after the misread-stem error.

The reason combining early is so costly is that it muddies the answer choice. If you combine before you finish, you cannot cleanly distinguish "(A) sufficient" from "(B) sufficient" from "(C) sufficient". The whole architecture of the answer key is built on the assumption that you have first asked about each statement in isolation. Skip that step and you force yourself to redo the work in less time, on an item that has already consumed more of your budget than it deserved.

A second temptation on Statement 2 is the "information is good" fallacy. Candidates will look at a Statement 2 that adds detail to Statement 1 and assume that the addition must help. Information is not the same as sufficiency. A statement can be informative, plausible, and even numerically true, and still fail to change the answer to the stem's question. The test of sufficiency is whether the statement, taken alone, settles the unknown, not whether it tells you something. The way to neutralise this fallacy is to test Statement 2 in complete isolation, pretending Statement 1 does not exist. If Statement 2 alone settles the stem, the answer is (B), regardless of how elaborate Statement 1 happens to be.

There is one more subtle trap on Statement 2 testing. Some statements, when taken alone, are insufficient for an interesting reason: they are too powerful. A Statement 2 that asserts "x = 7" is, of course, sufficient on its own, but candidates sometimes resist the answer (B) because the statement feels too easy. The test does not calibrate difficulty per item. A one-line Statement 2 is just as valid as a four-line Statement 2, and either can resolve the question. If Statement 2 alone removes the ambiguity, the answer is (B). Period.

Combining the two statements: when it helps and when it does not

The combination step is only relevant when neither statement alone is sufficient. If you have reached this branch of the decision tree, the question is whether the two statements, taken together, force a unique answer. The mechanics are the same as on the single-statement branch: look for a single value the combined constraints force, then check whether any other pair of values also satisfies both. Two specific patterns deserve attention because they recur in scoring material.

The first is the linear-system pattern. When Statement 1 gives you one linear equation in two unknowns and Statement 2 gives you a second linear equation in the same two unknowns, the combination is sufficient because the intersection is a single point. This is the cleanest combination on the exam, and it is the pattern most candidates recognise. The version that catches people is the disguised linear system, where the "equations" are inequalities, ratios, or products. A ratio of 2:1 is a single linear constraint, just like an equation. An inequality such as "x > y" is a half-plane. Two such constraints, properly chosen, pin down a region. The question is whether the stem asks for a value or for a polarity. A stem asking "is x positive?" can be settled by a region even when a stem asking "what is x?" cannot.

The second is the closure pattern. Statement 1 restricts the unknown to a finite set, and Statement 2 picks one element of that set. For example, Statement 1 says "x is one of {3, 5, 7}", and Statement 2 says "x is odd". Together, the statements narrow x to {3, 5, 7}, but they do not collapse to a single value. Add a third fact, "x is a prime" (which the first statement already implies), and the set is still {3, 5, 7}. The combination is not sufficient because two valid scenarios remain. This is the kind of combination step that often fools candidates into choosing (C) when the answer is (E).

The third is the redundancy pattern. Sometimes the two statements are logically equivalent, or one implies the other. If Statement 1 alone is sufficient, and Statement 2 is implied by Statement 1, the answer is still (A), not (D). (D) requires that each statement alone settles the question through independent routes, not that the statements are saying the same thing in two languages. The way to test for redundancy is to ask: if I removed Statement 2 entirely, would Statement 1 still be sufficient? If yes, Statement 2 cannot upgrade the answer beyond (A). If no, Statement 2 might be a contributor, and you should retest whether Statement 2 alone is sufficient.

Time budgeting, pacing, and triage on the 62-minute window

The GMAT Focus Quant section gives you 31 questions in 62 minutes, which works out to two minutes per item on average. Data Sufficiency items vary widely in real cost: a clean ratio or value problem can be disposed of in 60 seconds, while a disguised closure or symmetric-counterexample item can run to three and a half minutes without giving up its secret. The pacing mistake I see most often is letting a single hard Data Sufficiency item eat the budget for the two items that follow it. Because the section is adaptive and unscored items do not appear, every item you skip is a real opportunity cost.

A workable rule of thumb is to budget 90 seconds of active thinking on any Data Sufficiency item, with a hard ceiling of 2 minutes 30 seconds. The 90-second mark is where you should be making an honest decision: do I have a verdict on at least one of the statements, and can I sketch a path to a verdict on the other within another 30 seconds? If the answer is no, mark the item, pick your best guess from the structural cues (see below), and move on. The cost of one extra minute on a stuck item is rarely repaid by the extra insight, because stuck items tend to be stuck for structural reasons rather than arithmetic ones.

When you do need to guess, the answer-key distribution offers a small but real edge. Across the published scoring data, the five Data Sufficiency choices do not appear with equal frequency. Choice (C) tends to be over-represented in many item sets, choice (A) under-represented. That said, the distribution is item-set dependent and not a substitute for reasoning. The way I would use it in practice: if you have a genuinely 50/50 decision between two choices, prefer the one that the item-set tends to favour, but never let the distribution override a clear logical verdict.

Finally, treat the section as a sequence of two-statement trials, not a sequence of problems. Your goal on each item is to record a verdict on Statement 1, a verdict on Statement 2, and a verdict on the combination, in that order, before you even look at the answer choices. If you can produce those three verdicts, the choice is mechanical. If you cannot, the choice is a guess. The discipline of producing those three verdicts is what keeps the time budget honest and prevents the silent drift where you spend four minutes on an item without realising it.

Common pitfalls and how to avoid them

Data Sufficiency is a question family where the same handful of errors account for the majority of lost points. Working through them explicitly is faster than waiting for them to appear in your practice and then patching them one by one.

Misreading the stem: underlining the question type ("value of", "is it true that", "could be") before reading the statements costs ten seconds and prevents the single largest error category.
Finding a working value and declaring sufficiency: sufficiency requires every value, not one value. Train the reflex to ask "is this the only case?" after every successful example.
Combining the statements before testing them alone: the answer key is built on isolated testing. Combining early is a structural mistake, not a tactical one.
Treating information as sufficiency: a statement can be true and uninformative about the stem's question. The test is whether the statement removes the ambiguity, not whether it adds detail.
Letting symmetric and family cases hide: if a constraint pins down a shape but not a scale, or preserves symmetry between two variables, the stem's value question is rarely answered.
Resisting the easy answer: a one-line Statement 2 that settles the question is a legitimate (B). The test does not calibrate statement length against answer strength.

The way to install these reflexes is to keep a small error log: after every practice set, write down the one sentence that describes the mistake you made on each missed item. Patterns will appear within two or three sessions, and the pattern tells you which reflex to drill next. A candidate who logs "answered (D) when the statements were redundant" needs a different drill than a candidate who logs "found a counterexample on Statement 1 but didn't trust it".

Worked example: a representative sufficiency item

Consider the following stem, which captures the structure of a typical middle-difficulty Data Sufficiency item. What is the value of x? Statement 1: 2x + 3y = 17. Statement 2: y = 3. The expected verdict in a working test-taker's first 30 seconds is the question type: a unique numerical value, so the test is whether the data pin down x exactly.

On Statement 1, the constraint is a single linear equation in two unknowns. The line 2x + 3y = 17 contains infinitely many points, so x can be any real number that the equation allows. Picking y = 0 gives x = 8.5; picking y = 1 gives x = 7; picking y = 3 gives x = 4. Three different x values are consistent with Statement 1 alone, so Statement 1 is not sufficient. You have not even needed to look at Statement 2 yet.

On Statement 2, the constraint fixes y = 3. With y known, 2x + 3(3) = 17 gives 2x = 8, so x = 4. The value is unique, and no other value of x is consistent with y = 3. Statement 2 is sufficient on its own, so the answer is (B). Notice that the combination step is never reached: Statement 2 settled the question before the combination ever became relevant. This is the cleanest outcome on the exam, and it should feel fast.

The instructive part is what happens if you skip the isolated testing and jump to combination. If you read both statements together, you still get x = 4, and you might be tempted to choose (C). But (C) requires that neither statement alone is sufficient, and we just showed that Statement 2 alone is sufficient. The verdict is (B). Combining early would have cost you the correct answer. The exercise also shows why the "information is good" fallacy bites: Statement 1 is genuinely informative, and it does contribute to the system, but it does not settle the question alone. The answer is not "more information is better"; the answer is "which statement, on its own, removes the ambiguity".

Worked example: a symmetric counterexample

Consider Is x = y? Statement 1: x + y = 10. Statement 2: xy = 25. The stem is a yes/no question, so the test of sufficiency is whether the data force the same answer every time. The first read on each statement is its logical shape, not its arithmetic.

Statement 1, alone, says x + y = 10. Pick x = 4, y = 6. The pair satisfies the constraint and the answer to the stem is "no, x is not equal to y". Pick x = 5, y = 5. The pair satisfies the constraint and the answer is "yes, x equals y". Two scenarios, two different answers, so Statement 1 is not sufficient. The symmetric counterexample is the workhorse here: Statement 1 preserves the symmetry x ↔ y, so the equality question is wide open.

Statement 2, alone, says xy = 25. The pair x = 5, y = 5 satisfies it and gives "yes". The pair x = 1, y = 25 satisfies it and gives "no". The same symmetric counterexample kills Statement 2's sufficiency. Now the work moves to the combination: the two constraints together give x + y = 10 and xy = 25, which is the classic sum-and-product system. By Vieta's reasoning, x and y are the two roots of t² − 10t + 25 = 0, which is (t − 5)², so the only root is t = 5. Both x and y must equal 5, the answer to the stem is forced to "yes" in every case, and the combination is sufficient. The answer is (C).

The pedagogical point of this pair is that sufficiency in a yes/no question behaves the same way as sufficiency in a value question: you need a forced verdict. A statement that is consistent with both "yes" and "no" outcomes is not sufficient, no matter how informative it looks in isolation. The combination works here because it kills the symmetric counterexample, not because it adds arithmetic complexity.

Comparative table: how sufficiency behaves across question types

The following table summarises the sufficiency criterion for the five common stem types on Data Sufficiency, and it is the kind of reference most candidates print and keep next to their practice set. Treat it as a checklist: identify the row that matches your stem, then read the sufficiency criterion in the right-hand column before you touch the statements.

Stem type	What sufficiency requires	Common counterexample
Unique numerical value	The data force exactly one number	Two consistent examples with different values
Yes or no decision	The data force the same verdict every time	Two consistent examples with opposite verdicts
Could-be possibility	At least one consistent example exists	No consistent example can be constructed
Relationship (e.g., is x > y?)	The polarity is forced	A consistent example that flips the polarity
Count or integer set	The set is forced to one size or one membership	A consistent example that changes the set

Notice how the same logical move, finding a counterexample, applies to every row. The content of the counterexample changes, but the operation does not. That uniformity is what makes Data Sufficiency coachable: once you have a single counterexample habit, you can apply it across the entire question family without re-learning the rules for each stem type.

Integrating Data Sufficiency into a broader preparation plan

Data Sufficiency is roughly one-fifth of the GMAT Focus Quant section, so a sensible preparation budget allocates about 20% of Quant study time to the family. That budget splits, in my experience, into three phases. The first phase is a diagnostic phase: take 20 untimed Data Sufficiency items, score them, and categorise the misses by the error log described above. Most candidates discover that two or three of the six pitfalls account for 80% of their losses. The second phase is a focused phase: pick the two highest-frequency pitfalls, drill 30 items per pitfall, and re-score at the end of each block. The third phase is a mixed phase: blend Data Sufficiency with Problem Solving under timed conditions, so the pacing habits and the sufficiency habits get installed together rather than in isolation.

A common question at this point is how Data Sufficiency preparation should interact with the rest of the GMAT Focus Data Insights section. The short answer is that the logical habits transfer, but the question families do not. Data Sufficiency's signature feature, the two-statement structure, does not appear in Graphics Interpretation, Table Analysis, Multi-Source Reasoning, or Two-Part Analysis. The shared skill is the discipline of separating sufficiency from information. The way I would sequence this in a 10-week study plan is to spend weeks 3 and 4 on Data Sufficiency specifically, then weeks 5 through 8 on the Data Insights families, and reserve weeks 9 and 10 for mixed sets that include all of the above. That sequence lets the sufficiency habit settle before the broader logical habits of the Data Insights section take over.

Finally, treat Data Sufficiency as a question family you can master, not a question family you can hope to survive. The mechanical nature of the answer key means that the gains from a small set of repeatable habits are unusually large compared with the gains from a small set of arithmetic tricks. A candidate who has internalised the decision tree, the counterexample habit, and the isolated-testing discipline will outscore a candidate of equal raw ability who treats Data Sufficiency as 31 separate problems to be solved one at a time. The structure of the format is the strategy.

TestPrep Europe's diagnostic assessment is a natural starting point for candidates who want a sharper view of their Data Sufficiency baseline before they commit to the preparation plan above.

Frequently asked questions

How long should I spend on each GMAT Focus Data Sufficiency item?

Plan for 90 seconds of active thinking per item with a hard ceiling of 2 minutes 30 seconds. Items that exceed that budget are usually stuck for structural reasons, and the section's 62-minute window makes it more efficient to mark them, record a best-guess, and return only if time allows at the end.

What is the most common Data Sufficiency mistake on the GMAT Focus?

The most common mistake is finding one example that fits a statement and concluding the statement is sufficient. Sufficiency requires that every value consistent with the statement yields the same answer. A single example is necessary but not sufficient for sufficiency, which is why counterexample testing is the core habit.

How does the answer key for Data Sufficiency work on the GMAT Focus?

There are exactly five choices, and they are logical rather than ordinal. (A) means Statement 1 alone is sufficient, (B) means Statement 2 alone is sufficient, (C) means both together are sufficient but neither alone is, (D) means each alone is sufficient, and (E) means the data are insufficient even combined. The decision tree that routes between them is fixed and trainable.

Do the two statements ever need to be combined before testing?

No. The architecture of the answer key assumes that each statement is tested in isolation first. Combining before you finish isolated testing obscures the distinction between (A), (B), (C), and (D), and it usually costs more time than it saves. The combination step only becomes relevant after both single-statement tests have returned 'not sufficient'.

How does Data Sufficiency fit into overall GMAT Focus preparation?

Data Sufficiency accounts for roughly one-fifth of the Quant section, so a reasonable budget is about 20% of Quant study time. A workable sequence is a diagnostic phase to identify dominant error patterns, a focused phase of 30-item blocks on the two highest-frequency pitfalls, and a mixed phase that blends Data Sufficiency with Problem Solving under timed conditions.

How to read a GMAT Focus Data Sufficiency stem in under 30 seconds