The Law School Admission Test (LSAT) uses a 120-180 scaled scoring range, yet this single number conceals a layered statistical process that every candidate should understand. For those in the early or middle stages of LSAT preparation, a frequent and critical question arises: how reliably does a practice test score predict the result on test day? The answer involves understanding the mechanics of raw-to-scaled score conversion, the concept of score bands, the reliability of repeated administrations, and the role that practice test data plays in setting realistic targets. This article examines each of these dimensions to equip candidates with a more sophisticated framework for score prediction and strategic goal-setting.
The LSAT 120-180 scale: structure and meaning
The LSAT scaled score range spans from 120 to 180, with 120 representing the lowest possible score and 180 the highest. Unlike examinations where scores are expressed as percentages of questions answered correctly, the LSAT uses a equipercentile conversion process to translate raw performance into this fixed scale. This means that a raw score of approximately 67 out of approximately 100 questions answered correctly typically maps to a scaled score near 170, though the precise conversion varies slightly between test administrations to account for differences in difficulty.
Because the conversion is equipercentile rather than linear, equal intervals on the scale do not represent equal differences in raw performance. Moving from 165 to 170 on the scaled score requires a larger raw score improvement than moving from 155 to 160, because the distribution of scores becomes denser at the upper end of the range. Candidates who understand this non-linearity are better placed to interpret their practice test results and to set appropriately granular goals for improvement.
The percentile rank associated with each scaled score provides the context needed to understand where a given performance sits relative to all other LSAT test-takers over a recent period. A scaled score of 170, for instance, typically places a candidate well above the 97th percentile, whereas a score of 160 generally falls near the 80th percentile. These percentile benchmarks matter because law schools use them, alongside undergraduate grade averages, to compare candidates across administrations and demographic groups.
How raw scores convert to the LSAT scaled metric
The LSAT consists of four scored sections, each containing approximately 22-24 questions, for a total of approximately 99 scored questions across Logical Reasoning (two sections), Analytical Reasoning, and Reading Comprehension, plus an experimental section that is not identified to candidates. The experimental section is included to pre-test future questions and is not scored, but candidates cannot distinguish it from the scored sections during the test.
A candidate's raw score is simply the number of questions answered correctly across the scored sections. There is no penalty for incorrect answers, so every question attempted carries a positive expected value. The raw score is then converted to the 120-180 scale through a process that adjusts for the relative difficulty of the specific test form administered.
The conversion table for each LSAT administration is developed by the Law School Admission Council (LSAC) through extensive statistical analysis. The process involves equating, which ensures that a given scaled score represents the same level of ability regardless of which test form was taken. This means that a raw score of 85 might yield a scaled score of 172 on one test form and 173 on another, depending on the difficulty calibration of the questions included in each form.
Because equating is applied to the raw-to-scaled conversion, candidates can meaningfully compare scores across different test administrations. This is also why the LSAT is considered a standardised assessment in the strongest sense: the scale controls for variation in difficulty, allowing law schools and candidates to interpret scores within a stable frame of reference.
Key components of the raw-to-scaled conversion
- Number of correctly answered questions across all scored sections determines the raw score.
- Equating adjusts the conversion table for the difficulty of the specific test form used.
- The 120-180 scale is fixed and designed to be comparable across all LSAT administrations.
- No deduction applies for incorrect answers, making educated guessing strategically optimal on every question.
Practice test scores as predictors: what the data suggests
For candidates who complete multiple timed practice tests under realistic conditions, the resulting scores form a distribution that typically narrows over the course of preparation. Early practice tests often show wider variance, with scores fluctuating by several points between administrations. As candidates develop greater familiarity with question formats, timing strategies, and logical frameworks, both the mean and the consistency of their practice test scores tend to improve.
The most reliable predictor of test day performance among practice test metrics is the average of the most recent five to seven timed, full-length practice tests. This rolling average tends to predict the eventual scaled score within approximately three to four points for candidates who have completed at least ten full-length practice tests. The reason for this window is that very recent scores carry more diagnostic weight than distant ones, reflecting current ability most accurately, while the averaging smooths out the normal fluctuation between individual administrations.
The distribution of practice test scores also carries diagnostic information. Candidates whose practice scores cluster tightly around a central value — for example, a consistent range of 168 to 171 across multiple recent tests — typically demonstrate more reliable test-day performance than those whose scores swing widely, even if the averages are similar. Wide variance in practice tests can indicate inconsistent command of timing, variable engagement with different question types, or psychological factors such as test anxiety that may need to be addressed before the actual test.
It is worth noting that official LSATPrepTests published by LSAC are generally considered the most accurate predictors of test day scores because they use previously administered, equating-calibrated forms. Third-party practice tests vary in quality, and those that do not incorporate proper equating may produce misleading raw-to-scaled conversion estimates. Candidates are advised to rely primarily on official LSAC materials for predictive practice testing in the final weeks of preparation.
Factors that affect the reliability of practice test score predictions
- Testing environment fidelity: scores taken under timed, uninterrupted conditions with no reference materials more closely predict test day performance.
- Volume of practice tests completed: a larger sample of recent tests produces a more reliable average prediction.
- Quality of practice materials: official LSAC tests are equating-calibrated and thus more predictive than non-official sources.
- Stability of recent scores: a narrow range across the last five to seven tests indicates lower day-to-day variance and higher predictive reliability.
- Section-specific performance: uneven performance across Logical Reasoning, Analytical Reasoning, and Reading Comprehension can create unpredictability that aggregate scores mask.
Score bands and the standard error of measurement
The LSAT reports scores not as single points but within score bands that reflect the statistical uncertainty inherent in any measurement. The standard error of measurement (SEM) for the LSAT is approximately 2.5 to 3.0 scaled score points, meaning that a candidate's true underlying ability is estimated to lie within roughly plus or minus 2.5 to 3 points of the reported score approximately 68% of the time, and within a wider band approximately 95% of the time.
Score bands arise because no single test administration can measure ability with perfect precision. Variations in question selection, the candidate's focus and energy on a given day, and the specific logical configurations encountered all introduce measurement variability. The LSAT score band communicates this uncertainty explicitly, indicating that a reported score of 170 should be interpreted as representing a true ability range, not a precise fixed point.
For candidates and admissions committees, score bands have practical consequences. A candidate whose practice test scores and test day score both fall within the same general band — say, consistently between 168 and 172 — can be interpreted as demonstrating stable ability at that level. Conversely, a candidate whose score jumps significantly from one administration to another may have experienced a genuine change in ability, a favourable/unfavourable question selection, or statistical fluctuation that the score band framework helps to contextualise.