Educational Stratification in PISA – A Browser-Based Statistical Tool for Exploratory Analysis of PISA Microdata, 2000–2022

Author: Kevin Schoenholzer

Correspondence: kevinschoenholzer.com

Date: 2026

Abstract. This article documents the tool (Educational Stratification in PISA), a browser-based tool for exploratory secondary analysis of OECD PISA student microdata across eight cycles (2000–2022). the tool is designed to study the intergenerational transmission of educational achievement—specifically, how parental characteristics (education, occupational status, and household wealth, captured through the ESCS index) relate to children's academic performance at age 15 in mathematics, reading, and science across more than 100 countries worldwide. The application serves as a practical "analysis workbench" for recurring first-pass tasks in large-scale assessment work: computing survey-weighted descriptive statistics, summarizing achievement distributions, estimating ESCS gradients and quartile gaps that quantify how strongly family background predicts student outcomes, comparing pooled and country-panel regression specifications, and decomposing variance into within- and between-country components. To remain usable as a static web application, the tool relies on pre-generated country–year subsets that are loaded on demand, and performs statistical computations client-side. This paper describes the data coverage, implementation architecture, statistical functionality, export formats, and typical use cases (exploration, teaching, and preparatory specification screening prior to code-based replication). The documentation also clarifies current limitations for inferential use, notably the use of a single plausible value per domain and the absence of replicate-weight variance estimation in the interactive layer.

Keywords: intergenerational transmission, educational achievement, ESCS, parental education, parental occupation, family wealth, PISA, survey weighting, regression, variance decomposition, web-based analytics, reproducibility

1. Introduction

OECD PISA is widely used in secondary analyses because it combines cross-nationally comparable achievement measures with extensive student background information, repeated in a regular cycle since 2000 (OECD, 2009, 2024). At the same time, routine empirical tasks with PISA microdata remain implementation-heavy for many users: the data arise from complex sampling; achievement is released as plausible values rather than single test scores; and results can be sensitive to choices such as which weights are used, which cycles and countries are included, and which regression specification is treated as the descriptive model of interest (Mislevy et al., 1992; Wu, 2005; OECD, 2009).

the tool was developed as a technical and educational tool to make these common analytic operations easier to run, easier to iterate, and easier to export in a structured form, without requiring local installation or bespoke scripts for each exploratory question. The central research question the tool supports is: how do parental characteristics—specifically education, occupational status, and household resources—predict children's educational achievement at age 15? This is the core question of intergenerational transmission of educational achievement, and the tool operationalizes it through the PISA ESCS index and related family background measures.

1.1 Contribution and Audience

The application is intended for researchers, instructors, and students who need rigorous comparative analysis without bespoke statistical code. Its contribution is both methodological and infrastructural:

the tool interface screenshot
Figure 1. The the tool (Educational Stratification in PISA) interface. The application provides modular analysis tabs and export tools for publication-quality outputs examining how family background relates to student achievement.

2. Data Coverage, Provenance, and Processing

the tool currently covers eight PISA cycles (2000, 2003, 2006, 2009, 2012, 2015, 2018, 2022), encompassing over 100 participating countries and economies and approximately 3.5 million student observations. Data access and harmonization are supported through the learningtower R package, which provides cleaned and harmonized PISA extracts with standardized variable naming across cycles; the CRAN release for learningtower (version 1.1.0) documents coverage for 2000–2022 and provides the recommended package citation (Wang et al., 2024).

2.1 Static Data Architecture

A central implementation choice is a static, browser-friendly data layout. Instead of loading a full cross-cycle microdata file into memory, the tool stores data as country–year JSON "chunks" and loads only the requested subsets when a user selects countries and years. This reduces memory load (typically 30–40 MB for a working selection versus 1.25 GB for the full dataset) and supports interactive use in standard browsers. The chunking strategy and sample-construction rules (including current listwise deletion for required fields) are documented in the project's data and methodology pages.

2.2 Key Variables: ESCS and the Measurement of Family Background

The primary outcome variables are PISA achievement scores in mathematics, reading, and science—measuring what 15-year-olds know and can do in these domains. The primary stratification variable is ESCS (Economic, Social, and Cultural Status), which operationalizes family background through three components:

ESCS is standardized to an OECD-referenced scale (mean 0, SD 1 for OECD countries), making it interpretable as a composite measure of family socioeconomic advantage. Recent OECD technical documentation summarizes the construction and intended use for cross-country and trend analyses (Wuyts, 2024). When users examine ESCS gradients or quartile gaps in the tool, they are directly measuring the strength of intergenerational transmission—how strongly these parental characteristics predict their children's academic performance at age 15.

3. Outcomes, Weights, and the Interpretation of Achievement

3.1 Survey Weighting

Because PISA is a complex sample, the tool's descriptive and regression outputs are computed using sampling weights. The tool defaults to the final student weight (W_FSTUWT) and optionally supports senate weights (W_FSENWT) for alternative estimands; replicate weights are described in the documentation but are not currently used in the interactive variance estimation (OECD, 2009).

Weighted mean: μ = Σ(w_i × y_i) / Σ(w_i) Weighted variance: σ² = Σ(w_i × (y_i - μ)²) / Σ(w_i)

3.2 Plausible Values

Achievement in PISA is released as plausible values, which are designed to support valid population inference under matrix sampling and latent proficiency estimation. Standard practice treats plausible values analogously to multiple imputations: analysts repeat computations across plausible values and combine estimates and variances accordingly (Mislevy et al., 1992; Rubin, 1987; Wu, 2005). For interactivity, the tool currently uses the first plausible value per domain in the web interface and flags this explicitly; users aiming for publication-grade inference are expected to replicate selected specifications using all plausible values and appropriate variance procedures.

4. Statistical Functionality

the tool implements a set of recurring "building blocks" that users typically assemble when producing descriptive briefs, teaching demonstrations, or preliminary robustness checks focused on intergenerational transmission of educational achievement.

4.1 Descriptive Statistics

The tool computes survey-weighted descriptive statistics, including weighted means, variances, and weighted quantiles. These quantities are used both for direct reporting and as inputs to distributional summaries.

4.2 Distributional Summaries

the tool summarizes achievement distributions using indices such as the Gini coefficient (with Lorenz curve display), the coefficient of variation, and percentile ratios. These measures characterize the spread and shape of achievement distributions within and across countries.

4.3 ESCS Gradients: Measuring Intergenerational Transmission

The ESCS gradient is the central measure for quantifying intergenerational transmission. It is computed as the slope from a survey-weighted regression of achievement on ESCS (optionally including simple controls such as gender and parental education). This coefficient represents how many score points of achievement are associated with a one-unit increase in family socioeconomic status—directly measuring how strongly parental characteristics predict children's outcomes.

Achievement_i = β₀ + β₁ × ESCS_i + ε_i β₁ = ESCS gradient (intergenerational transmission coefficient)

A larger ESCS gradient indicates stronger intergenerational transmission—children's achievement is more strongly predicted by their parents' education, occupation, and wealth. Cross-country comparisons of ESCS gradients reveal which education systems show more or less mobility in educational outcomes.

4.4 Quartile Gaps: Family Background Differences in Achievement

the tool computes quartile-based ESCS gaps by constructing ESCS quartiles using weighted ranks and reporting Q4–Q1 differences (the gap between children from the most and least advantaged family backgrounds) along with standardized effect sizes (Cohen's d). This provides an intuitive measure of how much family background matters: the score-point difference between students from high-ESCS versus low-ESCS families.

4.5 Regression Models

the tool provides a small family of regression specifications intended for quick comparison rather than definitive modeling:

Pooled OLS: y_i = β₀ + β₁x_i + γ'c_i + ε_i Fixed Effects: y_it = α_i + β₁x_it + γ'c_it + ε_it Random Effects: y_it = β₀ + β₁x_it + γ'c_it + u_i + ε_it

The interface reports coefficient tables and conventional fit metrics (including AIC/BIC) and offers standard diagnostic plots (residuals, Q-Q plots). Model diagnostics include Hausman tests when both FE and RE models are available.

4.6 Variance Decomposition

the tool includes a variance decomposition module that partitions variance into within- and between-country components and reports an ICC-style summary of clustering at the country level. This reveals how much variation in the relationship between family background and achievement occurs within versus between countries.

5. Implementation Architecture, Exports, and Reproducibility

the tool is implemented as a modular ES6 JavaScript application with an R-based preprocessing pipeline that generates country–year data chunks and a metadata catalog used to drive the interface and loading logic. All computations in the interactive application are performed client-side after data are loaded, which keeps deployment simple (static hosting) and makes the computational steps inspectable in the project materials.

Component Purpose Reference
Data generation scripts Extract and chunk country-year microdata Pipeline scripts
Methodology appendix Formal statistical definitions and formulas Methodology document
Data sources Variable provenance and coverage Data sources document

5.1 Exportability

A practical design priority is exportability. the tool is intended to support workflows where users explore specifications interactively and then export outputs for inclusion in briefs, appendices, slides, or follow-up coding. The tool exports:

This is not a substitute for a full scripted pipeline, but it aligns with standard expectations for computational reproducibility in exploratory work, where the key requirement is that the analytic choices are recorded and can be re-executed in a more formal environment (Peng, 2011).

Reproducibility note: All exports encode the selected countries, years, outcome variables, predictor variables, control choices, and weighting schemes at the time of analysis, so that results can be traced back to specific specifications.

6. Typical Use Cases

the tool is most useful when the goal is to move quickly from a question about intergenerational transmission to a defensible descriptive answer, while keeping specification choices visible.

6.1 Cross-Country Profiling

Select a year and a set of countries to compare weighted distributional summaries and ESCS gradients, then export a compact table and figures for briefing or teaching. This reveals which countries show stronger or weaker intergenerational transmission of educational achievement.

6.2 Sensitivity Screening

Compare pooled and country-panel specifications, toggle weight conventions (where appropriate to the estimand), and assess whether headline patterns about family background effects are stable to common alternatives before deciding which models deserve full PV-complete, replicate-weight inference in a code-based workflow.

6.3 Trend Exploration

Inspect how ESCS gaps and gradients evolve across cycles for selected countries, treating the outputs as descriptive signals about whether intergenerational transmission is strengthening or weakening over time.

6.4 Classroom Instruction

the tool can be used to demonstrate why weights matter, what plausible values are, and how different regression specifications change descriptive associations between family background and achievement, without requiring students to install specialized software or manage large microdata files (OECD, 2009; Wu, 2005).

7. Limitations and Appropriate Interpretation

the tool's outputs should be interpreted as descriptive associations produced by a transparent, interactive implementation, not as definitive inferential results.

8. Conclusion

the tool is a browser-based statistical tool that packages common first-pass operations used in large-scale assessment work—weighted descriptives, distributional summaries, ESCS gradients and quartile gaps measuring intergenerational transmission, specification comparisons, and variance decomposition—into a static, export-oriented interface for PISA cycles 2000–2022. Its value is primarily practical: it shortens the path from question to documented descriptive output about how parental education, occupation, and wealth relate to children's educational achievement at age 15 worldwide. It supports teaching and methods demonstrations, and helps users define and screen specifications before re-estimating selected models with full plausible-value combination and replicate-weight inference in a dedicated statistical environment.

Software Availability and Documentation

References

Mislevy, R. J., Beaton, A. E., Kaplan, B., & Sheehan, K. M. (1992). Estimating population characteristics from sparse matrix samples of item responses. Journal of Educational Measurement, 29(2), 133–161. https://doi.org/10.1111/j.1745-3984.1992.tb00371.x

OECD. (2009). PISA data analysis manual: SPSS (2nd ed.). OECD Publishing. https://doi.org/10.1787/9789264056275-en

OECD. (2024). PISA 2022 Technical Report. OECD Publishing. https://doi.org/10.1787/01820d6d-en

Peng, R. D. (2011). Reproducible research in computational science. Science, 334(6060), 1226–1227. https://doi.org/10.1126/science.1213847

Rubin, D. B. (1987). Multiple imputation for nonresponse in surveys. John Wiley & Sons. https://doi.org/10.1002/9780470316696

Wang, K., Yacobellis, P., Siregar, E., Romanes, S., Fitter, K., Dalla Riva, G. V., Cook, D., Tierney, N., Dingorkar, P., Sai Subramanian, S., & Chen, G. (2024). learningtower: OECD PISA datasets from 2000–2022 in an easy-to-use format (R package, Version 1.1.0). https://doi.org/10.32614/CRAN.package.learningtower

Wu, M. (2005). The role of plausible values in large-scale surveys. Studies in Educational Evaluation, 31(2–3), 114–128. https://doi.org/10.1016/j.stueduc.2005.05.005

Wuyts, C. (2024). The measurement of socio-economic status in PISA. OECD.

Back to Application