Understanding Academic Performance &
Growth Patterns in VPK
This page is optimized for desktop viewing.
Study Overview
The ECPRG used the Early Childhood Integrated Data System linked dataset to investigate learning outcomes among students enrolled in VPK across Florida. Specifically, the ECPRG used machine learning and artificial intelligence to detect and describe kindergarten readiness (KR) growth patterns characterized by individual-, household-, and classroom-level features. Children have different experiences—both negative and positive—as members of their respective families and peer groups.¹,² Within education and numerous other fields, Bronfenbrenner’s bioecological systems framework attributes differential outcomes to different combinations of exposures, such that each exposure is considered in context.¹,² Children who attended VPK exhibited differential learning trajectories depending on family, peer and classroom context, with varying effects on development and preparation for kindergarten.
In this study, we investigated the effects of children’s home- and school-based learning environments on their academic growth, operationalized by (1) children’s initial scores on the Florida Assessment of Student Thinking (FAST) upon entering VPK and (2) their growth (difference between initial and end-of-year FAST assessment). Our machine learning and artificial intelligence analyses identified contexts under which children are likely to be ready for kindergarten.
1. Bronfenbrenner, U., & Morris, P. A. (2006). The bioecological model of human development. In W. Damon & R. M. Lerner (Eds.), Handbook of Child Psychology, Vol. 1: Theoretical models of human development (6th ed., pp. 793 – 828). New York: Wiley.
2. Elder Jr, G. H. (1998). The life course as developmental theory. Child Development, 69(1), 1-12.
Key Domains
Starting with a broad collection of child and family characteristics, and scores on the Classroom Assessment Scoring System (CLASS), our machine learning analyses identified five domains of predictors related to kindergarten readiness. Each of these five domains include directly observed characteristics of children, families, and classrooms. These individual characteristics, which constitute the broader domains, were included in the analyses that follow.

Mother’s Highest Education

Mother’s Birth Country

Health

Social Services

Attendance
Key Findings
school
Lower initial scores often associated with accelerated growth rates, which occur in specific contexts
supervisor_account
Combinations of family and sociodemographic factors contribute to initial scores
pattern
Complex interaction patterns identified between child, family, and classroom characteristics
monitoring
CLASS dimensions and composite had negligible associations with VPK growth, and only within narrowly-defined contexts
Figure 1. Machine Learning Framework
Terminal Node Random Forests
Ctree for Subgroup Identification
Random Forest for Initial Predictor Performance
Step 1: Identify globally important predictors across the entire population
- Variable importance scores represent population-level effects
- The threshold for variable selection becomes particularly meaningful as it determines the complexity of subsequent stages
- Selected variables represent robust, generalizable relationships since they emerge in the full population
- No need for external validation of variable importance since this represents the true population structure
Step 2: Identify subpopulations from the population based on the important predictors
- Splits identified using conditional inference trees (ctree) represent true population-level heterogeneity rather than sample-specific patterns
- Terminal nodes define genuine subpopulations rather than sample-based groupings
- The hierarchical structure reveals how key predictors interact to create naturally occurring subgroups
- Node size becomes a direct measure of subpopulation prevalence
- The sequence of splits shows the relative strength of different contextual effects in the population
Step 3: Identify conditional importance patterns within defined subpopulations
- Results reveal true conditional relationships within subpopulations
- Including all predictors (even those below the Stage 1 threshold) allows detection of context-specific importance
- Variable importance within nodes shows genuine heterogeneity in predictor relationships
- The comparative analysis across nodes reveals how context modifies predictor importance
- These analyses reveal second-order relationships after accounting for primary population structure
Methodology
Analysis based on comprehensive data collection across multiple domains, focusing on both baseline and progressive growth measures.
3. Breiman, L. (2001). Random forests. Machine Learning, 45, 5-32.
4. Hothorn, T., Hornik, K., & Zeileis, A. (2006). Unbiased recursive partitioning: A conditional inference framework. Journal of Computational and Graphical Statistics, 15(3), 651-674.
5. Shapley, L. (1953). 17. A value for n-person games. In H. Kuhn & A. Tucker (Ed.), Contributions to the Theory of Games, Volume II (pp. 307-318). Princeton: Princeton University Press.
Key Findings for Pre-VPK Performance and Monthly Growth Units (MGUs)
Education Level Growth Comparison
Education Level | Initial FAST | Growth Rate (MGU) | Growth vs Average |
---|---|---|---|
≤8th Grade | 609.7 | 1.18 | +17% |
9th-12th Grade | 624.9 | 1.10 | +10.3% |
HS/GED | 639.0 | 1.05 | +5.2% |
Some College | 650.2 | 1.00 | -0.4% |
Associate’s Degree | 657.6 | 1.00 | -0.2% |
Bachelor’s Degree | 672.2 | 0.94 | -5.6% |
Master’s Degree | 683.1 | 0.92 | -7.9% |
Doctoral Degree | 689.9 | 0.90 | -9.7% |
Definitions
- A Monthly Growth Unit (MGU) represents the average rate of growth; values greater than 1 indicate higher-than-average growth, while values less than 1 indicate lower-than-average growth.
- For Growth vs Average, growth at least 1% higher than average is presented in green, growth at least 1% below average is in red, and growth in between these values is presented in black.
Key Findings
- Growth-Education Inverse Relationship
– Higher education → Lower growth rates– Lower education → Higher growth rates
- Initial-Education Positive Relationship
– Higher education → Higher initial scores– Lower education → Higher initial scores
- Maximum Growth Differential
– 0.38 MGU gap (≤8th Grade vs Bachelor’s)– 38% growth rate difference
- Pattern suggests strong compensatory effect
Education Level and Attendance Relationship to Initial FAST Scores
Education Level | Low Attendance | Medium Attendance | High Attendance |
---|---|---|---|
≤8th Grade | 594.2 | 610.3 | 634.0 |
9th-12th Grade | 618.6 | 626.8 | 623.8 |
HS/GED | 629.9 | 640.4 | 643.2 |
Some College | 641.0 | 650.6 | 666.3 |
Associate’s Degree | 647.1 | 658.6 | 663.7 |
Bachelor’s Degree | 657.6 | 671.9 | 692.6 |
Master’s Degree | 672.5 | 682.7 | 699.8 |
Doctoral Degree | 685.4 | 688.5 | 707.9 |
Education Level and Attendance Effects on Growth
Education Level | Average MGU | Growth vs Average | ||||
---|---|---|---|---|---|---|
Low Attendance | Medium Attendance | High Attendance | Low Attendance | Medium Attendance | High Attendance | |
≤8th Grade | 0.97 | 1.21 | 1.23 | -3.4% | +21.0% | +23.4% |
9th-12th Grade | 1.02 | 1.10 | 1.36 | +1.9% | +10.0% | +35.9% |
HS/GED | 0.95 | 1.05 | 1.26 | -5.1% | +5.3% | +26.4% |
Some College | 0.91 | 1.00 | 1.11 | -9.4% | +0.3% | +11.0% |
Associate’s Degree | 0.96 | 0.99 | 1.20 | -3.9% | -1.1% | +19.8% |
Bachelor’s Degree | 0.94 | 0.94 | 0.97 | -5.5% | -5.9% | -2.9% |
Master’s Degree | 0.83 | 0.93 | 0.94 | -16.8% | -7.3% | -5.6% |
Doctoral Degree | 0.84 | 0.89 | 1.05 | -15.6% | -10.7% | +5.2% |
Definitions
- A Monthly Growth Unit (MGU) represents the average rate of growth; values greater than 1 indicate higher-than-average growth, while values less than 1 indicate lower-than-average growth.
- Attendance Levels:
– Low: Less than 50 hours per month– Medium: Between 50 and 60 hours per month– High: Greater than 60 hours per month
- For Growth vs Average, growth at least 1% higher than average is presented in green, growth at least 1% below average is in red, and growth in between these values is presented in black.
Key Findings
- Greater attendance associated with higher growth
– Consistent across education levels– Strongest at lower education levels
- Compensatory Power
– Highest MGUs: High attendance + lower education
SNAP Status and Education Level on Initial and Growth
Education Level | SNAP Status | Initial FAST | Average MGU | Growth vs Average |
---|---|---|---|---|
≤8th Grade | No | 606.2 | 1.22 | +21.8% |
Yes | 621.2 | 1.15 | +14.8% | |
9th-12th Grade | No | 624.3 | 1.11 | +11.1% |
Yes | 625.0 | 1.10 | +10.0% | |
HS/GED | No | 644.4 | 1.05 | +5.0% |
Yes | 636.2 | 1.05 | +5.3% | |
Some College | No | 655.3 | 0.97 | -2.9% |
Yes | 645.9 | 1.02 | +1.6% | |
Associate’s Degree | No | 663.3 | 0.97 | -3.2% |
Yes | 648.9 | 1.04 | +4.4% | |
Bachelor’s | No | 674.9 | 0.93 | -7.0% |
Yes | 658.8 | 1.01 | +1.4% | |
Master’s Degree | No | 684.7 | 0.92 | -7.8% |
Yes | 667.1 | 0.91 | -9.3% | |
Doctoral Degree | No | 692.0 | 0.89 | -10.5% |
Yes | 648.8 | 1.07 | +6.6% |
Definitions
- A Monthly Growth Unit (MGU) represents the average rate of growth; values greater than 1 indicate higher-than-average growth, while values less than 1 indicate lower-than-average growth.
- SNAP = Supplemental Nutrition Assistance Program
- For Growth vs Average, growth at least 1% higher than average is presented in green, growth at least 1% below average is in red, and growth in between these values is presented in black.
Key Findings
- SNAP use is associated with consistently lower initial scores across education levels.
- There is a compensatory effect of VPK for children who received SNAP.
- Education Level
- Education Level and VPK Attendance
- Education Level and Maternal Country of Origin
- Health Indicators
- SNAP
- WIC
- Marital Status
Initial Score and Growth Rate (MGU) Comparison
608-624
1.32
32%
608-624
1.17
17%
627-653
1.09
9%
638-671
1.00
Average
657-680
0.94
-6%
680-708
0.95
-5%
Key Findings
- Growth-Education Inverse Relationship
– Higher education → Lower growth rates– Lower education → Higher growth rates
- Maximum Growth Differential
– 0.38 MGU gap (≤8th vs Bachelor’s)– 38% growth rate difference
- Pattern suggests strong compensatory effect
Initial Score and Growth Rate (MGU) Comparison
Key Findings
- Greater attendance associated with higher growth
– Consistent across education levels– Strongest at lower education levels
- Compensatory Power
– Highest MGUs: High attendance + lower education
Initial Score Comparison
Growth Rate Comparison (GMU)
Compensatory Learning
- Children whose mothers are from the US have higher initial scores across all education levels.
- Children whose mothers are not from the US have higher growth rates across all education levels.
Early Health — Prenatal Visits (Kotelchuck Index)
Key Findings
- Higher Kotelchuck Index scores show persistent developmental benefits in terms of higher initial scores and greater MGUs.
Current Health Status (BMI)
Key Findings
- Lower maternal BMI is associated with higher initial scores.
- Children whose mothers had higher BMIs show a compensatory effect from VPK.
Initial Score and Growth Rate (MGU) Comparison
Key Findings
- SNAP use is associated with consistently lower initial scores across education levels.
- There is a compensatory effect of VPK for children who received SNAP.
Initial Score and Growth Rate (MGU) Comparison
Key Findings
- WIC use is associated with consistently lower initial scores across education levels.
- There is a compensatory effect of VPK for children who received WIC.
Initial Score and Growth Rate (MGU) Comparison
Key Findings
- Children whose mothers are married demonstrate consistently higher initial scores and have similar MGUs across the observed education levels
Note: Data for marriage status was only available for education levels Associates and above in the provided dataset.