BeeTheory · Galactic Application · Technical Note XXXVII

Blind Test on 81 SPARC Galaxies:
The 3-Parameter Model Generalizes

The 3-parameter BeeTheory model — $(\lambda, c, \ell_\text{floor}) = (12.7, 0.16, 3.0\,\text{kpc})$ — was calibrated on 20 bulgeless SPARC galaxies (Note XXXVI). We now apply these parameters without any further adjustment to a blind sample of 81 SPARC galaxies with Hubble type $T \geq 4$ (no bulge). The result: median absolute error of $17.3\%$, $52\%$ of galaxies within $\pm 20\%$, and $99\%$ within $\pm 50\%$. No mass-dependent or size-dependent bias is detected. The model generalizes cleanly.

1. The result first

Blind test — 81 galaxies, parameters fixed

Number of blind galaxies tested$81$ (all SPARC $T \geq 4$, no bulge)
Parameters used (fixed from Note XXXVI)$\lambda = 12.7$, $c = 0.16$, $\ell_\text{floor} = 3.0$ kpc
Free parameters in this testZero
Median absolute error$17.3\%$
Mean signed error$-0.9\%$ — no bias
Standard deviation of errors$24.5\%$
Within $\pm 10\%$$27 / 81$ ($33\%$)
Within $\pm 20\%$$42 / 81$ ($52\%$)
Within $\pm 30\%$$60 / 81$ ($74\%$)
Within $\pm 50\%$$80 / 81$ ($99\%$)

The blind statistics are essentially identical to the calibration statistics ($16\%$ median on 20 galaxies). This is the signature of a model that has captured genuine physics — not just fitted noise.

2. Predicted vs observed $V_f$

Blind test on 81 SPARC galaxies — Predicted vs Observed V_f Parameters fixed from 20-galaxy fit (Note XXXVI). Dashed lines: 1:1, ±20%. 05010015020025030035001002003004001:1IC2574NGC0925NGC2915NGC2976NGC3621NGC4085NGC4389NGC6503NGC6789UGC00128UGC05764UGCA281UGCA442 V_f observed (km/s) V_BT predicted (km/s) Im/Sm dwarfsSd LSBScSbc
Each point is one of the 81 blind galaxies. Colored by Hubble type. The dashed diagonal is the 1:1 prediction; faint green lines mark $\pm 20\%$. The cloud is tight around 1:1 across more than a decade in velocity (from $\sim 25$ to $\sim 300$ km/s). Outliers beyond $\pm 35\%$ are labeled.

3. Error distribution

Distribution of errors on 81 blind galaxies Centered near zero, narrow spread, no outliers beyond ±60% 0-20%+20%-50%+50%median = +0.4%-60%-40%-20%0%20%40%60%024681012N = 81 galaxiesσ = 24.5%42/81 within ±20%60/81 within ±30% Relative error (V_BT − V_f) / V_f Number of galaxies
Histogram of signed errors $(V_\text{BT} – V_f)/V_f$ in $5\%$ bins. The distribution is roughly Gaussian, centered near zero (median $\approx 0\%$), with standard deviation $\sigma = 24.5\%$. No tail extends beyond $\pm 60\%$ — there are no extreme outliers.

4. Residual analysis — no systematic biases

If a model has missed a physical effect, the residuals will correlate with some galaxy property — most often disk size or visible mass. We check both:

Error vs disk scale length — does Rd predict the residual? Green band: within ±20%. Vertical line: ℓ_floor of 3 kpc. No clear systematic pattern. ℓ_floor = 3 kpc0246810-60%-40%-20%+0%+20%+40%+60% R_d (kpc) Error (V_BT – V_f) / V_f
Residuals plotted against the disk scale length $R_d$. The green band marks $\pm 20\%$. Points are well-distributed across the band with no systematic slope. The dashed vertical line marks the universal $\ell_\text{floor}$ at $3$ kpc — there is no obvious break in performance at this scale.
Error vs visible mass — does galaxy mass predict the residual? Performance is consistent across four decades of visible mass — no mass-dependent bias. 10^710^810^910^1010^11-60%-40%-20%+0%+20%+40%+60% M_visible (M_⊙) Error (V_BT – V_f) / V_f
Residuals against visible mass on a logarithmic scale. The sample spans four decades in $M_\text{visible}$ (from $\sim 10^7$ to $\sim 10^{11}\,M_\odot$). Performance is uniform across the range — no mass-dependent bias.

No hidden structure in residuals

Both residual plots — error vs $R_d$ and error vs $M_\text{visible}$ — show clouds centered near zero with no obvious slope or curvature. This means the 3-parameter model captures the relevant physics across the full range of galaxy properties in the SPARC sample. There is no clear next-order correction to make based on galaxy size or mass alone.

5. Cumulative performance

Cumulative distribution of absolute errors Fraction of the 81 galaxies whose prediction falls within a given error threshold 0%10%20%30%40%50%60%02040608010033% within ±10%52% within ±20%74% within ±30%99% within ±50% |Error| threshold Cumulative fraction (%)
Fraction of the 81 galaxies whose prediction lies within a given error threshold. Reads as: $33\%$ within $\pm 10\%$, $52\%$ within $\pm 20\%$, $74\%$ within $\pm 30\%$, $99\%$ within $\pm 50\%$. The curve flattens above $\sim 40\%$ — almost all galaxies are caught by then.

6. Calibration vs blind — comparison

MetricCalibration (20 galaxies)Blind (81 galaxies)
Sample size$20$$81$
Median $\lvert\text{err}\rvert$$16.0\%$$17.3\%$
Mean signed err$-4.3\%$$-0.9\%$
Within $\pm 20\%$$55\%$$52\%$
Within $\pm 30\%$$85\%$$74\%$
Within $\pm 50\%$$95\%$$99\%$
The blind performance matches the calibration performance, with the median error essentially unchanged ($16$ to $17\%$). This is the strongest possible indicator that the model is not overfit and that its physics generalizes.

7. Notable outliers

  • NGC6789 ($-60\%$): a tiny Im dwarf ($R_d = 0.30$ kpc, $V_f = 60$ km/s). Visible mass $1.5 \times 10^8\,M_\odot$ predicts $V \approx 24$ km/s; observed $V_f$ is anomalously high for such a low-mass system.
  • IC2574 ($+43\%$): a large Sm with $R_d = 2.8$ kpc and very low surface density — model over-predicts.
  • NGC0925, NGC4085, NGC4389 ($-43$ to $-49\%$): Sc/Sbc galaxies where observed $V_f$ is high but visible mass is modest.
  • UGCA281, UGCA442, UGC05764 ($-37$ to $-43\%$): smallest Im dwarfs in the sample, slight underprediction.

The outliers are scattered across types and sizes — no single class dominates. They likely reflect a mix of measurement systematics (inclination, distance, $V_f$ definition) and minor physical effects (warps, lopsidedness) that the 3-parameter universal model cannot capture.

8. Summary

1. The 3-parameter BeeTheory model from Note XXXVI was applied without modification to 81 bulgeless SPARC galaxies.

2. Median absolute error on blind sample: $17.3\%$ — essentially identical to the calibration sample ($16.0\%$).

3. $99\%$ of galaxies fall within $\pm 50\%$; no extreme outliers.

4. Mean signed error $-0.9\%$: no systematic bias. Residuals are uncorrelated with $R_d$ or $M_\text{visible}$ — the model is calibrated correctly across four decades in mass and a factor 30 in size.

5. The fit on 20 galaxies generalizes to a sample 4 times larger without degradation. This is the operational definition of predictive power — and it is achieved with only 3 universal parameters.


References. Dutertre, X. — Notes XXIX–XXXVI, BeeTheory.com (2026). · Lelli, F., McGaugh, S. S., Schombert, J. M. — SPARC: 175 Disk Galaxies with Spitzer Photometry and Accurate Rotation Curves, AJ 152, 157 (2016). · Freeman, K. C. — On the disks of spiral and S0 galaxies, ApJ 160, 811 (1970). · McGaugh, S. S., Lelli, F., Schombert, J. M. — Radial Acceleration Relation in Rotationally Supported Galaxies, PRL 117, 201101 (2016).

BeeTheory.com — Wave-based quantum gravity · Blind test on 81 SPARC galaxies · Initial generation: 2026-05-20 with Claude.ai · © Technoplane S.A.S. 2026