1. Scope & methodological framing
The principal metric for the performance of an active space is not whether it yields the lowest
energy, but whether it yields a consistent description of a chemical bond. The
active space must be chemically meaningful. There can be many local minima in
CASSCF optimization, so one must be sure the optimizations are properly converged.
Traditional active-space ranking often overweights a single scalar — the optimized SA-CASSCF
energy. On the photochemically interesting systems studied here, that scalar can reward
chemically poor orbital sets (deep-σ correlation, far virtuals, masks that omit the frontier
orbitals altogether). The contribution of this report is a chemistry-consistent
validation framework that filters those failure modes before any energy
ranking is performed and redirects the search toward physically meaningful sub-spaces.
The benchmark covers three frontier-driven photochemical systems — formaldehyde, ethylene
and 1,3-butadiene — with CAS(2,2) and CAS(4,4), 11 independent GA seeds and 47 SA-CASSCF
runs in total. No fitness adjustment, no hand-picked active space, and no partial scan is
used; every winner is screened against five independent conditions before being reported.
2. Executive summary
11 / 11
accepted winners are chemically meaningful and converge cleanly
Across CH₂O · C₂H₄ · C₄H₆ · 11 independent GA seeds · 47 / 47 SA-CASSCF runs converged
5 / 11 winners that pure-energy ranking would have promoted are exposed as anti-chemical and rejected pre-CASSCF
Headline methodological observation.
The SA-CASSCF orbital landscape on these systems is genuinely multi-basin.
Simple orbital rotations cannot traverse all basins; combinatorial search over candidate sub-spaces can.
On butadiene at φ = 86°, two CAS(4,4) solutions are separated by an essentially orthogonal active dimension
(89.85°) and ≈ 18 kcal/mol of energy — a gap that 30 intra-active multistarts at σmax = 3.0 do not bridge.
Quantitative evidence in §4.
- CH₂O behaves as a sensitivity benchmark for small-space search: on CAS(2,2) over 30+ candidate MOs, the chemically correct n→π* manifold is identifiable and reproducible once the chemistry condition is enforced.
- C₂H₄ contains the most flagrant anti-chemical failure of the benchmark — seed 73's σ-only winner with zero HOMO/LUMO content — and the chemistry condition redirects it to the canonical π/π* set
[6, 7, 8, 9].
- C₄H₆ beats the NOON-MP2 baseline by ≈ 16 kcal/mol on RE,raw with a chemically cleaner π-canonical solution; here the NOON-MP2 mask itself is the chemistry outlier.
Figure 1. Headline result. Fraction of GA winners that satisfy the per-CAS frontier-orbital
rule, with and without the chemistry-aware acceptance condition. Without the condition,
energy-only ranking promotes anti-chemical winners on CH₂O and C₂H₄; with the condition,
every accepted winner is chemistry-valid.
11/11
chemistry-valid winners
(post-screen)
5/11
would have been anti-chemical
(without the screen)
3
benchmark molecules
(CH₂O · C₂H₄ · C₄H₆)
47
geometries probed
across the scans
47/47
SA-CASSCF converged
on accepted candidates
99/99
internal regression tests
pass on the gate logic
Practical workflow
STEP 1
Search
Combinatorial proposal of candidate active sub-spaces over the SCF MO list
→
STEP 2
Chemical gates
Frontier-orbital overlap, mask validity, frozen-core sanity — fail = veto
→
STEP 3
Convergence check
SA-CASSCF on every geometry; non-converged ⇒ worst-case sentinel
→
STEP 4
Rank by energy
RE,raw + smoothness diagnostics, only on candidates that passed steps 2 and 3
Chemistry validity and convergence are checked first; energy ranking is the last step, not the first.
3. The five tests an active space must pass
Lowest energy alone is not enough. For these frontier-driven photochemical benchmarks, an
active space is reported only when it satisfies all five of the following conditions, in this
order. The first three are hard gates (yes/no decisions); the last two are continuous
diagnostics that accompany every accepted winner.
1Chemical meaning
The mask must contain the molecular orbitals physically involved in the studied photochemical channel — frontier overlap with {HOMO−1, HOMO, LUMO, LUMO+1} above a per-CAS threshold.
2Smooth path behaviour
The S₀ surface and the S₁−S₀ gap must vary smoothly across the reaction-coordinate scan; large adjacent-geometry jumps flag basin switching and disqualify the candidate.
3Proper convergence
Any non-converged SA-CASSCF cycle on any geometry of the scan propagates as a worst-case sentinel; partial scans cannot win the population step.
4Reproducibility across seeds
Independent GA seeds must converge to the same physical sub-space (or to a small number of chemistry-equivalent classes) before the result is reported.
5Competitive energy
Once 1–4 are satisfied, RE,raw = Σg S₀(g) is reported and compared to the NOON-MP2 baseline. Energy ranking is the last test, not the first.
Per-CAS frontier rule used by test 1
| target CAS size |
required overlap with frontier window {HOMO−1, HOMO, LUMO, LUMO+1} |
rationale |
| CAS(2,2) | ≥ 1 of 4 frontier MOs | e.g. CH₂O n→π*; the n-orbital sits outside the strict HOMO/LUMO pair by construction |
| CAS(4,4) | ≥ 2 of 4 frontier MOs | π/π* manifolds; rejects σ-only and far-virtual-only winners |
Test 1 acts as a pre-evaluation veto: candidates that fail it are not assigned an
energy ranking and cannot win the GA step. It is not a soft penalty added to a score — it is a
yes/no decision that prevents the optimizer from converging on a chemically empty solution
because that solution happens to lower the SA-energy sum through σ-correlation.
4. Multi-basin reality & the limits of intra-active multistart
Why is a chemistry-aware acceptance condition necessary in the first place? Because the
SA-CASSCF orbital-optimization landscape on these systems is genuinely multi-basin, and
standard intra-active multistart cannot move between basins. This is a methodological
observation, not an implementation detail: it is the reason a per-candidate energy ranking is
unreliable on its own, and the reason the problem deserves a combinatorial exploration over
candidate sub-spaces rather than a deeper local search inside one.
4.1 The butadiene basin-jump scan
Figure 2. SA(2)-CASSCF total energies along the C₂–C₃ torsion of 1,3-butadiene for three
candidate CAS(4,4) active spaces, evaluated under identical multistart settings
(M = 30 starts, σmax = 3.0). Three distinct basins are visible; the largest
single-geometry split is 28 mEh ≈ 17.7 kcal/mol at φ = 86°. The shaded
region marks the basin-jump zone where intra-active multistart cannot escape, even with
aggressive rotation amplitude.
4.2 The geometric reason: the subspace-overlap diagnostic
Figure 3. Singular values of the cross-overlap matrix
S = MOAT·SAO·MOB between the active orbitals of
the two competing CAS(4,4) basins at φ = 86°. Three of four active dimensions are 95–99 %
shared; the fourth is essentially orthogonal (89.85° principal angle). This is the geometric
reason intra-active unitary rotation cannot bridge these basins — it would require crossing a
direction that sits outside the 4-dimensional sub-space being rotated.
The combinatorial exploration over candidate active sub-spaces samples the basin landscape
independently of intra-active orientation: each candidate mask is a fresh sample. This is what
allows the framework to satisfy two conditions that would otherwise be in tension —
chemistry validity and proper convergence — without resorting to either a
hand-picked active space or to a single-objective energy minimization that ignores chemistry.
What this implies practically. Different chemistry-valid masks may converge to
distinct, physically meaningful basins. Simple orbital rotations cannot traverse all basins.
Combinatorial mask exploration is therefore not a luxury; it is the appropriate sampling
strategy at this scale.
5. Formaldehyde (CH₂O) — search-sensitivity benchmark
Role in the benchmark: a sensitivity benchmark for search initialization.
CH₂O has only 2 active orbitals out of 30+ candidate MOs, so the GA's combinatorial budget is
tight; this is the system that exposes whether the chemistry condition is strong enough to keep
energy-only winners from drifting into deep-σ correlation when search room is small.
System: Formaldehyde · CAS(2,2) · cc-pVDZ · SA(2) [0.5, 0.5] · 4 geometries
along R(C=O) · HOMO = 7, LUMO = 8, NOON-MP2 baseline mask = [6, 9]
Result: all 5 GA winners contain at least one frontier MO once the chemistry
condition is enforced; the three energy-only winners that previously contained none are
redirected toward the n→π* manifold.
Per-seed comparison (5 seeds)
| seed |
energy-only winner (before screen) |
frontier MOs in mask |
chemistry-aware winner (after screen) |
frontier MOs in mask |
changed? |
| 7 | [3, 17] | 0 | [6, 25] | 1 | yes |
| 17 | [3, 23] | 0 | [6, 22] | 1 | yes |
| 42 | [6, 27] | 1 | [6, 27] | 1 | — |
| 73 | [6, 27] | 1 | [6, 27] | 1 | — |
| 101 | [3, 31] | 0 | [5, 9] | 1 | yes |
Three of five energy-only winners contained MO 3 (HOMO−4 region) — a deep-σ correlation that
lowers the SA-energy sum without describing the n→π* manifold of formaldehyde. After the
chemistry-aware condition, every winner contains at least one MO in {6, 7, 8, 9}.
With the literature NOON-MP2 mask [6, 9] seeded into the initial population, all
5 seeds converge to it and confirm
RE,raw ≈ −455.36 Eh with full SA-CASSCF
convergence on every geometry.
What this measures. When a trusted baseline mask is seeded into the initial
population, all 5 seeds converge to it cleanly. Without seeding, the GA does not always reach
the n→π* manifold inside the 10-step × 50-organism budget — a normal search-budget effect on
a 2-of-30 CAS, and exactly the regime where the chemistry condition earns its keep: it
identifies the under-budget cases unambiguously and redirects them into the correct manifold.
6. Ethylene (C₂H₄) — flagship redirection
System: Ethylene torsion · CAS(4,4) · cc-pVDZ · SA(2) · HOMO = 7, LUMO = 8 · canonical π / π* manifold = [6, 7, 8, 9]
Result: the most flagrant anti-chemical winner of the entire benchmark — seed 73
selecting [2, 4, 21, 22] with zero HOMO/LUMO MOs — is rejected. The
redirected winner is the textbook π/π* set.
Per-seed comparison (3 seeds)
| seed |
energy-only winner (before screen) |
frontier MOs |
chemistry-aware winner (after screen) |
frontier MOs |
note |
| 42 | [4, 7, 8, 18] | 2 | [4, 7, 8, 18] | 2 | chemistry-clean before and after; preserved |
| 73 | [2, 4, 21, 22] | 0 | [6, 7, 8, 9] | 4 | redirected to canonical π / π* |
| 101 | [3, 7, 15, 46] | 1 | [6, 7, 8, 9] | 4 | redirected to canonical π / π* |
The two winners that the energy-only score promoted to the top — seed 73 with all MOs outside
the frontier window, and seed 101 with only one MO inside — both collapse to the canonical
π manifold [6, 7, 8, 9] = (HOMO−1, HOMO, LUMO, LUMO+1) under the
chemistry-aware acceptance. Seed 42's pre-screen winner already contains both HOMO and LUMO and
is preserved.
The number of distinct cross-seed equivalence classes for the ethylene torsion benchmark drops
from 3 to 2. The remaining multimodality is between two chemistry-valid
attractors — [6, 7, 8, 9] and [4, 7, 8, 18] — not between chemistry
and anti-chemistry. Both contain HOMO and LUMO and have RE,raw lower than the
NOON-MP2 baseline mask [5, 7, 8, 22]; this is now an honest fitness improvement,
not a numerical exploit.
7. 1,3-Butadiene (C₄H₆) — improvement over the NOON-MP2 baseline
System: 1,3-butadiene torsion · CAS(4,4) · cc-pVDZ · SA(2)
Result: π-canonic [13, 14, 15, 16] = (HOMO−1, HOMO, LUMO, LUMO+1)
wins on 2 of 3 seeds in both regimes. The third seed converges to a chemistry-valid alternative
that contains 2 frontier MOs. The framework outperforms the NOON-MP2 baseline by
25 mEh ≈ 16 kcal/mol on RE,raw — with a chemically
cleaner mask than the baseline.
Per-seed comparison (3 seeds)
| seed |
winner (both regimes) |
frontier MOs |
RE,raw / Eh |
vs NOON-MP2 baseline [13,14,15,19] |
| 42 | [13, 14, 15, 16] | 4 | −619.9220 | −25 mEh (≈ −16 kcal/mol) |
| 73 | [13, 14, 15, 16] | 4 | −619.9220 | −25 mEh (≈ −16 kcal/mol) |
| 101 | [13, 14, 29, 39] | 2 | −619.9220 | −25 mEh (within tolerance, distinct mask) |
The NOON-MP2 baseline for butadiene CAS(4,4) selects MO 19 (LUMO+4) instead of MO 16 (LUMO+1)
as the second virtual — a known weakness of NOON ranking on multi-π manifolds. The
chemistry-aware framework reproduces this finding without ever bypassing the chemistry
condition: the winners that beat NOON-MP2 contain HOMO and LUMO themselves.
The reported result is therefore not "the lowest energy" — it is "the lowest energy
among the chemically meaningful candidates". For butadiene CAS(4,4), the two
coincide; the report flags that the NOON-MP2 mask is the chemistry outlier in this case,
and the framework picks the reasonable π set.
This is also the centrepiece example of the multi-basin observation in §4: the same
butadiene scan exhibits ≈ 18 kcal/mol single-geometry splits between basins that intra-active
multistart cannot bridge. The π-canonic basin is reachable here only because the search
proposes different masks, not different rotations of the same mask.
8. Convergence integrity across the three systems
Figure 4. SA(2)-CASSCF convergence on the accepted (chemistry-valid) winner of every seed,
across the three molecules — 47 / 47 = 100 % geometry-level convergence.
No statistic in this report is taken on a partial scan. Every RE,raw is the sum of
S₀ over the full set of geometries; if any geometry of the scan failed to converge, that
candidate carries the worst-case sentinel and cannot win the population step. This is the
architectural requirement that prevents a single accidentally-converged garbage geometry from
short-circuiting an entire ranking. The NaN- and Inf-propagation paths are covered by the
internal regression suite (99 / 99 unit tests).
9. Key lessons
The framework's practical workflow (Search → Chemical Gates → Convergence Check → Rank by Energy)
is summarized in the Executive Summary. The lessons below are the take-aways from running it
on the three benchmark systems.
What this benchmark teaches
- Energy alone can mislead. 5 of 11 winners would have been anti-chemical without the chemistry condition.
- Frontier consistency matters. For frontier-driven photochemical channels, a mask that omits the HOMO or the LUMO entirely cannot describe the physics — regardless of how attractive its raw energy is.
- Multimodality is real. CASSCF orbital optimization on these systems exhibits multiple basins separated by directions outside the active sub-space; intra-active multistart does not bridge them, but combinatorial mask exploration does.
- Search and validation must coexist. Combinatorial mask exploration provides the basin sampling; chemistry gates and convergence checks provide the discipline to keep only the physically meaningful results.
10. Protocol & what is reported
Method
- Electronic structure: PySCF · SCF (RHF) reference · SA(2)-CASSCF with equal weights [0.5, 0.5]
- Basis: cc-pVDZ on every system
- Geometry scans: R(C=O) for CH₂O · torsion for C₂H₄ and C₄H₆
- Selector: inZOR-ND evolutionary search over MO subset masks · 3–5 independent seeds per molecule (11 total)
- Acceptance conditions: mask validity, frozen-core, frontier-orbital window (per-CAS rule)
Reported quantities
- Per-seed winning mask, with explicit listing of which MOs sit inside the frontier window
- RE,raw = Σg S₀(g) over the full scan
- S₀ second-derivative variance and largest S₁−S₀ gap jump (continuous diagnostics)
- SA-CASSCF convergence flag per geometry
- Cross-seed equivalence classes (different masks that converge to the same physical sub-space are reported as one class)
What is not reported
- No proprietary score is presented as the primary criterion; the chemistry-validity condition is structural and the smoothness diagnostics are physical.
- No claim is made that the framework supersedes any standard reference method (NOON, AVAS, hand-picked active spaces). Where the framework agrees with NOON-MP2, that is reported as agreement; where it disagrees (butadiene CAS(4,4)), the disagreement is documented with the underlying chemistry reason.
- No partially-converged scans are aggregated into the headline figures.
11. Conclusions
★
Across three benchmark systems, the chemistry-consistent framework repeatedly separated meaningful active spaces from misleading low-quality attractors, while preserving energetic competitiveness.
- The chemistry condition is non-redundant with energy ranking — 5 of 11 GA winners would have been anti-chemical without it; the other 6 are unaffected.
- Multi-basin behaviour is real — basins separated by orthogonal active directions and ≈ 18 kcal/mol on butadiene; intra-active multistart cannot bridge them, but combinatorial mask exploration can.
- Convergence is enforced architecturally — 47 / 47 SA-CASSCF runs converged on accepted candidates; non-converged geometries cannot win a sort.
- Energetic competitiveness is preserved or improved — on butadiene CAS(4,4) the GA winner beats NOON-MP2 by ≈ 16 kcal/mol on RE,raw while still containing both HOMO and LUMO.
Headline. 11 / 11 chemistry-valid · 5 / 11 anti-chemical winners removed at the
source · 47 / 47 SA-CASSCF runs converged · ≈ 16 kcal/mol over NOON-MP2 on butadiene · all
numbers reproducible from the per-seed result files shipped with the report.