Home·Research Tests
Other quantum-chemistry studies:8-benchmark comparative · N₂ active space · Cr₂ active space

Contents

  1. Scope & methodological framing
  2. Executive summary
  3. The five tests an active space must pass
  4. Multi-basin reality & the limits of intra-active multistart
  5. Formaldehyde (CH₂O) — search-sensitivity benchmark
  6. Ethylene (C₂H₄) — flagship redirection
  7. 1,3-Butadiene (C₄H₆) — improvement over the NOON-MP2 baseline
  8. Convergence integrity across the three systems
  9. Key lessons
  10. Protocol & what is reported
  11. Conclusions

1. Scope & methodological framing

The principal metric for the performance of an active space is not whether it yields the lowest energy, but whether it yields a consistent description of a chemical bond. The active space must be chemically meaningful. There can be many local minima in CASSCF optimization, so one must be sure the optimizations are properly converged.

Traditional active-space ranking often overweights a single scalar — the optimized SA-CASSCF energy. On the photochemically interesting systems studied here, that scalar can reward chemically poor orbital sets (deep-σ correlation, far virtuals, masks that omit the frontier orbitals altogether). The contribution of this report is a chemistry-consistent validation framework that filters those failure modes before any energy ranking is performed and redirects the search toward physically meaningful sub-spaces.

The benchmark covers three frontier-driven photochemical systems — formaldehyde, ethylene and 1,3-butadiene — with CAS(2,2) and CAS(4,4), 11 independent GA seeds and 47 SA-CASSCF runs in total. No fitness adjustment, no hand-picked active space, and no partial scan is used; every winner is screened against five independent conditions before being reported.

2. Executive summary

11 / 11
accepted winners are chemically meaningful and converge cleanly
Across CH₂O · C₂H₄ · C₄H₆ · 11 independent GA seeds · 47 / 47 SA-CASSCF runs converged
5 / 11 winners that pure-energy ranking would have promoted are exposed as anti-chemical and rejected pre-CASSCF
Headline methodological observation. The SA-CASSCF orbital landscape on these systems is genuinely multi-basin. Simple orbital rotations cannot traverse all basins; combinatorial search over candidate sub-spaces can. On butadiene at φ = 86°, two CAS(4,4) solutions are separated by an essentially orthogonal active dimension (89.85°) and ≈ 18 kcal/mol of energy — a gap that 30 intra-active multistarts at σmax = 3.0 do not bridge. Quantitative evidence in §4.
Cross-molecule headline summary
Figure 1. Headline result. Fraction of GA winners that satisfy the per-CAS frontier-orbital rule, with and without the chemistry-aware acceptance condition. Without the condition, energy-only ranking promotes anti-chemical winners on CH₂O and C₂H₄; with the condition, every accepted winner is chemistry-valid.
11/11
chemistry-valid winners
(post-screen)
5/11
would have been anti-chemical
(without the screen)
3
benchmark molecules
(CH₂O · C₂H₄ · C₄H₆)
47
geometries probed
across the scans
47/47
SA-CASSCF converged
on accepted candidates
99/99
internal regression tests
pass on the gate logic

Practical workflow

STEP 1
Search
Combinatorial proposal of candidate active sub-spaces over the SCF MO list
STEP 2
Chemical gates
Frontier-orbital overlap, mask validity, frozen-core sanity — fail = veto
STEP 3
Convergence check
SA-CASSCF on every geometry; non-converged ⇒ worst-case sentinel
STEP 4
Rank by energy
RE,raw + smoothness diagnostics, only on candidates that passed steps 2 and 3

Chemistry validity and convergence are checked first; energy ranking is the last step, not the first.

3. The five tests an active space must pass

Lowest energy alone is not enough. For these frontier-driven photochemical benchmarks, an active space is reported only when it satisfies all five of the following conditions, in this order. The first three are hard gates (yes/no decisions); the last two are continuous diagnostics that accompany every accepted winner.

1Chemical meaning
The mask must contain the molecular orbitals physically involved in the studied photochemical channel — frontier overlap with {HOMO−1, HOMO, LUMO, LUMO+1} above a per-CAS threshold.
2Smooth path behaviour
The S₀ surface and the S₁−S₀ gap must vary smoothly across the reaction-coordinate scan; large adjacent-geometry jumps flag basin switching and disqualify the candidate.
3Proper convergence
Any non-converged SA-CASSCF cycle on any geometry of the scan propagates as a worst-case sentinel; partial scans cannot win the population step.
4Reproducibility across seeds
Independent GA seeds must converge to the same physical sub-space (or to a small number of chemistry-equivalent classes) before the result is reported.
5Competitive energy
Once 1–4 are satisfied, RE,raw = Σg S₀(g) is reported and compared to the NOON-MP2 baseline. Energy ranking is the last test, not the first.

Per-CAS frontier rule used by test 1

target CAS size required overlap with frontier window {HOMO−1, HOMO, LUMO, LUMO+1} rationale
CAS(2,2)≥ 1 of 4 frontier MOse.g. CH₂O n→π*; the n-orbital sits outside the strict HOMO/LUMO pair by construction
CAS(4,4)≥ 2 of 4 frontier MOsπ/π* manifolds; rejects σ-only and far-virtual-only winners

Test 1 acts as a pre-evaluation veto: candidates that fail it are not assigned an energy ranking and cannot win the GA step. It is not a soft penalty added to a score — it is a yes/no decision that prevents the optimizer from converging on a chemically empty solution because that solution happens to lower the SA-energy sum through σ-correlation.

4. Multi-basin reality & the limits of intra-active multistart

Why is a chemistry-aware acceptance condition necessary in the first place? Because the SA-CASSCF orbital-optimization landscape on these systems is genuinely multi-basin, and standard intra-active multistart cannot move between basins. This is a methodological observation, not an implementation detail: it is the reason a per-candidate energy ranking is unreliable on its own, and the reason the problem deserves a combinatorial exploration over candidate sub-spaces rather than a deeper local search inside one.

4.1 The butadiene basin-jump scan

Butadiene multi-basin landscape
Figure 2. SA(2)-CASSCF total energies along the C₂–C₃ torsion of 1,3-butadiene for three candidate CAS(4,4) active spaces, evaluated under identical multistart settings (M = 30 starts, σmax = 3.0). Three distinct basins are visible; the largest single-geometry split is 28 mEh ≈ 17.7 kcal/mol at φ = 86°. The shaded region marks the basin-jump zone where intra-active multistart cannot escape, even with aggressive rotation amplitude.

4.2 The geometric reason: the subspace-overlap diagnostic

Subspace overlap singular values
Figure 3. Singular values of the cross-overlap matrix S = MOAT·SAO·MOB between the active orbitals of the two competing CAS(4,4) basins at φ = 86°. Three of four active dimensions are 95–99 % shared; the fourth is essentially orthogonal (89.85° principal angle). This is the geometric reason intra-active unitary rotation cannot bridge these basins — it would require crossing a direction that sits outside the 4-dimensional sub-space being rotated.

The combinatorial exploration over candidate active sub-spaces samples the basin landscape independently of intra-active orientation: each candidate mask is a fresh sample. This is what allows the framework to satisfy two conditions that would otherwise be in tension — chemistry validity and proper convergence — without resorting to either a hand-picked active space or to a single-objective energy minimization that ignores chemistry.

What this implies practically. Different chemistry-valid masks may converge to distinct, physically meaningful basins. Simple orbital rotations cannot traverse all basins. Combinatorial mask exploration is therefore not a luxury; it is the appropriate sampling strategy at this scale.

5. Formaldehyde (CH₂O) — search-sensitivity benchmark

Role in the benchmark: a sensitivity benchmark for search initialization. CH₂O has only 2 active orbitals out of 30+ candidate MOs, so the GA's combinatorial budget is tight; this is the system that exposes whether the chemistry condition is strong enough to keep energy-only winners from drifting into deep-σ correlation when search room is small.
System: Formaldehyde  ·  CAS(2,2)  ·  cc-pVDZ  ·  SA(2) [0.5, 0.5]  ·  4 geometries along R(C=O)  ·  HOMO = 7, LUMO = 8, NOON-MP2 baseline mask = [6, 9]
Result: all 5 GA winners contain at least one frontier MO once the chemistry condition is enforced; the three energy-only winners that previously contained none are redirected toward the n→π* manifold.

Per-seed comparison (5 seeds)

seed energy-only winner (before screen) frontier MOs in mask chemistry-aware winner (after screen) frontier MOs in mask changed?
7 [3, 17] 0 [6, 25] 1yes
17 [3, 23] 0 [6, 22] 1yes
42 [6, 27] 1 [6, 27] 1
73 [6, 27] 1 [6, 27] 1
101 [3, 31] 0 [5, 9] 1yes

Three of five energy-only winners contained MO 3 (HOMO−4 region) — a deep-σ correlation that lowers the SA-energy sum without describing the n→π* manifold of formaldehyde. After the chemistry-aware condition, every winner contains at least one MO in {6, 7, 8, 9}. With the literature NOON-MP2 mask [6, 9] seeded into the initial population, all 5 seeds converge to it and confirm RE,raw ≈ −455.36 Eh with full SA-CASSCF convergence on every geometry.

What this measures. When a trusted baseline mask is seeded into the initial population, all 5 seeds converge to it cleanly. Without seeding, the GA does not always reach the n→π* manifold inside the 10-step × 50-organism budget — a normal search-budget effect on a 2-of-30 CAS, and exactly the regime where the chemistry condition earns its keep: it identifies the under-budget cases unambiguously and redirects them into the correct manifold.

6. Ethylene (C₂H₄) — flagship redirection

System: Ethylene torsion  ·  CAS(4,4)  ·  cc-pVDZ  ·  SA(2)  ·  HOMO = 7, LUMO = 8  ·  canonical π / π* manifold = [6, 7, 8, 9]
Result: the most flagrant anti-chemical winner of the entire benchmark — seed 73 selecting [2, 4, 21, 22] with zero HOMO/LUMO MOs — is rejected. The redirected winner is the textbook π/π* set.

Per-seed comparison (3 seeds)

seed energy-only winner (before screen) frontier MOs chemistry-aware winner (after screen) frontier MOs note
42 [4, 7, 8, 18] 2 [4, 7, 8, 18] 2chemistry-clean before and after; preserved
73 [2, 4, 21, 22] 0 [6, 7, 8, 9] 4redirected to canonical π / π*
101 [3, 7, 15, 46] 1 [6, 7, 8, 9] 4redirected to canonical π / π*

The two winners that the energy-only score promoted to the top — seed 73 with all MOs outside the frontier window, and seed 101 with only one MO inside — both collapse to the canonical π manifold [6, 7, 8, 9] = (HOMO−1, HOMO, LUMO, LUMO+1) under the chemistry-aware acceptance. Seed 42's pre-screen winner already contains both HOMO and LUMO and is preserved.

The number of distinct cross-seed equivalence classes for the ethylene torsion benchmark drops from 3 to 2. The remaining multimodality is between two chemistry-valid attractors — [6, 7, 8, 9] and [4, 7, 8, 18] — not between chemistry and anti-chemistry. Both contain HOMO and LUMO and have RE,raw lower than the NOON-MP2 baseline mask [5, 7, 8, 22]; this is now an honest fitness improvement, not a numerical exploit.

7. 1,3-Butadiene (C₄H₆) — improvement over the NOON-MP2 baseline

System: 1,3-butadiene torsion  ·  CAS(4,4)  ·  cc-pVDZ  ·  SA(2)
Result: π-canonic [13, 14, 15, 16] = (HOMO−1, HOMO, LUMO, LUMO+1) wins on 2 of 3 seeds in both regimes. The third seed converges to a chemistry-valid alternative that contains 2 frontier MOs. The framework outperforms the NOON-MP2 baseline by 25 mEh ≈ 16 kcal/mol on RE,raw — with a chemically cleaner mask than the baseline.

Per-seed comparison (3 seeds)

seed winner (both regimes) frontier MOs RE,raw / Eh vs NOON-MP2 baseline [13,14,15,19]
42 [13, 14, 15, 16]4−619.9220−25 mEh (≈ −16 kcal/mol)
73 [13, 14, 15, 16]4−619.9220−25 mEh (≈ −16 kcal/mol)
101 [13, 14, 29, 39]2−619.9220−25 mEh (within tolerance, distinct mask)

The NOON-MP2 baseline for butadiene CAS(4,4) selects MO 19 (LUMO+4) instead of MO 16 (LUMO+1) as the second virtual — a known weakness of NOON ranking on multi-π manifolds. The chemistry-aware framework reproduces this finding without ever bypassing the chemistry condition: the winners that beat NOON-MP2 contain HOMO and LUMO themselves.

The reported result is therefore not "the lowest energy" — it is "the lowest energy among the chemically meaningful candidates". For butadiene CAS(4,4), the two coincide; the report flags that the NOON-MP2 mask is the chemistry outlier in this case, and the framework picks the reasonable π set.

This is also the centrepiece example of the multi-basin observation in §4: the same butadiene scan exhibits ≈ 18 kcal/mol single-geometry splits between basins that intra-active multistart cannot bridge. The π-canonic basin is reachable here only because the search proposes different masks, not different rotations of the same mask.

8. Convergence integrity across the three systems

Cross-molecule convergence summary
Figure 4. SA(2)-CASSCF convergence on the accepted (chemistry-valid) winner of every seed, across the three molecules — 47 / 47 = 100 % geometry-level convergence.

No statistic in this report is taken on a partial scan. Every RE,raw is the sum of S₀ over the full set of geometries; if any geometry of the scan failed to converge, that candidate carries the worst-case sentinel and cannot win the population step. This is the architectural requirement that prevents a single accidentally-converged garbage geometry from short-circuiting an entire ranking. The NaN- and Inf-propagation paths are covered by the internal regression suite (99 / 99 unit tests).

9. Key lessons

The framework's practical workflow (Search → Chemical Gates → Convergence Check → Rank by Energy) is summarized in the Executive Summary. The lessons below are the take-aways from running it on the three benchmark systems.

What this benchmark teaches

  • Energy alone can mislead. 5 of 11 winners would have been anti-chemical without the chemistry condition.
  • Frontier consistency matters. For frontier-driven photochemical channels, a mask that omits the HOMO or the LUMO entirely cannot describe the physics — regardless of how attractive its raw energy is.
  • Multimodality is real. CASSCF orbital optimization on these systems exhibits multiple basins separated by directions outside the active sub-space; intra-active multistart does not bridge them, but combinatorial mask exploration does.
  • Search and validation must coexist. Combinatorial mask exploration provides the basin sampling; chemistry gates and convergence checks provide the discipline to keep only the physically meaningful results.

10. Protocol & what is reported

Method

Reported quantities

What is not reported

11. Conclusions

Across three benchmark systems, the chemistry-consistent framework repeatedly separated meaningful active spaces from misleading low-quality attractors, while preserving energetic competitiveness.
Headline. 11 / 11 chemistry-valid · 5 / 11 anti-chemical winners removed at the source · 47 / 47 SA-CASSCF runs converged · ≈ 16 kcal/mol over NOON-MP2 on butadiene · all numbers reproducible from the per-seed result files shipped with the report.