Tutorial: Full Workflow¶

A complete end-to-end guide for optimizing a Transition State Force Field (TSFF) using Q2MM's clean model layer. We walk through the SN2 reaction F⁻ + CH₃F → FCH₃ + F⁻ — a textbook nucleophilic substitution with a well-defined D₃ₕ-like transition state.

Prerequisites¶

What you need before starting

Python 3.10+ with Q2MM installed (pip install q2mm)
NumPy and SciPy (installed automatically with Q2MM)
An MM engine — OpenMM (pip install openmm), JAX (pip install "q2mm[jax]"), JAX-MD (pip install "q2mm[jax-md]"), or Tinker (free for academic use)
The SN2 example files in examples/sn2-test/

QM engine optional: This tutorial includes pre-computed QM reference data in examples/sn2-test/qm-reference/, so you can complete the full workflow without a QM engine. If you want to generate your own QM data, you'll need Psi4 or Gaussian.

Quick install

pip install "q2mm[openmm]"              # Q2MM + OpenMM backend (from PyPI)

Add --pre to pip install if a stable release hasn't been published yet.

Atom numbering for this tutorial (0-indexed):

Index   Element   Role
  0       C       Central carbon
  1       F       Leaving / attacking fluorine
  2       F       Leaving / attacking fluorine
  3       H       Methyl hydrogen
  4       H       Methyl hydrogen
  5       H       Methyl hydrogen

Step 1: Obtain QM Reference Data¶

Every TSFF parameterisation starts with quantum-mechanical reference data for the transition state: an optimized geometry and the Hessian matrix (second derivatives of the energy with respect to nuclear coordinates).

Using pre-computed data (fastest)

The examples/sn2-test/qm-reference/ directory contains ready-to-use QM data for the SN2 tutorial. No QM engine needed:

import numpy as np
from pathlib import Path

QM_REF = Path("examples/sn2-test/qm-reference")

hessian     = np.load(str(QM_REF / "sn2-ts-hessian.npy"))       # (18, 18)
frequencies = np.loadtxt(QM_REF / "sn2-ts-frequencies.txt")     # cm⁻¹
# Geometry is loaded in Step 2 via Q2MMMolecule.from_xyz()

Skip to Step 2 if using these files.

Generating your own QM data¶

If you want to run the QM calculation yourself (or adapt this for your own molecule), expand the section for your QM engine:

Psi4 (recommended, open-source)

Psi4 runs inside Python — you extract the Hessian and frequencies directly from the wavefunction object (wfn), then save them as NumPy arrays.

import numpy as np
import psi4

psi4.set_memory("2 GB")
psi4.set_num_threads(4)
psi4.core.set_output_file("psi4-output.dat", False)

# Define the SN2 transition-state geometry (charge −1, singlet)
ts_mol = psi4.geometry("""
    -1 1
    C     0.000000    0.000000    0.000000
    F     0.000000    0.000000    1.800000
    F     0.000000    0.000000   -1.800000
    H     1.026720    0.000000    0.000000
    H    -0.513360    0.889165    0.000000
    H    -0.513360   -0.889165    0.000000
""")

# Saddle-point optimisation at B3LYP/6-31G*
psi4.set_options({
    "basis": "6-31G*",
    "reference": "rhf",
    "opt_type": "ts",          # ← saddle-point search
    "geom_maxiter": 100,
})
ts_energy = psi4.optimize("b3lyp", molecule=ts_mol)

# Frequency calculation → Hessian
ts_energy_freq, ts_wfn = psi4.frequency(
    "b3lyp", molecule=ts_mol, return_wfn=True
)
hessian = np.array(ts_wfn.hessian())          # shape (3N, 3N), Hartree/Bohr²
frequencies = np.array(ts_wfn.frequencies())   # cm⁻¹

# Verify: exactly 1 imaginary frequency (negative value) = valid TS
n_imaginary = np.sum(frequencies < 0)
assert n_imaginary == 1, f"Expected 1 imaginary freq, got {n_imaginary}"

# Save for later steps
ts_mol.save_xyz_file("qm-reference/sn2-ts-optimized.xyz", True)
np.save("qm-reference/sn2-ts-hessian.npy", hessian)
np.savetxt("qm-reference/sn2-ts-frequencies.txt", frequencies)

Install: conda install psi4 -c conda-forge

Gaussian (commercial license)

Run a opt=(ts,calcfc) freq job, then parse the log file with Q2MM's GaussLog parser:

from q2mm.io.gaussian import GaussLog
from q2mm.models.hessian import reform_hessian

log = GaussLog("sn2-ts.log", au_hessian=True)

# Geometry comes from the archive section
structures = log.structures          # list of Structure objects
atoms = structures[-1].atoms         # last (optimized) geometry

# Reconstruct the Cartesian Hessian from eigenvalues / eigenvectors
eigenvalues = log.evals
eigenvectors = log.evecs
hessian = reform_hessian(eigenvalues, eigenvectors)

Pass au_hessian=True to keep the Hessian in atomic units (Hartree/Bohr²) — QFUERZA estimation expects this.

Jaguar (Schrödinger)

Parse the .in file (Hessian) and .out file (frequencies, eigenvectors):

from q2mm.io.jaguar import JaguarIn, JaguarOut

jag_out = JaguarOut("sn2-ts.out")
eigenvalues = jag_out.eigenvalues
eigenvectors = jag_out.eigenvectors
structures = jag_out.structures
frequencies = jag_out.frequencies

num_atoms = structures[0].coords.shape[0]
jag_in = JaguarIn("sn2-ts.in")
hessian = jag_in.get_hessian(num_atoms)   # (3N, 3N), Hartree/Bohr²

Jaguar is commonly used for organometallic transition states where pseudopotentials like LACVP** are needed. See the Rh-enamide benchmark for a worked case study.

Transition-state validation

A valid transition state has exactly one imaginary (negative) vibrational frequency — the reaction coordinate. If you see zero or more than one, the geometry has not converged to a first-order saddle point.

Step 2: Build a Q2MMMolecule¶

Q2MMMolecule is Q2MM's format-agnostic molecular structure. It auto-detects bonds and angles from covalent radii and stores the QM Hessian alongside the geometry.

Bond detection and bond_tolerance

Not all file formats include bond information — XYZ files, for instance, only store atom symbols and Cartesian coordinates. When connectivity is missing, Q2MM infers bonds by comparing every atom–atom distance to the sum of their covalent radii scaled by bond_tolerance:

bonded if distance < bond_tolerance × (r_cov_A + r_cov_B)

The default bond_tolerance=1.3 works for ground-state molecules. For transition states — where bonds are partially formed or broken — you typically need 1.4 or higher. For example, the C–F distance in the SN2 TS (~1.84 Å) is much longer than a typical C–F bond (~1.38 Å). If bonds are missing from your molecule, increase this value.

Formats that do include explicit bond tables (MOL2, MacroModel .mmo) skip detection entirely — use from_structure() and the bonds and angles from the file are preserved as-is, with no recalculation.

From an XYZ file (simplest)

The XYZ and Hessian files here were saved by Psi4 in Step 1 (ts_mol.save_xyz_file(...) and np.save(..., hessian)). If you skipped that step, the pre-computed files in examples/sn2-test/qm-reference/ are identical.

import numpy as np
from pathlib import Path
from q2mm.models.molecule import Q2MMMolecule

QM_REF = Path("examples/sn2-test/qm-reference")

# Load the optimised TS geometry saved by Psi4
mol = Q2MMMolecule.from_xyz(
    QM_REF / "sn2-ts-optimized.xyz",
    charge=-1,
    name="SN2_TS",
    bond_tolerance=1.4,   # ← 1.4× covalent radii to catch long TS bonds
)

# Attach the QM Hessian (also saved from Psi4's wfn object)
hessian = np.load(str(QM_REF / "sn2-ts-hessian.npy"))
mol = mol.with_hessian(hessian)

# Inspect auto-detected connectivity
print(f"Atoms:  {mol.n_atoms}")
print(f"Bonds:  {len(mol.bonds)}")
print(f"Angles: {len(mol.angles)}")

for bond in mol.bonds:
    print(f"  {bond.element_pair}: {bond.length:.4f} Å")

Expected output:

Atoms:  6
Bonds:  5
Angles: 7
  ('C', 'F'): 1.8427 Å
  ('C', 'F'): 1.8427 Å
  ('C', 'H'): 1.0767 Å
  ('C', 'H'): 1.0767 Å
  ('C', 'H'): 1.0767 Å

From a Gaussian log file

If you already have a Gaussian opt freq log file, you can build the molecule directly from the parsed structures — no separate XYZ file needed:

from q2mm.io.gaussian import GaussLog
from q2mm.models.molecule import Q2MMMolecule
from q2mm.models.hessian import reform_hessian

log = GaussLog("sn2-ts.log", au_hessian=True)

# Build molecule from the last (optimised) structure
structure = log.structures[-1]
mol = Q2MMMolecule.from_structure(
    structure,
    charge=-1,
    bond_tolerance=1.4,
    hessian=reform_hessian(log.evals, log.evecs),
)

The from_structure() constructor also preserves atom type labels from MacroModel .mmo files, which is useful for matching to existing force field parameters.

From a QCElemental Molecule

If you use QCElemental in your QM workflow:

import qcelemental as qcel
from q2mm.models.molecule import Q2MMMolecule

qcel_mol = qcel.models.Molecule(...)
mol = Q2MMMolecule.from_qcel(qcel_mol, name="my-molecule")

From raw arrays (manual construction)

If your data comes from a custom source rather than an XYZ file:

import numpy as np
from q2mm.models.molecule import Q2MMMolecule

coordinates = np.array([
    [ 0.000000,  0.000000,  0.000000],   # C
    [ 0.000000,  0.000000,  1.800000],   # F
    [ 0.000000,  0.000000, -1.800000],   # F
    [ 1.026720,  0.000000,  0.000000],   # H
    [-0.513360,  0.889165,  0.000000],   # H
    [-0.513360, -0.889165,  0.000000],   # H
])

mol = Q2MMMolecule(
    symbols=["C", "F", "F", "H", "H", "H"],
    geometry=coordinates,
    charge=-1,
    name="sn2-ts",
    bond_tolerance=1.4,
    hessian=hessian,   # (18×18) array in Hartree/Bohr²
)

print(f"Bonds: {len(mol.bonds)}, Angles: {len(mol.angles)}")

Step 3: Initialise the Force Field with QFUERZA¶

QFUERZA (Farrugia et al., J. Chem. Theory Comput. 2025, 22, 469–476) extracts harmonic force constants directly from the QM Hessian matrix using Seminario projection. For each bond or angle, it projects the Hessian onto the internal coordinate's subspace and takes the eigenvalue along that direction. For hydrogen angle bends — where plain projection overestimates by ~2× — QFUERZA substitutes a reliable empirical default. This produces excellent initial parameter estimates — often within 10–20% of the final optimised values — without running a single MM calculation.

Quick start — auto-create and estimate

from q2mm.models.seminario import estimate_force_constants

# estimate_force_constants accepts a single molecule or a list
ff = estimate_force_constants(
    mol,
    zero_torsions=True,    # set torsion barriers to zero (common for TS)
    au_hessian=True,       # Hessian is in Hartree/Bohr²
    invalid_policy="skip", # skip negative force constants (TS artefacts)
)

print(f"Bond params:    {len(ff.bonds)}")
print(f"Angle params:   {len(ff.angles)}")
print(f"Torsion params: {len(ff.torsions)}")

for b in ff.bonds:
    print(f"  {b.elements}: k = {b.force_constant:.3f} mdyn/Å, "
          f"r₀ = {b.equilibrium:.4f} Å")
for a in ff.angles:
    print(f"  {a.elements}: k = {a.force_constant:.6f} mdyn·Å/rad², "
          f"θ₀ = {a.equilibrium:.1f}°")

With an existing force field template

If you already have an MM3 .fld file with initial guesses (or placeholder values), pass it so QFUERZA updates the force constants in place while preserving atom types and row numbers:

from q2mm.models.forcefield import ForceField
from q2mm.models.seminario import estimate_force_constants

# Load template with initial guesses (replace with your .fld path)
initial_ff = ForceField.from_mm3_fld("my-system.fld")

# QFUERZA updates force constants, keeps equilibrium values and metadata
estimated_ff = estimate_force_constants(
    mol,
    forcefield=initial_ff,
    zero_torsions=True,
    au_hessian=True,
    invalid_policy="skip",
)

# Compare before / after
for i, (old, new) in enumerate(zip(initial_ff.bonds, estimated_ff.bonds)):
    delta = new.force_constant - old.force_constant
    print(f"  Bond {old.elements}: {old.force_constant:.3f} → "
          f"{new.force_constant:.3f} mdyn/Å  (Δ = {delta:+.3f})")

What invalid_policy='skip' does

At a transition state the reaction-coordinate mode has negative curvature. The Seminario projection used by QFUERZA can produce negative or complex force constants for bonds along this coordinate. invalid_policy="skip" leaves those parameters unchanged rather than inserting unphysical values.

Step 4: Set Up Reference Data¶

The ReferenceData container holds the QM target values that the objective function will try to reproduce. Each entry has a kind (energy, frequency, bond length, bond angle, torsion angle), a value, and a weight that controls its importance in the fit.

Quick start — auto-populate from a molecule

The simplest approach auto-extracts bond lengths and angles from the molecule we already built, and adds frequencies from the QM calculation:

import numpy as np
from q2mm.optimizers.objective import ReferenceData

# Load frequencies from QM output
ts_freqs = np.loadtxt("examples/sn2-test/qm-reference/sn2-ts-frequencies.txt")

# One call populates everything
ref = ReferenceData.from_molecule(
    mol,
    frequencies=ts_freqs,
    skip_imaginary=True,  # skip the imaginary TS mode
)

print(f"Reference observations: {ref.n_observations}")
# → bonds + angles + real frequencies

Default weights are bond_length=10.0, bond_angle=5.0, frequency=1.0. Override with the weights parameter:

ref = ReferenceData.from_molecule(
    mol,
    frequencies=ts_freqs,
    weights={"bond_length": 50.0, "bond_angle": 25.0, "frequency": 2.0},
)

Auto-populate from a Gaussian .fchk file

If you have a Gaussian formatted checkpoint file (.fchk), you can build both the molecule and reference data in one step:

ref, mol = ReferenceData.from_fchk(
    "examples/ethane/GS.fchk",
    bond_tolerance=1.3,
)
print(f"Molecule: {mol}")
print(f"Observations: {ref.n_observations}")
# The molecule has the Hessian attached automatically
print(f"Hessian shape: {mol.hessian.shape}")

Auto-populate from a Gaussian .log file

For Gaussian log files from opt freq jobs:

ref, mol = ReferenceData.from_gaussian(
    "sn2-ts.log",
    bond_tolerance=1.4,
    charge=-1,
    include_frequencies=True,
    skip_imaginary=True,
)

Multi-molecule training sets

For optimising against multiple conformers or molecules:

ref = ReferenceData.from_molecules(
    [mol_gs, mol_ts],
    frequencies_list=[freqs_gs, freqs_ts],
    skip_imaginary=True,
)
# Each molecule gets a sequential molecule_idx (0, 1, ...)

Manual construction (full control)

You can still build ReferenceData entry by entry when you need complete control over what goes in:

ref = ReferenceData()

for bond in mol.bonds:
    ref.add_bond_length(
        bond.length,
        atom_indices=(bond.atom_i, bond.atom_j),
        weight=10.0,
        label=f"{bond.element_pair} bond",
    )

for angle in mol.angles:
    ref.add_bond_angle(
        angle.value,
        atom_indices=(angle.atom_i, angle.atom_j, angle.atom_k),
        weight=5.0,
        label=f"{angle.elements} angle",
    )

# Bulk-add frequencies
ref.add_frequencies_from_array(ts_freqs, weight=1.0, skip_imaginary=True)

# Add an energy target
ref.add_energy(-239.12345, weight=1.0, label="TS energy")

Choosing weights

Weights balance the influence of different data types:

Bond lengths are in Ångströms (small numbers); give them higher weight (~10) so a 0.01 Å error matters as much as a 1 kcal/mol energy error.
Angles are in degrees; weight ~5 is typical.
Frequencies can have large absolute values but small relative errors; weight ~1 is usually fine.

There is no single "correct" weighting — iterate and compare results.

Step 5: Create the Objective Function¶

The ObjectiveFunction ties together the force field, the MM engine, the molecular structures, and the reference data into a single callable that scipy.optimize.minimize can drive.

At each evaluation it:

Sets the force-field parameters from the current parameter vector
Runs the MM engine (energy, geometry, frequencies) for each molecule
Computes weighted residuals against the reference data
Returns the sum of squared residuals

from q2mm.optimizers.objective import ObjectiveFunction

objective = ObjectiveFunction(
    forcefield=ff,
    engine=engine,        # your MM backend (see below)
    molecules=[mol],
    reference=ref,
)

# Evaluate at the initial (QFUERZA) parameters
initial_params = ff.get_param_vector()
initial_score = objective(initial_params)
print(f"Initial score: {initial_score:.6f}")
print(f"Parameters:    {len(initial_params)}")

Setting up an MM engine

The engine argument is any object implementing the MMEngine abstract base class from q2mm.backends.base. Q2MM ships with backends for OpenMM, JAX, JAX-MD, and Tinker:

from q2mm.backends.mm.openmm import OpenMMEngine
engine = OpenMMEngine()

from q2mm.backends.mm import JaxEngine
engine = JaxEngine()

from q2mm.backends.mm import JaxMDEngine
engine = JaxMDEngine()

from q2mm.backends.mm.tinker import TinkerEngine
engine = TinkerEngine(tinker_dir="/usr/local/bin")

Step 6: Optimise the Force Field¶

ScipyOptimizer wraps scipy.optimize.minimize with sensible defaults for force-field fitting.

Single-shot optimisation with L-BFGS-B

Setting	Value	Rationale
`method`	`L-BFGS-B`	Bounded quasi-Newton — fast convergence for smooth, differentiable objectives
`eps`	`1e-3`	Finite-difference step for gradient estimation. FF parameters have magnitudes ~0.5–10, so scipy's default (~1e-8) is far too small and produces noisy gradients
`maxiter`	`500`	Generous iteration budget; most runs converge in 50–200
`use_bounds`	`True`	Prevents parameters from drifting to unphysical values (e.g., negative bond lengths)

from q2mm.optimizers.scipy_opt import ScipyOptimizer

optimizer = ScipyOptimizer(
    method="L-BFGS-B",
    maxiter=500,
    ftol=1e-8,
    eps=1e-3,
    use_bounds=True,
    verbose=True,
)

result = optimizer.optimize(objective)
print(result.summary())

Expected output:

Method: L-BFGS-B
Success: True — CONVERGENCE: REL_REDUCTION_OF_F_<=_FACTR*EPSMCH
Score: 0.045321 → 0.001234 (97.3% improvement)
Iterations: 87, Evaluations: 1043

Alternative optimisers

For noisy or discontinuous landscapes, derivative-free methods can be more robust:

optimizer = ScipyOptimizer(method="Nelder-Mead", maxiter=2000)

optimizer = ScipyOptimizer(method="Powell", maxiter=1000)

optimizer = ScipyOptimizer(method="least_squares", maxiter=500)

Inspecting the result

# Fractional improvement (0 = no change, 1 = perfect)
print(f"Improvement: {result.improvement:.1%}")

# Optimised parameters are already applied to the ForceField
optimised_ff = objective.forcefield
for b in optimised_ff.bonds:
    print(f"  {b.elements}: k = {b.force_constant:.4f} mdyn/Å, "
          f"r₀ = {b.equilibrium:.4f} Å")

# Convergence history (score at each evaluation)
# Requires: pip install matplotlib
import matplotlib.pyplot as plt
plt.semilogy(result.history)
plt.xlabel("Evaluation")
plt.ylabel("Objective")
plt.title("Convergence")
plt.savefig("convergence.png")

Step 6b: Grad-Simp Cycling (Recommended for Large Systems)¶

For systems with more than ~10 parameters, a single optimizer often leaves residual error. The OptimizationLoop alternates between a gradient-based pass on all parameters and a simplex pass on the least gradient-suitable parameters, combining the strengths of both approaches.

Grad-simp cycling

from q2mm.optimizers.cycling import OptimizationLoop

loop = OptimizationLoop(
    objective,
    max_params=3,         # simplex on bottom 3 by simp_var per cycle
    max_cycles=10,        # up to 10 grad-simp cycles
    convergence=0.01,     # stop when <1% improvement per cycle
    full_method="L-BFGS-B",
    simp_method="Nelder-Mead",
    full_maxiter=200,
    simp_maxiter=200,
    verbose=True,
)

result = loop.run()
print(result.summary())

Each cycle:

Full-space gradient pass — L-BFGS-B on all N parameters
Sensitivity analysis — rank parameters by simplex suitability (lowest simp_var)
Subspace simplex — Nelder-Mead on only the 3 least gradient-suitable parameters
Convergence check — stop when improvement drops below threshold

When to use cycling vs single-shot

For ≤ 10 parameters, a single ScipyOptimizer call (Step 6) is usually sufficient. For larger systems — especially transition-state force fields with coupled parameters — the cycling loop typically produces better results. See the Optimization Guide for a detailed comparison.

Step 6c: Optax Optimizers (JAX only)¶

If you're using the JAX backend, you can use optax optimizers — Adam, AdaGrad, SGD, and AdamW — as an alternative to SciPy's L-BFGS-B. These are JAX-native adaptive optimizers that use analytical gradients automatically.

Why would you use optax? On rugged potential energy surfaces like MM3, Adam's per-parameter adaptive learning rates and momentum can dramatically outperform L-BFGS-B. On CH₃F with MM3, Adam achieves 56.3 cm⁻¹ RMSD — 10× better than L-BFGS-B's 579.0 (see Small Molecules).

Optax Adam optimization

from q2mm.optimizers.optax import OptaxOptimizer

optimizer = OptaxOptimizer(
    optimizer="adam",
    learning_rate=0.01,
    max_steps=2000,
)

result = optimizer.optimize(objective)
print(result.summary())

Expected output:

Method: optax:adam
Success: False — max_steps reached
Score: 192.000000 → 56.300000 (70.7% improvement)
Iterations: 2000, Evaluations: 2000

Learning rate schedules

Optax supports learning rate schedules that decay the LR during optimisation. Cosine annealing starts at your LR and decays smoothly to zero:

optimizer = OptaxOptimizer(
    optimizer="adam",
    learning_rate=0.01,
    max_steps=2000,
    schedule="cosine",
)

JAX backend required

OptaxOptimizer works best with the JAX backend because it uses analytical gradients via jax.grad. It will fall back to finite-difference gradients on other backends (OpenMM, Tinker), but the FD overhead negates the advantage of adaptive optimizers. Use ScipyOptimizer for non-JAX backends.

When to use optax vs SciPy vs cycling

Smooth landscape (harmonic form): Use ScipyOptimizer("L-BFGS-B") — curvature information matters here and L-BFGS-B excels.
Rugged landscape (MM3 form): Use OptaxOptimizer("adam") — momentum helps escape local minima.
Large systems (10+ params): Use OptimizationLoop (Step 6b) — the grad-simp cycling loop combines multiple strategies.
See the Optimization Guide for a full comparison with benchmark data.

Step 6d: JaxOpt Optimizers (JAX only — End-to-End Differentiable)¶

If you're using the JAX backend, you can also use JAXopt for end-to-end differentiable optimisation. Unlike optax (which differentiates through the loss only), JaxOpt compiles the entire objective — force field → MM engine → loss — into a single JIT-traced function. This enables true second-order methods like L-BFGS and L-BFGS-B with exact gradients.

JaxOpt L-BFGS-B optimization

from q2mm.optimizers.jaxopt_opt import JaxOptOptimizer

optimizer = JaxOptOptimizer(
    method="lbfgsb",   # or "lbfgs", "gradient_descent"
    maxiter=500,
)

result = optimizer.optimize(objective)
print(result.summary())

On CH₃F (harmonic), JaxOpt L-BFGS-B achieves 528 cm⁻¹ RMSD — matching SciPy L-BFGS-B's 529 cm⁻¹ while using exact gradients throughout.

JAX backend required

JaxOptOptimizer only works with JaxEngine. It converts the ObjectiveFunction into a frozen ObjectiveSpec and compiles the entire pipeline with jax.jit. Non-JAX backends are not supported.

When to use JaxOpt

JaxOpt is most useful when you want the same algorithms as SciPy (L-BFGS-B) but with exact analytical gradients instead of finite differences. The gradient quality is identical to optax, but the optimiser itself is second-order. See Workflow D in the Optimization Guide for details.

Step 7: Export the Optimised Force Field¶

Q2MM can write the optimised parameters to MM3 .fld, Tinker .prm, AMBER .frcmod, or OpenMM .xml format. For JAX and JAX-MD backends, save the parameter vector directly as a NumPy array. The ForceField object has convenience methods for all formats:

optimised_ff.to_mm3_fld("optimized_mm3.fld")
optimised_ff.to_tinker_prm("optimized.prm")
optimised_ff.to_amber_frcmod("optimized.frcmod")
optimised_ff.to_openmm_xml("forcefield.xml")
np.save("optimized_params.npy", optimised_ff.get_param_vector())

Expand each format below for details and template-based export options:

MM3 .fld (Schrödinger MacroModel)

from q2mm.io import save_mm3_fld

output_path = save_mm3_fld(
    optimised_ff,
    "optimized_mm3.fld",
    template_path="my-system.fld",         # preserves header / metadata
    substructure_name="SN2 TS Optimized",
)
print(f"Saved: {output_path}")

When you pass template_path, Q2MM reads the original .fld file, updates only the bond and angle parameters that were optimised, and writes everything else (headers, VdW parameters, comments) unchanged. This is essential for round-trip compatibility with MacroModel.

Tinker .prm

from q2mm.io import save_tinker_prm

save_tinker_prm(
    optimised_ff,
    "optimized.prm",
    template_path="template.prm",
)

AMBER .frcmod

from q2mm.io import save_amber_frcmod

save_amber_frcmod(
    optimised_ff,
    "optimized.frcmod",
    template_path="template.frcmod",   # preserves headers and unmodified sections
)

OpenMM .xml

from q2mm.io import save_openmm_xml

# Standalone ForceField XML (with AtomTypes and Residues)
save_openmm_xml(optimised_ff, "forcefield.xml", molecule=mol)

You can also serialize the exact OpenMM System object for archival:

from q2mm.backends.mm.openmm import OpenMMEngine

engine = OpenMMEngine()
engine.export_system_xml("system.xml", mol, optimised_ff)

System XML serializes the topology-specific System with all particles and forces. ForceField XML is a standalone definition loadable by openmm.app.ForceField() — more portable and applicable to different topologies.

JAX / JAX-MD (parameter vector)

JAX and JAX-MD backends work with the ForceField parameter vector directly — there's no separate file format. Save and reload with NumPy:

import numpy as np

np.save("optimized_params.npy", optimised_ff.get_param_vector())

To reload into a new session:

params = np.load("optimized_params.npy")
ff.set_param_vector(params)

Complete script¶

Here is the full pipeline in one script. The SN2 example files in examples/sn2-test/ contain pre-computed QM data so you can run the QFUERZA + analysis steps immediately.

"""Full TSFF pipeline — SN2 F⁻ + CH₃F transition state."""

import numpy as np
from pathlib import Path

from q2mm.models.molecule import Q2MMMolecule
from q2mm.models.forcefield import ForceField
from q2mm.models.seminario import estimate_force_constants
from q2mm.optimizers.objective import ObjectiveFunction, ReferenceData
from q2mm.optimizers.scipy_opt import ScipyOptimizer

QM_REF = Path("examples/sn2-test/qm-reference")

# ── Step 1: Load QM data ──────────────────────────────────────────
mol = Q2MMMolecule.from_xyz(
    QM_REF / "sn2-ts-optimized.xyz",
    charge=-1,
    name="SN2_TS",
    bond_tolerance=1.4,
)
hessian = np.load(str(QM_REF / "sn2-ts-hessian.npy"))
mol = mol.with_hessian(hessian)

print(f"Loaded: {mol.n_atoms} atoms, {len(mol.bonds)} bonds, "
      f"{len(mol.angles)} angles")

# ── Step 2: QFUERZA estimation ──────────────────────────────────
ff = estimate_force_constants(
    mol,
    zero_torsions=True,
    au_hessian=True,
    invalid_policy="skip",
)

print("\nQFUERZA estimates:")
for b in ff.bonds:
    print(f"  Bond {b.elements}: k={b.force_constant:.4f} mdyn/Å, "
          f"r₀={b.equilibrium:.4f} Å  {b.label}")
for a in ff.angles:
    print(f"  Angle {a.elements}: k={a.force_constant:.6f} mdyn·Å/rad², "
          f"θ₀={a.equilibrium:.1f}°  {a.label}")

# ── Step 3: Reference data ────────────────────────────────────────
ts_freqs = np.loadtxt(str(QM_REF / "sn2-ts-frequencies.txt"))

ref = ReferenceData.from_molecule(
    mol,
    frequencies=ts_freqs,
    skip_imaginary=True,
)
print(f"\nReference observations: {ref.n_observations}")

# ── Step 4: Optimise (requires an MM engine) ──────────────────────
from q2mm.backends.mm.openmm import OpenMMEngine

engine = OpenMMEngine()

objective = ObjectiveFunction(
    forcefield=ff,
    engine=engine,
    molecules=[mol],
    reference=ref,
)

optimizer = ScipyOptimizer(
    method="L-BFGS-B", maxiter=500, eps=1e-3
)
result = optimizer.optimize(objective)
print(result.summary())

# ── Step 5: Export ────────────────────────────────────────────────
ff.to_mm3_fld("sn2-ts-qfuerza.fld")

Next steps¶

Once you have completed this tutorial, consider:

Multiple conformers — add ground-state CH₃F alongside the TS to train a force field that reproduces both minima and the saddle point. Load qm-reference/ch3f-optimized.xyz and its Hessian as a second molecule.
Frequency matching — add QM vibrational frequencies to the reference data (Step 4) for a tighter fit of force constants.
Torsion scanning — for systems with soft torsions, run a QM torsion scan and add the energy profile to ReferenceData for proper barrier heights.
Custom weighting — experiment with different weights to balance geometry accuracy against energy/frequency reproduction.
Larger systems — the Rh-enamide example in examples/rh-enamide/ demonstrates TSFF fitting for a transition-metal catalysed reaction with significantly more parameters.
Alternative optimisers — try Nelder-Mead for noisy landscapes, or least_squares (Levenberg-Marquardt) when you have more observations than parameters.
Consult the API reference — see the API Reference for the complete interface of ForceField, Q2MMMolecule, ObjectiveFunction, and all I/O functions.