Ghost Molecules? A Lennard-Jones Stress Test for Neural Potentials

  • Home
  • /Ghost Molecules? A Lennard-Jones Stress Test for Neural Potentials

image

Ghost Molecules? A Lennard-Jones Stress Test for Neural Potentials

07 Jan 2026

If a learned potential makes short-range repulsion too cheap, an optimizer can accept overlapping atoms. That is the "ghost molecule" failure: not a rendering glitch, but bad extrapolation near the steepest part of the energy curve. This post covers a deliberately small test: a one-dimensional Lennard-Jones dimer. It is not a drug-discovery benchmark, a quantum chemistry result, or a direct test of the Pauli exclusion principle. The goal is narrower: check whether a model that looks good on the observed shoulder still respects the repulsive core.

The ground truth is analytic:

V(r)=4ϵ[(σr)12(σr)6]V(r)=4\epsilon\left[\left(\frac{\sigma}{r}\right)^{12}-\left(\frac{\sigma}{r}\right)^6\right]

with derivative labels:

dVdr=4ϵ[12σ12r13+6σ6r7]\frac{dV}{dr}=4\epsilon\left[-12\sigma^{12}r^{-13}+6\sigma^6r^{-7}\right]

The "Soft Wall" Problem

Use the reciprocal coordinate u=1/ru=1/r. As atoms approach, the leading repulsive term grows like u12u^{12}. A generic MLP trained on a finite interval has no built-in reason to continue that growth outside the samples. It can fit the attractive well and the first part of the shoulder, then extend into the core with a wall that is smooth, finite, and too soft.

This is not a universal-approximation complaint. On a covered training interval, a large enough model with enough data can fit the curve. The failure mode starts when optimization walks past the edge of the training distribution.

What Changed

The stable ZeroProofML variant keeps a deep backbone, but uses an improper rational output head. In this experiment, ZeroProofML provides the rational-head inductive bias and stable parameterization; the Lennard-Jones oracle supplies the target energies and derivatives.

f(x)=P(x)Q(x),degP>degQf(x)=\frac{P(x)}{Q(x)}, \qquad \deg P > \deg Q

For the LJ preset, degP=3\deg P=3 and degQ=1\deg Q=1. The denominator is parameterized to stay positive, and the leading numerator coefficient is positive. The feature is scaled reciprocal distance:

x=uumax,umax=10.95x=\frac{u}{u_{\max}}, \qquad u_{\max}=\frac{1}{0.95}

This does not hard-code Lennard-Jones, and it does not guarantee the true u12u^{12} asymptote. It gives the dangerous direction a better default: positive super-linear growth instead of a saturating tail or weak ramp.

The Test

The stress test used the following setup:

Item Setting
Seeds 10
LJ parameters ϵ=20, σ=1\epsilon=20,\ \sigma=1
Training range r0.95σr \ge 0.95\sigma
Core test range 0.2σr0.6σ0.2\sigma \le r \le 0.6\sigma
Training signal energy + log(1+dV/dr)\log(1+\lvert dV/dr\rvert) derivative match

The core metric is log10_energy_mae_core on samples with r<0.5σr<0.5\sigma:

1nilog10(V^i+1)log10(Vi+1)\frac{1}{n}\sum_i \left|\log_{10}(\lvert \hat V_i\rvert+1)-\log_{10}(\lvert V_i\rvert+1)\right|
Model Core log-error
ZeroProofML stable improper head 1.332 ± 0.399
Deep MLP baseline 4.842 ± 0.0505

Takeaway

Interpolation scores can hide weak extrapolation near singular or near-singular limits. For this LJ dimer benchmark, a positive improper rational head gives short-range wall extrapolation a better shape than a smooth MLP baseline. Broader molecular-simulation claims still need separate validation: multi-atom systems, force-field datasets, integration stability, conservation behavior, and thermodynamic tests.

Join our newsletter!

Enter your email to receive our latest newsletter.

Don't worry, we don't spam

Related Articles

blog cover image
04 May 2026

ZeroProofML 0.5.1: Strict Inference You Can Audit

ZeroProofML 0.5.1 makes strict rational inference inspectable across export, validation, and deployment handoff.

blog cover image
07 Jan 2026

Ghost Molecules? A Lennard-Jones Stress Test for Neural Potentials

A controlled Lennard-Jones dimer test shows where smooth neural potentials can look accurate yet extrapolate too softly in the repulsive core.

blog cover image
08 Sep 2025

Why Your Neural Network Can't Learn 1/x (And What We Did About It)

Why smooth activations create "Soft Walls" near poles, and how Signed Common Meadows (SCM) fix it for robotics, pharma, and electronics.

Ghost Molecules? A Lennard-Jones Stress Test for Neural Potentials | ZeroProofML | ZeroProofML