Rowan Benchmarks

NNP Arena

evaluating NNP methods against coupled cluster & experimental data and other tests

Molecular Energy

Name
UMA Small 1.1 (OMol task)4.380.210.140.901.52
OMol25's eSEN Conserving Small4.400.210.140.911.52
UMA Medium 1.1 (OMol task)4.650.210.140.881.60
OrbMol (Orb-v3 Conservative OMol)4.660.220.150.941.61
B97-3c10.160.300.352.323.46
AIMNet2 (ωB97M-D3, new)14.660.540.392.354.96
AIMNet2-NSE17.320.550.413.035.83
Prescient's StrainRelief MACE19.550.570.614.466.64
GFN2-xTB18.650.720.7314.606.82
Orb-v3 (Conservative Inf. OMat)21.330.880.977.707.53
eSEN-OAM22.160.840.697.057.67
MACE-MP-0b2(Large)-D3BJ26.760.811.1214.609.49
Overall scores are a weighted average of individual benchmark results.
The GMTKN55 WTMAD-2 shown here is filtered to only include exclusively neutral, singlet, and elemental-organic subsets.

Molecular Optimization

Name
AIMNet2 (ωB97M-D3, new)2512.88210.16
AIMNet2-NSE24100.88140.67
eSEN-OAM24109.46190.21
OMol25's eSEN Conserving Small24100.88180.25
UMA Small 1.1 (OMol task)24101.21170.38
OrbMol (Orb-v3 Conservative OMol)1488.7190.50
Optimization is a simple test that evaluates a method's ability to optimize 25 drug-like molecules without producing imaginary frequencies. Optimizations are run using the sella optimizer, 0.01 eV/Å fmax, and max. 250 steps. See all optimization results. See our blog post testing NNP–optimizer pairings.

Periodic Optimization

Name
r²SCAN-3c0.973.681.71
UMA Small 1.1 (OMol task)1.432.701.78
UMA Medium 1.1 (OMol task)2.233.022.45
UMA Small 1.1 (OMC task)1.574.792.46
Egret-12.614.383.09
MACE-MP-0b2(Large)-D3BJ3.473.033.35
GFN2-xTB5.387.766.03
UMA Medium 1.1 (OMC task)9.218.789.09
AIMNet2 (ωB97M-D3, new)9.3219.4112.10
eSEN-OAM9.1632.3515.54
Orb-v3 (Conservative Inf. OMat)26.7912.3022.80
Overall scores are a weighted average of individual benchmark results.

Speed

Name
AIMNet2 (ωB97M-D3, new)0.024.67
AIMNet2-NSE0.024.52
OrbMol (Orb-v3 Conservative OMol)0.032.51
OMol25's eSEN Conserving Small0.100.85
UMA Small 1.1 (OMol task)0.120.71
MACE-MP-0b2(Large)-D3BJ0.130.68
eSEN-OAM0.410.21
UMA Medium 1.1 (OMol task)0.550.16

This benchmark measures the speed of running molecular dynamics (MD) simulations on tacrolimus (126 atoms) through ASE with a 1 fs timestep at 300 K for 50 steps. All calculations were run on A10G GPUs through Modal. See all speed results.

View Results by Benchmark

View Results by Method