Rowan Benchmarks

NNP Arena

evaluating NNP methods against coupled cluster & experimental data and other tests

Molecular Energy

Name
UMA Medium 1.1 (OMol task)3.750.210.140.881.32
OMol25's eSEN Conserving Small3.860.210.140.911.35
UMA Small 1.1 (OMol task)3.900.210.140.901.37
OrbMol (Orb-v3 Conservative OMol)4.130.220.150.941.45
B97-3c10.330.300.352.323.52
AIMNet2 (ωB97M-D3, new)14.460.540.392.354.90
AIMNet2-NSE16.370.550.413.035.53
MACE-Osaka24-large-D3BJ18.740.380.284.886.24
Prescient's StrainRelief MACE19.890.570.614.466.75
GFN2-xTB18.940.720.7314.606.91
Orb-v3 (Conservative Inf. OMat)21.440.880.977.707.56
eSEN-OAM22.500.840.697.057.78
MACE-MP-0b2(Large)-D3BJ27.140.811.1214.609.61
Overall scores are a weighted average of individual benchmark results.
The GMTKN55 WTMAD-2 shown here is filtered to only include exclusively neutral, singlet, and elemental-organic subsets.

Molecular Optimization

Name
AIMNet2 (ωB97M-D3, new)2512.88210.16
AIMNet2-NSE24100.88140.67
eSEN-OAM24109.46190.21
OMol25's eSEN Conserving Small24100.88180.25
UMA Small 1.1 (OMol task)24101.21170.38
OrbMol (Orb-v3 Conservative OMol)1488.7190.50
Optimization is a simple test that evaluates a method's ability to optimize 25 drug-like molecules without producing imaginary frequencies. Optimizations are run using the sella optimizer, 0.01 eV/Å fmax, and max. 250 steps. See all optimization results. See our blog post testing NNP–optimizer pairings.

Periodic Optimization

Name
r²SCAN-3c0.973.681.71
UMA Small 1.1 (OMol task)1.432.701.78
UMA Medium 1.1 (OMol task)2.233.022.45
UMA Small 1.1 (OMC task)1.574.792.46
Egret-12.614.383.09
MACE-MP-0b2(Large)-D3BJ3.473.033.35
GFN2-xTB5.387.766.03
MACE-Osaka24-large-D3BJ7.563.486.44
UMA Medium 1.1 (OMC task)9.218.789.09
AIMNet2 (ωB97M-D3, new)9.3219.4112.10
eSEN-OAM9.1632.3515.54
Orb-v3 (Conservative Inf. OMat)26.7912.3022.80
Overall scores are a weighted average of individual benchmark results.

Speed

Name
AIMNet2 (ωB97M-D3, new)0.024.67
AIMNet2-NSE0.024.52
OrbMol (Orb-v3 Conservative OMol)0.032.51
OMol25's eSEN Conserving Small0.100.85
UMA Small 1.1 (OMol task)0.120.71
MACE-MP-0b2(Large)-D3BJ0.130.68
eSEN-OAM0.410.21
UMA Medium 1.1 (OMol task)0.550.16

This benchmark measures the speed of running molecular dynamics (MD) simulations on tacrolimus (126 atoms) through ASE with a 1 fs timestep at 300 K for 50 steps. All calculations were run on A10G GPUs through Modal. See all speed results.

View Results by Benchmark

View Results by Method

Rowan Benchmarks