More consistently use existing charge caching #1115

mattwthompson · 2024-11-21T17:14:27Z

Description

A few code paths didn't use the improvements of #1066 / #1069, which caused some horrible performance in some number of corner cases. (Writing out GROMACS files in the protein-ligand example took 10 minutes (!) compared to a few seconds for OpenMM.)

I'm more than a little confused as to how from_openmm code paths might have been hit in this example - which just uses SMIRNOFF force fields and Interchange.combine - but I could be misremembering which changes actually affected things.

Checklist

Add tests
Lint
Update docstrings

review-notebook-app · 2024-11-21T17:14:33Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

codecov · 2024-11-21T17:21:00Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 93.47%. Comparing base (6a515c0) to head (9ad8427).
Report is 5 commits behind head on main.

Additional details and impacted files

mattwthompson · 2024-11-21T17:24:12Z

Upstream:

============================= slowest 20 durations =============================
474.12s call     examples/protein_ligand/protein_ligand.ipynb::Cell 20
427.49s call     examples/protein_ligand/protein_ligand.ipynb::Cell 22
188.02s call     examples/protein_ligand/protein_ligand.ipynb::Cell 26
95.74s call     examples/ligand_in_water/ligand_in_water.ipynb::Cell 14
85.04s call     examples/ligand_in_water/ligand_in_water.ipynb::Cell 18
69.02s call     examples/packed_box/packed_box.ipynb::Cell 6
65.37s call     examples/ligand_in_water/ligand_in_water.ipynb::Cell 8
62.18s call     examples/protein_ligand/protein_ligand.ipynb::Cell 7
59.11s call     examples/ligand_in_water/ligand_in_water.ipynb::Cell 6
32.89s call     examples/lammps/lammps.ipynb::Cell 6
30.41s call     examples/amber/amber.ipynb::Cell 4
30.02s call     examples/host-guest/host_guest.ipynb::Cell 6
27.48s call     examples/protein_ligand/protein_ligand.ipynb::Cell 9
20.64s call     examples/openmm/openmm.ipynb::Cell 5
19.13s call     examples/protein_ligand/protein_ligand.ipynb::Cell 18
18.69s call     examples/openmm/openmm.ipynb::Cell 4
14.53s call     examples/packed_box/packed_box.ipynb::Cell 4
14.51s call     examples/ligand_in_water/ligand_in_water.ipynb::Cell 19
12.16s call     examples/openmm/openmm.ipynb::Cell 0
11.10s call     examples/amber/amber.ipynb::Cell 0
================= 119 passed, 3 skipped in 1247.79s (0:20:47) ==================

These changes:

============================= slowest 20 durations =============================
89.78s call     examples/ligand_in_water/ligand_in_water.ipynb::Cell 14
64.33s call     examples/ligand_in_water/ligand_in_water.ipynb::Cell 18
63.69s call     examples/ligand_in_water/ligand_in_water.ipynb::Cell 8
63.53s call     examples/protein_ligand/protein_ligand.ipynb::Cell 7
57.87s call     examples/ligand_in_water/ligand_in_water.ipynb::Cell 6
38.[25](https://github.com/openforcefield/openff-interchange/actions/runs/11958262648/job/33337380981?pr=1115#step:8:26)s call     examples/packed_box/packed_box.ipynb::Cell 6
33.39s call     examples/lammps/lammps.ipynb::Cell 6
30.02s call     examples/host-guest/host_guest.ipynb::Cell 6
29.95s call     examples/protein_ligand/protein_ligand.ipynb::Cell 22
28.00s call     examples/amber/amber.ipynb::Cell 4
[26](https://github.com/openforcefield/openff-interchange/actions/runs/11958262648/job/33337380981?pr=1115#step:8:27).46s call     examples/protein_ligand/protein_ligand.ipynb::Cell 9
25.93s call     examples/protein_ligand/protein_ligand.ipynb::Cell 26
21.18s call     examples/openmm/openmm.ipynb::Cell 5
19.35s call     examples/openmm/openmm.ipynb::Cell 4
17.60s call     examples/protein_ligand/protein_ligand.ipynb::Cell 18
15.10s call     examples/packed_box/packed_box.ipynb::Cell 4
11.91s call     examples/openmm/openmm.ipynb::Cell 0
11.38s call     examples/lammps/lammps.ipynb::Cell 0
11.18s call     examples/amber/amber.ipynb::Cell 0
9.00s call     examples/packed_box/packed_box.ipynb::Cell 2
================== 119 passed, 3 skipped in 304.15s (0:05:04) ==================

Yoshanuikabundi

Wow that's a big speedup!

Am I right in thinking that this has implications for modifying an ElectrostaticsCollection after the charges have been cached? If so, since we demonstrate modifying collections in the examples, we should probably add some sort of invalidate_cache() method and document this behavior both in the collection docstring and in the relevant examples. Modifying a collection according to the example and having that modification only sometimes reflected in outputs would be quite frustrating.

Might it also make sense to generalize this caching code to the Collection class - perhaps have a parameters: dict[TopologyKey | <etc>, Potential property there? I imagine most of the cost of _get_charges() is the repeated dictionary lookups, though maybe having to do an attribute lookup would just reproduce that. parameters: dict[TopologyKey | <etc>, tuple[int, ...] would be a nicer, maybe faster API, but it would mean each collection would have to somehow define an ordered set of parameters. I'm not sure if that would be worthwhile.

Other than the needed documentation and a few extremely minor nits, this looks excellent. The code seems much easier to read and with less indirection. Great work!

Yoshanuikabundi · 2025-01-14T00:54:03Z

openff/interchange/smirnoff/_gromacs.py

@@ -185,7 +181,7 @@ def _convert(
    _partial_charges: dict[int | VirtualSiteKey, float] = dict()

    # Indexed by particle (atom or virtual site) indices
-    for key, charge in interchange["Electrostatics"]._get_charges().items():
+    for key, charge in interchange["Electrostatics"].charges.items():


Why not leave the electrostatics_collection assignment in place above and then use it here...

I don't completely follow - the appeal here is to save those lookups from above and I'm not sure what the charge variable is doing above in that scope

Yoshanuikabundi · 2025-01-14T00:54:11Z

openff/interchange/smirnoff/_gromacs.py

@@ -585,7 +581,7 @@ def _convert_virtual_sites(
                residue_index=molecule.atoms[0].residue_index,
                residue_name=molecule.atoms[0].residue_name,
                charge_group_number=1,
-                charge=interchange["Electrostatics"]._get_charges()[virtual_site_key],
+                charge=interchange["Electrostatics"].charges[virtual_site_key],


... and here?

openff/interchange/smirnoff/_nonbonded.py

Co-authored-by: Josh A. Mitchell <yoshanuikabundi@gmail.com>

…harge-caching

mattwthompson · 2025-01-15T16:37:28Z

Thanks @Yoshanuikabundi!

FIX: More consistently use existing charge caching

396d472

mattwthompson force-pushed the use-charge-caching branch from 60660a0 to 396d472 Compare November 21, 2024 17:30

mattwthompson marked this pull request as ready for review November 21, 2024 18:46

mattwthompson requested a review from Yoshanuikabundi November 21, 2024 18:50

Yoshanuikabundi approved these changes Jan 14, 2025

View reviewed changes

j-wags assigned Yoshanuikabundi Jan 14, 2025

jameseastwood assigned mattwthompson and unassigned Yoshanuikabundi Jan 14, 2025

This was referenced Jan 15, 2025

Consider making ElectrostaticsCollection.invalidate_cache #1142

Open

Push parameter caching up to Collection? #1143

Open

mattwthompson and others added 3 commits January 15, 2025 08:18

Update openff/interchange/smirnoff/_nonbonded.py

9f03ef2

Co-authored-by: Josh A. Mitchell <yoshanuikabundi@gmail.com>

Merge remote-tracking branch 'upstream/main' into use-charge-caching

57a9668

Merge remote-tracking branch 'upstream/use-charge-caching' into use-c…

9ad8427

…harge-caching

mattwthompson merged commit 9460db5 into main Jan 15, 2025
23 checks passed

mattwthompson deleted the use-charge-caching branch January 15, 2025 16:37

mattwthompson added this to the 0.4.1 milestone Jan 15, 2025

mattwthompson mentioned this pull request Jan 17, 2025

Cache some charge increment calculations #1126

Merged

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More consistently use existing charge caching #1115

More consistently use existing charge caching #1115

mattwthompson commented Nov 21, 2024 •

edited

Loading

review-notebook-app bot commented Nov 21, 2024

codecov bot commented Nov 21, 2024 •

edited

Loading

mattwthompson commented Nov 21, 2024

Yoshanuikabundi left a comment

Yoshanuikabundi Jan 14, 2025

mattwthompson Jan 15, 2025

Yoshanuikabundi Jan 14, 2025

mattwthompson commented Jan 15, 2025

More consistently use existing charge caching #1115

More consistently use existing charge caching #1115

Conversation

mattwthompson commented Nov 21, 2024 • edited Loading

Description

Checklist

review-notebook-app bot commented Nov 21, 2024

codecov bot commented Nov 21, 2024 • edited Loading

Codecov Report

mattwthompson commented Nov 21, 2024

Yoshanuikabundi left a comment

Choose a reason for hiding this comment

Yoshanuikabundi Jan 14, 2025

Choose a reason for hiding this comment

mattwthompson Jan 15, 2025

Choose a reason for hiding this comment

Yoshanuikabundi Jan 14, 2025

Choose a reason for hiding this comment

mattwthompson commented Jan 15, 2025

mattwthompson commented Nov 21, 2024 •

edited

Loading

codecov bot commented Nov 21, 2024 •

edited

Loading