Are there any rules of thumb for choosing the number of CPU cores needed to run a LAMMPS simulation of a given size efficiently?
For example, each node on my cluster has 32 cores. How do I choose the right number of nodes and the right number of cores for a LAMMPS simulation? Does it depend on the number of atoms in the system, or on something else?
Any suggestions are appreciated.