Our parallel execution is made by a simple one-dimensional domain decomposition for a-axis of the unit cell, while other parameters are also used for the parallelization in each subroutine case by case. Therefore, it is better to choose the a-axis as the longest axis for a good road balancing, although, of course, the parallel execution is made for any unit cell.