At the moment, if a batch contains many orders, absolute tolerances (e.g. in the cost coverage test) are easily reached. This is, however, not an indication of misbehavior.
We should consider scaling the absolute tolerance up when a batch contains multiple orders. Linear scaling might be the right thing to do: if there are n orders in a batch, the tolerance for an alert should be n times the absolute tolerance for a single-order batch.
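
A minimal sketch of what linear scaling could look like. The names `scaled_tolerance`, `should_alert`, `single_order_tolerance` and `num_orders` are hypothetical and not taken from the existing code, and treating an empty batch like a single-order batch is an assumption made only for this illustration.

```python
def scaled_tolerance(single_order_tolerance: float, num_orders: int) -> float:
    """Return the absolute tolerance for a batch, scaled linearly in the order count.

    Hypothetical helper: a batch with n orders gets n times the tolerance
    of a single-order batch.
    """
    # Assumption: never scale below the single-order tolerance,
    # so an empty batch is treated like a single-order batch.
    return single_order_tolerance * max(num_orders, 1)


def should_alert(deviation: float, single_order_tolerance: float, num_orders: int) -> bool:
    """Alert only if the absolute deviation exceeds the scaled tolerance."""
    return abs(deviation) > scaled_tolerance(single_order_tolerance, num_orders)
```

With a single-order tolerance of 0.5, for example, a deviation of 1.5 would raise an alert for a single-order batch but not for a batch of four orders, where the scaled tolerance is 2.0.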