getCommonChunk() was fast but incomplete after #238.
It is complete but slow after #242 and #255.
The current code loops over all variables and group values (by= in data.table). That loop could be moved to C++ for a big constant-factor speedup, because intermediate allocations could be avoided.
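
To illustrate the kind of loop that avoids intermediate allocations, here is a minimal C++ sketch. It is not the getCommonChunk() logic: the function name `constantWithinGroups`, the numeric-only columns, and the pre-computed integer group ids are all assumptions made for illustration. It checks, for each variable, whether the value is constant within every group, using two scratch buffers that are reused across variables instead of one subset per group.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper, not part of the existing code base.
// columns: one numeric vector per variable, all of the same length.
// group:   group[i] is the group id of row i, assumed to be in [0, n_groups).
// Returns one flag per variable: true if the variable takes a single value
// within every group.
std::vector<bool> constantWithinGroups(
    const std::vector<std::vector<double>>& columns,
    const std::vector<int>& group,
    int n_groups) {
  // Scratch buffers allocated once and reused for every variable,
  // so no per-group subset is ever materialized.
  std::vector<double> first(n_groups);
  std::vector<char> seen(n_groups);

  std::vector<bool> result;
  result.reserve(columns.size());

  for (const auto& col : columns) {
    assert(col.size() == group.size());
    std::fill(seen.begin(), seen.end(), 0);
    bool constant = true;
    for (std::size_t i = 0; i < col.size(); ++i) {
      const int g = group[i];
      assert(g >= 0 && g < n_groups);
      if (!seen[g]) {
        // First row of this group: remember its value.
        seen[g] = 1;
        first[g] = col[i];
      } else if (col[i] != first[g]) {
        // A second distinct value in the same group: stop early.
        constant = false;
        break;
      }
    }
    result.push_back(constant);
  }
  return result;
}
```

Each variable is scanned at most once and the only allocations are the two scratch buffers and the result vector. The by= version would typically build a subset per group value before inspecting it. Exposing such a function to R (for example through Rcpp) would need extra glue code that is not shown here.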