When should the groupByKey API be used in Spark programming?
groupByKey suffers from data shuffling, and its functionality can be achieved using either combineByKey or reduceByKey. So when should this API be used? Is there a use case for it?
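To make the shuffling concern concrete, here is a minimal sketch in plain Python. It is not Spark code. It only models the idea that groupByKey sends every record across the shuffle, while reduceByKey combines values inside each partition first. The sample data, the function names `shuffle_group_by_key` and `shuffle_reduce_by_key`, and the "records moved" counter are all assumptions made up for illustration.

```python
from collections import defaultdict

# Hypothetical input: two partitions of (key, value) pairs.
partitions = [
    [("a", 1), ("b", 1), ("a", 1), ("a", 1)],
    [("b", 1), ("a", 1), ("b", 1)],
]


def shuffle_group_by_key(parts):
    """Model of groupByKey: every record crosses the shuffle unchanged."""
    shuffled = [pair for part in parts for pair in part]
    grouped = defaultdict(list)
    for key, value in shuffled:
        grouped[key].append(value)
    return dict(grouped), len(shuffled)


def shuffle_reduce_by_key(parts, func):
    """Model of reduceByKey: combine within each partition, then shuffle."""
    shuffled = []
    for part in parts:
        local = {}
        for key, value in part:
            # Map-side combine: at most one record per key per partition.
            local[key] = func(local[key], value) if key in local else value
        shuffled.extend(local.items())
    merged = {}
    for key, value in shuffled:
        # Reduce side: merge the partial results from each partition.
        merged[key] = func(merged[key], value) if key in merged else value
    return merged, len(shuffled)


groups, moved_by_group = shuffle_group_by_key(partitions)
sums_from_groups = {key: sum(values) for key, values in groups.items()}
sums, moved_by_reduce = shuffle_reduce_by_key(partitions, lambda x, y: x + y)

print(moved_by_group, moved_by_reduce)
print(sums_from_groups == sums)
```

In this model both approaches give the same per-key sums, but the grouped version moves all 7 records while the reduced version moves 4. The saving only exists when the combining function shrinks the data. If the job really needs every value for a key (for example, the full list itself is the result), combining into a list moves just as much data, which is the kind of case where groupByKey seems reasonable.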
According to the link below, groupByKey should be avoided.