when should groupByKey API used in spark programming? -


groupbykey suffers shuffling data.and groupbykey functionality can achieved either using combinebykey or reducebykey.so when should api used ? there use case ?

according link below, groupbykey should avoided.

avoid groupbykey


Comments

Popular posts from this blog

c# - Validate object ID from GET to POST -

node.js - Custom Model Validator SailsJS -

php - Find a regex to take part of Email -