Workflow

map

flatmap

mapPartitions: do something with partition value

mapPartitionsWithIndex: with index

collect gives you a list

union(two rdd to one rdd) subtract intersection

glom: value -> list(value)

foreachPartition: do thing to partition, each partition use f.

fold give initial, and function, just like reduce

aggregate initial seq operation, comb operation

max