Benchmarking Streaming Computation Engines at Yahoo (yahooeng.tumblr.com)
41 points by YAFZ on Dec 18, 2015 | 3 comments


If I'm reading this correctly, the Kafka topic only had 5 partitions, but they had 10 workers.

With the Spark direct stream, Kafka partitions map 1:1 to Spark partitions, so without a shuffle at most half of the workers would be doing any work.

Seems like a pretty basic oversight that should be addressed.
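
To make the arithmetic concrete, here is a rough sketch of the bound I have in mind. It is plain Python, not Spark code, and the function names are made up for illustration. It assumes one task per Kafka partition and that a worker counts as busy if it runs at least one task.

```python
def max_busy_workers(kafka_partitions, workers):
    """Upper bound on workers that can hold a task before any shuffle.

    Assumption: the direct stream creates one Spark partition (one task)
    per Kafka partition, so there are never more tasks than partitions.
    """
    return min(kafka_partitions, workers)


def idle_fraction(kafka_partitions, workers):
    """Fraction of workers left with no task under the same assumption."""
    return 1 - max_busy_workers(kafka_partitions, workers) / workers


# Numbers from the comment above: 5 Kafka partitions, 10 workers.
busy = max_busy_workers(5, 10)
idle = idle_fraction(5, 10)
print(busy, idle)
```

With 5 partitions and 10 workers this gives at most 5 busy workers. Calling repartition on the stream would spread the records across more tasks, but that is exactly the shuffle mentioned above, so it has its own cost.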


This is the first mention of Flink I've seen on HN.


There were a few before, but not many.



