
Hadoop: DFS IO Write Performance
• DFS IO included in Hadoop, measures sequential access throughput
• We have two map tasks each writing to a file of increasing size (1-10GB)
• Significant improvement with IPoIB, SDP and 10GigE
• With SSD, performance improvement is almost seven or eight fold!
• SSD benefits not seen without using high-performance interconnect!
• In-line with comment on Google keynote about I/O performance
Four Data Nodes
Using HDD and SSD
Average Write Throughput (MB/sec)
CCGrid '11
143
Comentarios a estos manuales