Accession Number : ADA561775
Title : Understanding TCP Incast and Its Implications for Big Data Workloads
Descriptive Note : Technical rept.
Corporate Author : CALIFORNIA UNIV BERKELEY DEPT OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE
Personal Author(s) : Chen, Yanpei ; Griffith, Rean ; Zats, David ; Katz, Randy H
PDF Url : ADA561775
Report Date : 06 Apr 2012
Pagination or Media Count : 11
Abstract : TCP incast is a recently identified network transport pathology that affects many-to-one communication patterns in datacenters. It is caused by a complex interplay between datacenter applications, the underlying switches, network topology, and TCP, which was originally designed for wide-area networks. Incast increases the queuing delay of flows and decreases application-level throughput to far below the link bandwidth. The problem especially affects computing paradigms in which distributed processing cannot progress until all parallel threads in a stage complete. Examples of such paradigms include distributed file systems, web search, advertisement selection, and other applications with partition or aggregation semantics [5, 18, 25].
Descriptors : *COMMUNICATIONS NETWORKS, BANDWIDTH, DISTRIBUTED DATA PROCESSING, QUEUEING THEORY, WORKLOAD
Subject Categories : Non-radio Communications
Distribution Statement : APPROVED FOR PUBLIC RELEASE