Accession Number : ADA561775

Title :   Understanding TCP Incast and Its Implications for Big Data Workloads

Descriptive Note : Technical rept.

Corporate Author : CALIFORNIA UNIV BERKELEY DEPT OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCE

Personal Author(s) : Chen, Yanpei ; Griffit, Rean ; Zats, David ; Katz, Randy H

PDF Url : ADA561775

Report Date : 06 Apr 2012

Pagination or Media Count : 11

Abstract : TCP incast is a recently identified network transport pathology that affects many-to-one communication patterns in datacenters. It is caused by a complex interplay between datacenter applications, the underlying switches, network topology, and TCP, which was originally designed for wide area networks. Incast increases the queuing delay of flows, and decreases application level throughput to far below the link bandwidth. The problem especially affects computing paradigms in which distributed processing cannot progress until all parallel threads in a stage complete. Examples of such paradigms include distributed file systems, web search, advertisement selection, and other applications with partition or aggregation semantics [5, 18, 25].

Descriptors :   *COMMUNICATIONS NETWORKS, BANDWIDTH, DISTRIBUTED DATA PROCESSING, QUEUEING THEORY, WORKLOAD

Subject Categories : Non-radio Communications

Distribution Statement : APPROVED FOR PUBLIC RELEASE