Evaluating Methods of Transferring Large Datasets

Full item record

dc.contributor.authorKopeć, Jakub
dc.contributor.organizationInterdisciplinary Centre for Mathematical and Computational Modelling, University of Warsawen
dc.date.accessioned2022-11-22T11:52:45Z
dc.date.available2022-11-22T11:52:45Z
dc.date.issued2022
dc.description.abstractOur society critically depends on data, Big Data. The humanity generates and moves data volumes larger than ever before and their increase is continuously accelerating. The goal of this research is to evaluate tools used for the transfer of large volumes of data. Bulk data transfer is a complex endeavour that requires not only sufficient network infrastructure, but also appropriate software, computing power and storage resources. We report on the series of storage benchmarks conducted using recently developed elbencho tool. The tests were conducted with an objective to understand and avoid I/O bottlenecks during data transfer operation. Subsequently Ethernet and InfiniBand networks performance was compared using Ohio State University bandwidth benchmark (OSU BW) and iperf3 tool. For comparison we also tested traditional (very inefficient) Linux scp and rsync commands as well as tools designed specifically to transfer large datasets more efficiently: bbcp and MDTMFTP. Additionally the impact of using simultaneous multi-threading and Ethernet jumbo frames on transfer rate was evaluated.en
dc.identifier.citationKopeć, J. (2022). Evaluating Methods of Transferring Large Datasets. In: Panda, D.K., Sullivan, M. (eds) Supercomputing Frontiers. SCFA 2022. Lecture Notes in Computer Science, vol 13214. Springer, Cham. https://doi.org/10.1007/978-3-031-10419-0_7en
dc.identifier.doi10.1007/978-3-031-10419-0_7
dc.identifier.isbn978-3-031-10418-3
dc.identifier.isbn978-3-031-10419-0
dc.identifier.urihttps://open.icm.edu.pl/handle/123456789/21888
dc.language.isoen
dc.publisherSpringer Natureen
dc.rightsUznanie autorstwa 4.0 Międzynarodowe*
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/*
dc.subjectI/Oen
dc.subjectfile systemsen
dc.subjectdata managementen
dc.subjectdata transferen
dc.subjectfile transfer protocolsen
dc.subjectnetwork evaluationen
dc.subjectI/O benchmarkingen
dc.subjectElbenchoen
dc.titleEvaluating Methods of Transferring Large Datasetsen
dc.typebookParten
Files for this record
Original bundle
Now showing 1 - 1 of 1
Name: 978-3-031-10419-0_7.pdf
Size: 447.24 KB
Format: Adobe Portable Document Format
Description:
License files
Name: license.txt
Size: 236 B
Format: Plain Text
Description:
Name: license_rdf
Size: 913 B
Format: RDF serialized in XML
Description: