CERN Tape Archive: a distributed, reliable and scalable scheduling system
The CERN Tape Archive (CTA) provides a tape backend to disk systems and, in conjunction with EOS, is managing the data of the LHC experiments at CERN. Magnetic tape storage offers the lowest cost per unit volume today, followed by hard disks and flash. In addition, current tape drives deliver a soli...
Main Authors: | , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
EDP Sciences
2021-01-01
|
Series: | EPJ Web of Conferences |
Online Access: | https://www.epj-conferences.org/articles/epjconf/pdf/2021/05/epjconf_chep2021_02037.pdf |
id |
doaj-0ff0f7ef36fc452b9a70d0bd94ecad6c |
---|---|
record_format |
Article |
spelling |
doaj-0ff0f7ef36fc452b9a70d0bd94ecad6c2021-08-26T09:27:32ZengEDP SciencesEPJ Web of Conferences2100-014X2021-01-012510203710.1051/epjconf/202125102037epjconf_chep2021_02037CERN Tape Archive: a distributed, reliable and scalable scheduling systemCano Eric0Bahyl Vladimír1Caffy Cédric2Cancio Germán3Davis Michael4Keeble Oliver5Kotlyar Viktor6Leduc Julien7Murray Steven8CERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchInstitute for High Energy Physics named by A.A. Logunov of National Research Center “Kurchatov Institute”CERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchThe CERN Tape Archive (CTA) provides a tape backend to disk systems and, in conjunction with EOS, is managing the data of the LHC experiments at CERN. Magnetic tape storage offers the lowest cost per unit volume today, followed by hard disks and flash. In addition, current tape drives deliver a solid bandwidth (typically 360MB/s per device), but at the cost of high latencies, both for mounting a tape in the drive and for positioning when accessing non-adjacent files. As a consequence, the transfer scheduler should queue transfer requests before the volume warranting a tape mount is reached. In spite of these transfer latencies, user-interactive operations should have a low latency. The scheduling system for CTA was built from the experience gained with CASTOR. Its implementation ensures reliability and predictable performance, while simplifying development and deployment. As CTA is expected to be used for a long time, lock-in to vendors or technologies was minimized. Finally, quality assurance systems were put in place to validate reliability and performance while allowing fast and safe development turnaround.https://www.epj-conferences.org/articles/epjconf/pdf/2021/05/epjconf_chep2021_02037.pdf |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Cano Eric Bahyl Vladimír Caffy Cédric Cancio Germán Davis Michael Keeble Oliver Kotlyar Viktor Leduc Julien Murray Steven |
spellingShingle |
Cano Eric Bahyl Vladimír Caffy Cédric Cancio Germán Davis Michael Keeble Oliver Kotlyar Viktor Leduc Julien Murray Steven CERN Tape Archive: a distributed, reliable and scalable scheduling system EPJ Web of Conferences |
author_facet |
Cano Eric Bahyl Vladimír Caffy Cédric Cancio Germán Davis Michael Keeble Oliver Kotlyar Viktor Leduc Julien Murray Steven |
author_sort |
Cano Eric |
title |
CERN Tape Archive: a distributed, reliable and scalable scheduling system |
title_short |
CERN Tape Archive: a distributed, reliable and scalable scheduling system |
title_full |
CERN Tape Archive: a distributed, reliable and scalable scheduling system |
title_fullStr |
CERN Tape Archive: a distributed, reliable and scalable scheduling system |
title_full_unstemmed |
CERN Tape Archive: a distributed, reliable and scalable scheduling system |
title_sort |
cern tape archive: a distributed, reliable and scalable scheduling system |
publisher |
EDP Sciences |
series |
EPJ Web of Conferences |
issn |
2100-014X |
publishDate |
2021-01-01 |
description |
The CERN Tape Archive (CTA) provides a tape backend to disk systems and, in conjunction with EOS, is managing the data of the LHC experiments at CERN.
Magnetic tape storage offers the lowest cost per unit volume today, followed by hard disks and flash. In addition, current tape drives deliver a solid bandwidth (typically 360MB/s per device), but at the cost of high latencies, both for mounting a tape in the drive and for positioning when accessing non-adjacent files. As a consequence, the transfer scheduler should queue transfer requests before the volume warranting a tape mount is reached. In spite of these transfer latencies, user-interactive operations should have a low latency.
The scheduling system for CTA was built from the experience gained with CASTOR. Its implementation ensures reliability and predictable performance, while simplifying development and deployment. As CTA is expected to be used for a long time, lock-in to vendors or technologies was minimized.
Finally, quality assurance systems were put in place to validate reliability and performance while allowing fast and safe development turnaround. |
url |
https://www.epj-conferences.org/articles/epjconf/pdf/2021/05/epjconf_chep2021_02037.pdf |
work_keys_str_mv |
AT canoeric cerntapearchiveadistributedreliableandscalableschedulingsystem AT bahylvladimir cerntapearchiveadistributedreliableandscalableschedulingsystem AT caffycedric cerntapearchiveadistributedreliableandscalableschedulingsystem AT canciogerman cerntapearchiveadistributedreliableandscalableschedulingsystem AT davismichael cerntapearchiveadistributedreliableandscalableschedulingsystem AT keebleoliver cerntapearchiveadistributedreliableandscalableschedulingsystem AT kotlyarviktor cerntapearchiveadistributedreliableandscalableschedulingsystem AT leducjulien cerntapearchiveadistributedreliableandscalableschedulingsystem AT murraysteven cerntapearchiveadistributedreliableandscalableschedulingsystem |
_version_ |
1721195758694170624 |