CERN Tape Archive: a distributed, reliable and scalable scheduling system

The CERN Tape Archive (CTA) provides a tape backend to disk systems and, in conjunction with EOS, is managing the data of the LHC experiments at CERN. Magnetic tape storage offers the lowest cost per unit volume today, followed by hard disks and flash. In addition, current tape drives deliver a soli...

Full description

Bibliographic Details
Main Authors: Cano Eric, Bahyl Vladimír, Caffy Cédric, Cancio Germán, Davis Michael, Keeble Oliver, Kotlyar Viktor, Leduc Julien, Murray Steven
Format: Article
Language:English
Published: EDP Sciences 2021-01-01
Series:EPJ Web of Conferences
Online Access:https://www.epj-conferences.org/articles/epjconf/pdf/2021/05/epjconf_chep2021_02037.pdf
id doaj-0ff0f7ef36fc452b9a70d0bd94ecad6c
record_format Article
spelling doaj-0ff0f7ef36fc452b9a70d0bd94ecad6c2021-08-26T09:27:32ZengEDP SciencesEPJ Web of Conferences2100-014X2021-01-012510203710.1051/epjconf/202125102037epjconf_chep2021_02037CERN Tape Archive: a distributed, reliable and scalable scheduling systemCano Eric0Bahyl Vladimír1Caffy Cédric2Cancio Germán3Davis Michael4Keeble Oliver5Kotlyar Viktor6Leduc Julien7Murray Steven8CERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchInstitute for High Energy Physics named by A.A. Logunov of National Research Center “Kurchatov Institute”CERN—European Organization for Nuclear ResearchCERN—European Organization for Nuclear ResearchThe CERN Tape Archive (CTA) provides a tape backend to disk systems and, in conjunction with EOS, is managing the data of the LHC experiments at CERN. Magnetic tape storage offers the lowest cost per unit volume today, followed by hard disks and flash. In addition, current tape drives deliver a solid bandwidth (typically 360MB/s per device), but at the cost of high latencies, both for mounting a tape in the drive and for positioning when accessing non-adjacent files. As a consequence, the transfer scheduler should queue transfer requests before the volume warranting a tape mount is reached. In spite of these transfer latencies, user-interactive operations should have a low latency. The scheduling system for CTA was built from the experience gained with CASTOR. Its implementation ensures reliability and predictable performance, while simplifying development and deployment. As CTA is expected to be used for a long time, lock-in to vendors or technologies was minimized. Finally, quality assurance systems were put in place to validate reliability and performance while allowing fast and safe development turnaround.https://www.epj-conferences.org/articles/epjconf/pdf/2021/05/epjconf_chep2021_02037.pdf
collection DOAJ
language English
format Article
sources DOAJ
author Cano Eric
Bahyl Vladimír
Caffy Cédric
Cancio Germán
Davis Michael
Keeble Oliver
Kotlyar Viktor
Leduc Julien
Murray Steven
spellingShingle Cano Eric
Bahyl Vladimír
Caffy Cédric
Cancio Germán
Davis Michael
Keeble Oliver
Kotlyar Viktor
Leduc Julien
Murray Steven
CERN Tape Archive: a distributed, reliable and scalable scheduling system
EPJ Web of Conferences
author_facet Cano Eric
Bahyl Vladimír
Caffy Cédric
Cancio Germán
Davis Michael
Keeble Oliver
Kotlyar Viktor
Leduc Julien
Murray Steven
author_sort Cano Eric
title CERN Tape Archive: a distributed, reliable and scalable scheduling system
title_short CERN Tape Archive: a distributed, reliable and scalable scheduling system
title_full CERN Tape Archive: a distributed, reliable and scalable scheduling system
title_fullStr CERN Tape Archive: a distributed, reliable and scalable scheduling system
title_full_unstemmed CERN Tape Archive: a distributed, reliable and scalable scheduling system
title_sort cern tape archive: a distributed, reliable and scalable scheduling system
publisher EDP Sciences
series EPJ Web of Conferences
issn 2100-014X
publishDate 2021-01-01
description The CERN Tape Archive (CTA) provides a tape backend to disk systems and, in conjunction with EOS, is managing the data of the LHC experiments at CERN. Magnetic tape storage offers the lowest cost per unit volume today, followed by hard disks and flash. In addition, current tape drives deliver a solid bandwidth (typically 360MB/s per device), but at the cost of high latencies, both for mounting a tape in the drive and for positioning when accessing non-adjacent files. As a consequence, the transfer scheduler should queue transfer requests before the volume warranting a tape mount is reached. In spite of these transfer latencies, user-interactive operations should have a low latency. The scheduling system for CTA was built from the experience gained with CASTOR. Its implementation ensures reliability and predictable performance, while simplifying development and deployment. As CTA is expected to be used for a long time, lock-in to vendors or technologies was minimized. Finally, quality assurance systems were put in place to validate reliability and performance while allowing fast and safe development turnaround.
url https://www.epj-conferences.org/articles/epjconf/pdf/2021/05/epjconf_chep2021_02037.pdf
work_keys_str_mv AT canoeric cerntapearchiveadistributedreliableandscalableschedulingsystem
AT bahylvladimir cerntapearchiveadistributedreliableandscalableschedulingsystem
AT caffycedric cerntapearchiveadistributedreliableandscalableschedulingsystem
AT canciogerman cerntapearchiveadistributedreliableandscalableschedulingsystem
AT davismichael cerntapearchiveadistributedreliableandscalableschedulingsystem
AT keebleoliver cerntapearchiveadistributedreliableandscalableschedulingsystem
AT kotlyarviktor cerntapearchiveadistributedreliableandscalableschedulingsystem
AT leducjulien cerntapearchiveadistributedreliableandscalableschedulingsystem
AT murraysteven cerntapearchiveadistributedreliableandscalableschedulingsystem
_version_ 1721195758694170624