Site Reliability Engineer (W/M) (BB-84DE4)

Gefunden in: Neuvoo CH

Your mission :
The aim of the EPFL Blue Brain Project (BBP), a Swiss brain research initiative founded and directed by Professor Henry Markram, is to establish simulation neuroscience as a complementary approach alongside experimental, theoretical and clinical neuroscience to understanding the brain, by building the world’s first biologically detailed digital reconstructions and simulations of the mouse brain.
We are now looking for a Site Reliability Engineer to work on our mission-critical IT systems. You would have opportunities to challenge yourself by: Main duties and responsibilities include :
  • Ensuring reliable product launches and successful periodic upgrades upon our 1200+ node HPC cluster, on-premises cloud and container platforms, large-scale parallel file system, NAS and other IT platforms with the help of modern software development, configuration management, CI/CD and infrastructure-as-code approaches
  • Improving IT service reliability by implementing SRE best practices for availability, performance, emergency response and capacity planning
  • Developing monitoring, logging and metrics tools to embrace and minimize risks
  • Automating IT processes - in order to get rid of toil, technical debt and manual work - using modern software engineering practices
  • Contributing to IT security e.g. by establishing industry best practices with regards to periodic patching and other, proactive IT security measures
  • Your profile :
    We expect you to have experience in the following areas:
  • Linux (e.g. RedHat/CentOS, Ubuntu) in production server environments
  • Physical server hardware / data center infrastructure
  • Virtualized and containerized infrastructure
  • Network concepts (e.g. IP routing, DNS, VLANs)
  • Configuration & provisioning tools (e.g. Puppet)
  • Programming and scripting (e.g. Python, bash)
  • We count as advantage your possible experience with:
  • Operating Linux-based hardware infrastructure
  • Operating virtualization, cloud and container platforms (e.g. Kubernetes, OpenStack)
  • Operating storage systems (e.g. NetApp, Spectrum Scale), filesystems, data archiving
  • Operating data centre networks built on Ethernet or InfiniBand
  • Operating HPC systems and software (e.g. Slurm, cluster managers)
  • Implementing & monitoring secure IT infrastructure
  • Leading IT projects / team leadership
  • Our desired candidate would have:
  • Bachelor or Master degree in computer science - or similar degree or working experience
  • Detail-oriented, cautious & professional working practices and attitude
  • Interest to improve IT processes (e.g. access control, security)
  • Experience managing and completing IT projects
  • Interest to work in a collaborative and multi-cultural environment
  • Proven ability to work both independently and in team-based environments
  • Fluent communication in English (written and spoken)
  • We offer :
  • A world-recognized leader in simulation-based research in neuroscience
  • A dynamic, multidisciplinary, international and collaborative working environment committed to benefitting the global community
  • A modern working environment, based at the Biotech Campus in Geneva Sécheron
  • calendar_todayvor 2 Tagen


    info Full time

    location_on Geneva, Schweiz

    work Epfl

    Ich ermächtige ausdrücklich die Bedingungen und Konditionen

    Ähnliche Jobs