Senior Site Reliability Engineer

Careers at Bloomberg

New York

Posted Aug 11, 2016 - Requisition No. 53372

As an SRE on the Middleware team, you will help drive the next generation of our Enterprise Messaging platform. Our team owns Bloomberg’s entire messaging infrastructure and is responsible for keeping it up and running, ensuring connectivity for both external clients and internal systems. We architect scalable and secure enterprise messaging systems, leveraging industry standards like IBM MQ as well as Kafka and RabbitMQ. We also develop automated tools to provision new servers, reallocate existing ones, track performance issues and analyze long-term trends to correct them.

We'll trust you to:

  • Drive efficiencies in systems: design, build and deliver automation tools that improve the availability, scalability, latency and efficiency of our systems
  • Drive efficiencies in process: implement and enforce process for change management, emergency response and capacity planning
  • Solve problems relating to mission-critical services and build automation to prevent problem recurrence, with the goal of automating response to all non-exceptional service conditions
  • Participate in an on-call rotation and be available for escalations

You'll need to have:

  • 2+ years of experience working in an SRE role
  • A strong system and software engineering background
  • A solid understanding of system availability, latency and performance
  • Experience with enterprise messaging systems like MQ, Kafka and RabbitMQ running on Linux and Solaris
  • Knowledge of large-scale distributed systems in practice, including multi-tier architectures, application security, monitoring and storage systems
  • Strong programming skills in Java, Python or C++ and the ability to learn new languages as needed

We'd love to see:

  • Prior experience with Agile methodologies like Scrum and Kanban
  • A good understanding of PaaS
