Systems Reliability Engineer - Data License

Careers at Bloomberg

New York

Posted Aug 23, 2016 - Requisition No. 53688

You are an engineer who is passionate about solving hard problems. You love working in a high energy, challenging environment with smart, talented peers, and you want to automate large scale engineering processes by designing self-monitoring and self-healing systems.

If this sounds like you, the Systems Reliability Engineering team in Data License should be your next move. Data License is Bloomberg's industry-leading product that offers enterprise-level programmatic interfaces to reference, regulatory and market data. Our systems deliver billions of data points every day on all types of financial instruments to the world's leading investment banks, brokers, asset managers, custodians and fund administrators. With thousands of customers depending on our services, your mission will be to ensure our products can meet and exceed our customers' high performance and reliability requirements, both now and in the future.

We’ll trust you to:

  • Manage availability and scalability of the Data License system by instilling engineering reliability into our development lifecycle, with a focus on fault tolerant approaches
  • Automate aspects of our development lifecycle with an eye towards Continuous Integration and deployment
  • Respond to and resolve unexpected and potential service problems and write software to prevent problem recurrence
  • Create automated tools for monitoring system health
  • Drive capacity planning, performance analysis, instrumentation and other non-functional systems requirements
  • Review and influence ongoing design, architecture, standards and methods for improving our services
  • Manage system releases, writing production software acceptance tests, and coordinating with product and data/technical support teams

You’ll need to have:

  • A Bachelor’s degree in Computer Science or equivalent experience
  • 3+ years of experience with Python, Perl or other scripting languages
  • Extensive experience with Linux/UNIX
  • Excellent communications skills with technical and non-technical audiences
  • Ability to handle on-call duty as well as out-of-band requests

We’d love to see:

  • Prior experience as a systems performance or site/systems reliability engineer
  • 3+ years of C++ and/or Java development
  • Experience with Jenkins or other Continuous Integration tools
  • Experience with Robot Framework or other test automation frameworks
  • Familiarity with virtualization and Infrastructure as a Service models
  • Experience working in a high-volume or critical production service environment
  • Expertise analyzing and troubleshooting large-scale distributed systems
Similar jobs