SRE - TickerPlant
Posted Jun 29, 2020 - Requisition No. 83667
The Ticker Plant team builds the software that manages the storage and distribution of market data in Bloomberg. Market data is the lifeblood of Bloomberg’s business, and countless downstream applications and customers depend on our fast and accurate representation of market events. We need to handle current data volumes peaking at over 200 billion events per day and plan our capacity to handle whatever the market throws at us in the future.
Who are we?
Ticker Plant’s System Reliability Engineering (SRE) team is responsible for the operational success of the Ticker Plant, both right now and over the long term. We have a broad remit stretching from ensuring a robust and scalable production environment to increasing the level of automation and telemetry in our system. Our teams in London, Tokyo and New York reflect the global nature of our mission.
What’s in it for you:
On our team, you'll design and develop scalable services that enhance the stability and reliability of Bloomberg's market data infrastructure. We’ll depend on you to not only help set standards but also partner closely with our application engineers to ensure that all products meet those standards. You'll be trusted to create infrastructure capable of early detection of issues through metrics and signals and take automatic corrective actions.
We'll trust you to:
- Ensure optimal availability, latency, scalability and efficiency of Bloomberg application development. You will do this by advocating engineering reliability into our development life cycle with a focus on fault tolerant approaches
- Respond to and resolve unexpected and potential service problems. You will write software to prevent the same problem happening again
- Drive capacity planning, performance analysis, instrumentation and other non-functional systems requirements
- Review and influence on-going design, architecture, standards and methods for improving operating services
- Own system releases, write production software acceptance tests and coordinate all aspects of the release including coverage and communication plans
You'll need to have:
- Experience with managing systems in production, including triaging and solving outages
- Strong understanding of large-scale distributed systems architecture
- In-depth knowledge of Linux/Unix
- Software engineering experience
- Proficiency in object-oriented languages like Python, Perl, C++ or Java
- Strong Communications skills
- Bachelor's degree
We'd love to see:
- Configuration management system, e.g, Salt, Chef, Ansible
- Experience with blameless postmortems
- A passion for performance excellence, robustness and engineering mindset
If this sounds like you:
Apply if you think we're a good match! We'll get in touch with you to let you know what the next steps are.
In the meantime, check us out at http://www.techatbloomberg.com/
Bloomberg is an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, colour, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Bloomberg is a disability inclusive employer. Please let us know if you require any reasonable adjustments to be made for the recruitment process. If you would prefer to discuss this confidentially, please email firstname.lastname@example.org. Alternatively, you can get support from our disability partner EmployAbility, please contact +44 7852 764 684 or email@example.com