Infrastructure SRE - Data Center
Our Team:
Bloomberg’s Data Center SRE Team is trusted to help automate all infrastructure delivery. We collaborate regularly with teams across Bloomberg to help implement new technologies and strategies for maintaining the multibillion-dollar Data Center operation.
What’s in it for you:
You’ll build infrastructure as code and automate away the toil. We'll trust you to define, document and automate all operational processes. Through the collaborative development of automation and standardized configuration utilities, you’ll provide focused engineering and operational support. For example, our Infrastructure Teams will depend on tools that you develop to automate the standup and support of all compute. You will be at the center of our hub for innovation and you’ll get the opportunity to evaluate emerging technologies along with their impact on various technology infrastructure services.
Who you are:
You have deep knowledge of hardware and software and a constant thirst for learning about new technologies, which you use to find the best solutions to multi-factor problems. You have a proven track record of integrating hardware with software to automate away problems for your stakeholders.
We’ll trust you to:
- Design, develop, implement and document tools for the standup of compute infrastructure as well as orchestration workflows
- Support existing hardware monitoring using open-source software (OpenNMS)
- Develop and maintain documentation, training, and SLAs for managed infrastructure
- Act as point of escalation for operations teams in supporting new technologies
You need to have:
- BS in Computer Science/Engineering or equivalent experience
- 3+ years of experience with two or more of the following: Python, Go, Ruby, Rust, HTML, Flask, Jinja2, JavaScript, React, Angular, Angular 2, Vue (Python preferred)
- 2+ years of experience with infrastructure engineering
- 2+ years of experience with Unix-like systems (RHEL, Ubuntu, etc.)
- Experience with BMC configuration; IPMI is a must (Redfish a plus)
- Experience working with Salt or Ansible for orchestration (preferably Salt)
- A good knowledge of SDLC concepts
- Experience with server/storage hardware technologies from third-party vendors such as HP, Supermicro, Dell EMC, and Oracle; OCP a plus
- Operational expertise running production-grade systems and the ability to solve problems related to capacity and performance
- Knowledge of how network traffic gets routed and its integration within the full stack
We’d love to see:
- Working knowledge of Jira concepts