Open Position -

Site Reliability Engineer

REF. NO:
ZEPH_SRE

YOUR CONTRIBUTION TO SOMETHING BIG:

  • Ensure Zephr`s high throughput web platform can meet its SLAs as we scale gracefully:
  • Building and maintaining metrics & logs aggregation pipelines (using 3rd party tools & services as appropriate/cost effective)
  • Identifying and implementing key automated alerts, documenting playbooks
  • Developing internal tools to improve observability
  • Helping team adopt DevOps contributing to cloud architecture, developing internal tooling, guiding terraform rollout
  • Extending core aspects of our Java based "no code" multi tenanted configurable reverse proxy in a safe and performant way

OUR IDEAL TEAMMATE HAS/IS:

  • Strong analytical skills (Engineering or maths major preferred)
  • Experience with SRE & DevOps best practices
  • Production experience with terraform, AWS ECS, GitOps

BROWNIE POINTS:

Experience provisioning and configuring a reverse proxy/load balancer (eg NGinX, HAProxy)