Site Reliability Engineer - Canada

Toronto Tech Ops-610

At Okta our motto is "Always On", and nowhere do we embrace that more than in Technical Operations. We strive to build the most reliable and performant systems on the planet through the skillful use of automation. If you like to be challenged and have a passion for solving problems at scale with automation, testing and tuning then we would love to hear from you. The ideal candidate is someone who exemplifies the ethics of, “If you have to do something more than once, automate it,” and who can rapidly self-educate on new concepts and tools.

You will work on:

  • Designing, building, deploying Okta's production infrastructure
  • Running and improving the deployment process across Okta’s production sites
  • Identifying and automating manual processes
  • Promoting and applying best practices for building scalable and reliable services across engineering
  • Developing and maintaining technical documentation, runbooks, and procedures
  • Supporting a 24x7 online environment as part of an on-call rotation 

You are an ideal candidate if you:

  • Have experience automating and deploying large scale production Java/Tomcat services in AWS (EC2, ECS, KMS, Kinesis, RDS) or other cloud providers
  • Strong understanding of CI/CD principles and tools.
  • Have experience writing infrastructure as code using tools such as Chef and Terraform
  • Experience using monitoring services
  • Linux fundamentals
  • Scripting skills for operational tooling in Bash, Ruby, Python, Go or similar
  • Experience running container technology in production

Education and Training:

  • B.S. Computer Science (plus) or relevant experience

Okta is an equal opportunity employer 

#L1-JA1


Okta

okta.com

Okta, Inc. is a publicly traded identity and access management company based in San Francisco. It provides cloud software that helps companies manage and secure user authentication into modern applica...


View all jobs
Apply now