Site Reliability Engineer / Cloud Operations Engineer / Cloud Infrastructure Engineer (Remote)

Site Reliability Engineer 

We are seeking a Site Reliability Engineer (SRE) to build and maintain our cloud infrastructure and to improve its reliability, scalability, and performance. You will collaborate with development and operations teams to automate processes, monitor systems, troubleshoot incidents, and ensure high availability of critical services.

Key Responsibilities:

  • Monitor and maintain production systems and cloud environments.
  • Automate deployment, scaling, and operational tasks.
  • Troubleshoot and resolve infrastructure and application issues.
  • Improve system reliability, performance, and security.
  • Collaborate with engineering teams on CI/CD and DevOps initiatives.

Requirements:

  • Experience with cloud platforms (AWS, Azure, or Google Cloud Platform).
  • Knowledge of Linux, networking, and system administration.
  • Hands-on experience with Kubernetes, Docker, and CI/CD tools.
  • Scripting skills in Python, Bash, or similar.
  • Strong problem-solving and incident management skills.