Senior Site Reliability Engineer

Job Details

Posted on: June 26, 2025
Job ID: 438

About the Company

Established in 2004, ALLSTARSIT was founded with a clear vision: to enhance the landscape of global IT employment by bridging the gap between companies and skilled professionals. The core belief was that assembling a team shouldn't be hindered by geographical constraints. Today, ALLSTARSIT is an international outstaffing service provider committed to changing the way businesses recruit, compensate, and oversee top talent worldwide.

With operational hubs across Europe, Asia, and LATAM, and its headquarters in San Francisco, US, the company has a workforce of over 1,000 professionals. Operating in more than 20 countries, ALLSTARSIT provides skilled employees across a range of verticals, including AI, cybersecurity, healthcare, fintech, telecom, and media.

About the Project

We are hiring on behalf of SecuringSam, a US-based cybersecurity company that specializes in automating security operations and hardening cloud infrastructure. Their platform helps organizations identify, prioritize, and remediate security vulnerabilities at scale, empowering security and DevOps teams to operate more efficiently and securely.

We are looking for a Senior Site Reliability Engineer to join their fully remote engineering team in Latin America. This role offers the opportunity to work with a US-based core engineering group, modern technologies, and a mission-critical product in the fast-evolving cybersecurity domain.

Required skills:

  • Education: Bachelor’s degree in Computer Science, Engineering, or a related field. Advanced degrees are a plus.
  • Experience: 5–7 years of experience in a similar role, with a strong background in software engineering and systems administration.

Technical Skills:

  • Programming Languages: Proficiency in one or more programming languages such as Python or Go.
  • Cloud Platforms: Extensive experience with AWS.
  • Infrastructure as Code: Hands-on experience with tools like Terraform, Ansible, or CloudFormation.
  • Containerization and Orchestration: Expertise in Docker and Kubernetes.
    • Kubernetes hands-on experience is a must.
    • Preferred Kubernetes technologies: ArgoCD, Linkerd, Prometheus, Karpenter, etc.
  • Monitoring and Logging: Proficiency with monitoring tools such as Prometheus and Grafana, and with logging tools such as the ELK stack or the Loki stack.
  • CI/CD Pipelines: Experience with continuous integration and deployment tools such as Jenkins, GitLab CI, or CircleCI.

Scope of work:

  • System Reliability: Ensure the reliability, availability, and performance of critical systems.
  • Automation: Develop and maintain automation scripts and tools to streamline operations.
  • Monitoring: Develop and maintain monitoring dashboards and alerts.
  • Incident Management: Lead incident response efforts and post-mortem analysis to prevent future occurrences.
  • Performance Tuning: Optimize system performance and scalability.
  • Security: Implement and maintain security best practices.
  • Documentation: Create and maintain comprehensive documentation for systems and processes.
  • On-Call Shifts: Participate in on-call rotations to provide 24/7 support for critical systems.