Site Reliability Engineer

Group 1001 Resources, LLC
๐Ÿ“ USA ๐Ÿ’ผ full_time ๐Ÿ’ฐ $180,000 - $200,000/year
Apply Now ๐Ÿ“… 14 hours ago

Job Description

Join GROUP1001 as a Site Reliability Engineer: Build Resilient Systems and Shape the Future

Are you passionate about building highly reliable and scalable systems? Do you thrive in a collaborative environment where you can leverage your expertise in automation, DevSecOps, and cloud technologies to deliver exceptional user experiences? If so, GROUP1001 is looking for a talented Site Reliability Engineer (SRE) to join our growing team.

At GROUP1001, you’ll be at the forefront of ensuring the availability, performance, and overall health of our critical systems and applications. You’ll work hand-in-hand with development, operations, and security teams to design, implement, and maintain robust infrastructure solutions that power our business.

Key Responsibilities:

  • Design, implement, and maintain highly available and scalable infrastructure solutions on cloud platforms like AWS, Azure, and GCP.
  • Champion and manage DevSecOps practices across the entire project lifecycle, fostering a culture of collaboration and continuous improvement.
  • Use monitoring and observability tools (preferably Grafana) to proactively monitor system health and troubleshoot issues in real time.
  • Leverage your strong Git skills and comfort with trunk-based workflows and semantic versioning (semver) for efficient code management.
  • Design and implement infrastructure CI/CD pipelines to automate the deployment of geospatial software and manage infrastructure effectively.
  • Conduct regular system audits to identify and address potential risks before they impact project delivery.
  • Ensure strict compliance with data governance and security policies throughout the geospatial project lifecycle.
  • Provide technical guidance and mentorship to junior team members, fostering a culture of continuous learning and growth.
  • Proactively prevent incidents by implementing effective alerting mechanisms based on symptom monitoring.
  • Collaborate effectively with various teams, including Data Platforms, NOC/SOC, and IT security, to ensure seamless operations.
  • Develop and maintain effective monitoring systems with proactive and reactive alerts to identify and address issues promptly.
  • Create comprehensive system health dashboards to provide clear visibility into system performance.
  • Build end-user monitoring dashboards to gain valuable insights into user experience.
  • Partner with delivery teams to provide data-driven insights based on monitoring data.
  • Manage deployments and incidents effectively to minimize disruptions and ensure smooth operations.
  • Integrate alerts with our notification engine for timely communication and incident response.

Qualifications:

  • 10-14 years of relevant experience.
  • Strong proficiency with Git, GitLab, and infrastructure CI/CD pipelines.
  • Hands-on experience with Terraform and/or Pulumi for infrastructure as code.
  • Proven experience as a Site Reliability Engineer (SRE).
  • Experience working with cloud platforms like AWS and Azure.
  • Experience with Application Performance Monitoring (APM) tooling.
  • Experience automating operational tasks using CI/CD pipelines.
  • Familiarity with Operational Excellence principles, including generating runbooks and facilitating handoffs to L1/L2 support teams.

Preferred Skills:

  • Experience as an SRE in environments using technologies such as AWS, Azure, Angular, REST/GraphQL, Neo4j, and Event Hubs.
  • Proven experience with Service Meshes.
  • Proven experience with Backups and Patching strategies.
  • Proven experience with Policy-as-Code (Rego, OPA).
  • Proven experience with Zero Trust Network Access (ZTNA) Policies.

Compensation:

Our compensation is competitive and reflects the cost of labor across various U.S. geographic markets. The base pay for this position ranges from $180,000 per year in our lowest geographic market up to $200,000 per year in our highest geographic market. Actual pay will be determined based on factors such as market location, job-related knowledge, skills, and experience.

Benefits:

We offer a comprehensive benefits package to employees who meet eligibility guidelines and work 30 hours or more weekly, including:

  • Comprehensive health, dental, and vision insurance plan options.
  • Basic and supplemental life insurance, plus short- and long-term disability coverage.
  • Immediate access to our Employee Assistance Program and wellness programs for all employees, regardless of hours worked.
  • Participation in the Company’s 401(k) plan with matching contributions.
