Site Reliability Engineer Job at iCIMS, Holmdel, NJ

NFRYQ2VrU3d0RUhZWW9PZHh4MVpDbkZiL0E9PQ==
  • iCIMS
  • Holmdel, NJ

Job Description

Job Summary

We are seeking a skilled Engineer, Site Reliability (SRE) to contribute to the reliability, scalability, and performance of our multi-cloud SaaS platform serving thousands of customers worldwide. This role involves hands-on technical work in incident response, system monitoring, automation, and continuous improvement of our platform reliability. The successful candidate will work within a global SRE team to ensure optimal system performance and customer satisfaction.

Responsibilities

  • System Monitoring & Reliability:
    • Monitor multi-cloud infrastructure (AWS, Azure, GCP) using New Relic, Grafana, and Sumo Logic
    • Maintain reliability of AWS resources, Auth0/Okta authentication, databases, and legacy applications
    • Implement monitoring, alerting, and dashboards for assigned systems
  • Incident Management & Response:
    • Respond to alerts and incidents within SLA timeframes
    • Perform root cause analysis and document findings
    • Create and maintain runbooks and troubleshooting procedures
    • Participate in 24/7 on-call rotation
  • Automation & Improvement:
    • Develop scripts to reduce manual operational overhead
    • Build monitoring and alerting solutions
    • Support infrastructure-as-code initiatives
    • Implement automated remediation where possible
  • Success Metrics:
    • Customer Impact : Reduced MTTR and improved customer satisfaction scores
    • Reliability : Achievement of 99.9%+ uptime SLAs across all products and regions
    • Proactive Prevention: Reduction in incident frequency through automated detection and prevention
    • Cross-functional Collaboration: Improved partnership metrics with Product, Engineering, and Customer Success teams
    • Automation Delivery: Complete assigned automation projects to reduce manual tasks
    • Knowledge Sharing: Contribute to team knowledge base and mentor junior engineers

Qualifications

  • 4+ years experience in SRE, DevOps, or Infrastructure Engineering
  • Hands-on experience with AWS (required) and Azure (preferred)
  • Strong Linux system administration skills
  • Experience with monitoring tools (New Relic, Grafana, Prometheus)
  • Scripting skills in Python, Bash, or similar
  • Knowledge of databases (SQL Server, PostgreSQL, MongoDB)

Job Tags

Worldwide,

Similar Jobs

Little Bay Pet Services, LLC

Pet Sitter Job at Little Bay Pet Services, LLC

What qualities do we look for in a great pet sitter? A genuine love for animals and a desire to provide outstanding pet care! Ability to self-manage and has good communication skills with pet owners and management. An enthusiasm for long-term commitments. Our...

Axiom Software Solutions Limited

Network Engineer L3 Job at Axiom Software Solutions Limited

 ...interoperability. Experience with SDWAN design, deployment, and troubleshooting. Preferred Qualifications Certifications such as CCNA, CCNP, CCIE (or equivalent). Experience automating network management tasks using scripting languages or automation tools is a... 

Food Plant Engineering, LLC

Construction Superintendent - Traveling Job at Food Plant Engineering, LLC

 ...Opportunity for a Construction Superintendent (Traveling) to oversee the construction and installation of food production facilities across the country. The ideal candidate should have excellent project management skills, attention to detail, and a commitment to safety... 

Good, inc

Plumbing Service Mechanic Job at Good, inc

 ...Good Plumbing, Heating, and Air Conditioning, Inc. handles plumbing, heating, and cooling needs throughout Bucks, Montgomery, Chester, and Lehigh counties. We provide a wide variety of services for our residential and commercial customers including new installations,... 

County of Armstrong

Adult Probation Officer - Full Time Job at County of Armstrong

 ...driving record check. Must obtain Criminal History Clearance, Child Abuse Clearance. Work Experience 2 years experience in probation or related field is preferred. Comprehensive benefit package includes contributory healthcare, dental, vision, short-term disability...