Home - Nextdeavor

Site Reliability Engineer 2

Job Title
Site Reliability Engineer 2
Job ID
27742585
Work Hybrid
Yes
Location
Newton, MA,   Hybrid
Other Location
Description
Site Reliability Engineer 2
6+ Month W2 Contract
Newton, MA (hybrid)

Benefits You’ll Love: 
  • NextDeavor offers health, vision and dental benefits for contract employees
  • Paid sick leave eligibility is contingent on state of residence
  • Optional 401k Plan (excludes employer match)
  • Opportunity to get your foot in the door at a well-established corporation, with potential for extended or permanent full-time employment (NextDeavor boasts an impressive conversion rate of approximately 70%)!
Become a key player as a Site Reliability Engineer:
We are seeking a skilled Site Reliability Engineer (SRE) Level 2 to join our dynamic team. The ideal candidate will have a strong technical background, excellent problem-solving skills, and a passion for enhancing system reliability and performance. You will play a crucial role in monitoring, automating, and optimizing our infrastructure to ensure the seamless operation of our services.

Here’s how you’ll make an impact on the team:
  • System Monitoring and Incident Response: Monitor system health, performance metrics, and availability. Respond promptly to incidents and outages, ensuring minimal downtime.
  • Infrastructure Management: Manage and optimize both cloud and on-premise infrastructure using Infrastructure as Code (IaC) tools.
  • Automation: Develop and maintain automation scripts and tools to enhance operational efficiency and reduce manual tasks.
  • Collaboration: Work closely with development teams to implement CI/CD practices and improve deployment processes.
  • Capacity Planning: Analyze usage patterns and forecast capacity needs to ensure system scalability and reliability.
  • Documentation: Create and maintain comprehensive documentation for systems, processes, and incident response protocols.
  • Security Best Practices: Implement and enforce security measures to protect infrastructure and data.
  • Post-Incident Reviews: Conduct post-mortems on incidents to identify root causes and implement corrective actions.
Here’s what you’ll need to be successful in this role:
  • 1-4 years of experience in Site Reliability Engineering or a similar role
  • Strong knowledge of Linux/Unix systems and proficiency in scripting languages (e.g., Python, Bash)
  • Familiarity with cloud platforms (e.g., AWS) and their services
  • Experience with container orchestration (e.g., Kubernetes, Docker)
  • Proficiency in using monitoring and alerting tools (e.g., Prometheus, Grafana, Nagios)
  • Experience with version control systems (e.g., Git)
  • Strong troubleshooting skills with the ability to diagnose complex system issues
  • Excellent verbal and written communication skills for collaboration with cross-functional teams
  • Understanding of Agile development practices and methodologies
Pay Range:
$35.00 - $38.70/hour

Ready to make your mark? Take the leap and apply directly here: <https://j.brt.mv/jb.do?reqGK=27742585&refresh=true> – your application is in good hands.

Option 1: Create a New Profile

©NextDeavor 2022