SRE (Site Reliability Engineer) Lead | Local to VA or nearby | 11 years exp | Remote, USA
Email: [email protected]
From: Vinay Chaudhary, Intellicept Inc
Reply to: [email protected]

Senior Site Reliability Engineer Lead

Job Summary
The Senior Support Lead in Site Reliability Engineering (SRE) will be responsible for overseeing the support and reliability operations within the organization. This role will focus on ensuring the stability, performance, and efficiency of the systems while leading a team of support engineers to provide exceptional service.

Key Responsibilities
1. Lead and manage a team of support engineers in resolving incidents, requests, and problems to ensure system uptime and reliability.
2. Collaborate with the engineering and development teams to implement efficient and scalable solutions that enhance system performance.
3. Develop and maintain support documentation, standard operating procedures, and best practices for the support team.
4. Identify opportunities for automation and implement tools to streamline support processes.
5. Monitor system performance and provide recommendations for improvements to optimize system reliability.
6. Participate in on-call rotations to address critical incidents and ensure 24/7 system availability.
7. Conduct regular performance evaluations, provide feedback, and mentor team members to promote professional growth.

Skill Requirements
1. In-depth knowledge of Site Reliability Engineering (SRE) principles and best practices.
2. Proficiency in system monitoring, incident management, and performance tuning tools.
3. Strong understanding of cloud services, microservices architecture, and containerization technologies.
4. Excellent problem-solving skills and the ability to troubleshoot complex technical issues.
5. Experience with scripting languages (e.g., Python, Bash) for automation and tool development.
6. Familiarity with Agile methodologies and DevOps practices for continuous integration and delivery.
7. Strong communication and leadership skills to effectively lead a support team and collaborate with cross-functional teams.
8. Ability to work under pressure, prioritize tasks, and manage multiple projects simultaneously.

Certifications: Relevant certifications in Site Reliability Engineering (SRE) or Cloud Services are a plus.
Skill (Primary): Modern Application Development-DevOps (Modern AD)-Site Reliability Engineering (SRE)
Job Family: Support
Band: E2

JD for Observability Engineer with Dynatrace Knowledge
Must-Have:
10+ years of experience in software development, architecture, and observability solutions.
Experience in developing and implementing observability solutions for production environments.
Proficiency in managing SLI/SLOs, user experience metrics, and performance improvement.
Hands-on experience with monitoring tools like Dynatrace (including Davis AI, RUM, synthetic monitoring, and anomaly detection), Splunk, New Relic, LogicMonitor, and other AIOps tools.
Proven ability to lead observability teams and collaborate with cross-functional stakeholders.
Scripting skills (Python, PowerShell, Bash) and experience with CI/CD tools for automation.
Familiarity with ITSM processes and automation to enhance application support.
Ability to assess and recommend the best tools.
Proficiency in creating dashboards and reports.

Good to Have:
Experience with cloud platforms like AWS, Azure, GCP.
Knowledge of chaos engineering.
Familiarity with customer service platforms such as Salesforce, Mainframe (Hogan), MuleSoft.
Understanding of AI/GenAI/ML and its applications.
Knowledge of the banking or financial services domain.
Open to on-call/support work

01:09 AM 06-Mar-25

