Contract role || Site Reliability Engineer (SRE) 10+ Years Assessments & Implementation || On-site in Charlotte NC/ Raleigh NC at Charlotte, North Carolina, USA |
Email: [email protected] |
http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=2152687&uid= Hi All, Hope you are doing good. Kindly share resume mentioning visa status and LI URL for prompt revert. Please dont share pure Devops consultant. Job Title: Site Reliability Engineer (SRE) Assessments & Implementation (10+Years) Location: On-site in Charlotte NC/ Raleigh NC Job Type: Contract Rate: $60-$65/hr. C2C No GC/H1B-T/OPT for the roles. Need PP number and LI URL of the candidate as must for submission. Role Overview As an SRE, you will be responsible for conducting comprehensive reliability assessments, identifying gaps in infrastructure and processes, and implementing solutions to enhance system performance. You will collaborate with cross-functional teams to ensure systems meet scalability, uptime, and resilience goals through automation, observability, and best practices. Key Responsibilities SRE Assessments & Recommendations: Perform end-to-end reliability assessments of systems, identifying gaps in scalability, performance, and fault tolerance. Define SLIs, SLOs, and SLAs aligned with business objectives and implement error budgets. Develop short-term fixes (e.g., reducing downtime, automating manual tasks) and long-term strategies (e.g., architectural redesign, cloud migration). Implementation & Automation: Design and execute infrastructure improvements using IaC tools (Terraform, CloudFormation) and containerization platforms (Kubernetes, Docker). Build and optimize CI/CD pipelines (Jenkins, GitLab CI) to streamline deployments and reduce manual intervention. Automate incident response, monitoring, and recovery processes to minimize MTTR (Mean Time to Recovery). Monitoring & Incident Management: Implement robust observability solutions using tools like Prometheus, Grafana, Datadog, or ELK Stack. Conduct root cause analysis (RCA) and lead incident postmortems to prevent recurrence. Collaboration & Leadership: Partner with DevOps, development, and operations teams to embed SRE practices into workflows. Advise stakeholders on SRE best practices and cost-effective scalability strategies. Technical Requirements Mandatory Skills: 8+ years of hands-on SRE experience with a focus on system reliability and performance optimization. Expertise in cloud platforms (AWS, GCP, Azure) and infrastructure-as-code (Terraform, CloudFormation). Proficiency in scripting (Python, Bash, Go) and automation tools (Ansible, Puppet). Strong background in CI/CD pipelines and observability tools (Prometheus, Grafana, Datadog). Experience defining SLIs/SLOs, conducting reliability assessments, and driving measurable improvements. Preferred Qualifications: Certifications: Google Professional SRE, AWS Certified DevOps Engineer, or CKA/CKAD. Experience in consulting/advisory roles with a focus on SRE best practices. Background in industries with high uptime demands (e.g., fintech, SaaS). Warm regards, Yogesh Pratap Singh( Yogi) Direct (205)-775-0773 244 Fifth Avenue, Suite R295, New York, NY 10001 Email: [email protected] Web: http://elgebra.com/ https://www.linkedin.com/in/yogesh-pratap-singh-282a744a/ Donate Red || Save Blue || Spread Green Keywords: continuous integration continuous deployment golang green card New York North Carolina Contract role || Site Reliability Engineer (SRE) 10+ Years Assessments & Implementation || On-site in Charlotte NC/ Raleigh NC [email protected] http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=2152687&uid= |
[email protected] View All |
01:07 AM 07-Feb-25 |