| Hybrid Job Title : Site Reliability Engineer AND at Atlanta, Georgia, USA |
| Email: [email protected] |
|
http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=2253009&uid= Job Title : Site Reliability Engineer Location: Atlanta, GA) Hybrid MOI: Skype Job details : We are seeking a Site Reliability Engineer to join our Retail Site Reliability Engineering team in Atlanta, GA. In this role, you will be at the forefront of Cloud and Big Data technologies, driving innovation and operational excellence for highly available, business-critical applications. This is an opportunity to establish yourself as a technical leader, working with cutting-edge tools and processes to enhance reliability, automation, and performance across both on-premise and AWS environments. As a Site Reliability Engineer, you will serve as the escalation point for complex and undefined issues, leveraging your expertise in DevOps, automation, infrastructure orchestration, and continuous integration. The ideal candidate is a problem solver who is not constrained by traditional methods and is passionate about driving efficiency and scalability. Key Responsibilities: Engineer and optimize data streaming and API components in OpenShift (On-Premise) and AWS. Identify and implement optimizations to reduce response times for various application components. Automate testing, deployment, and delivery pipelines to ensure seamless production releases. Develop integrations between on-premise and AWS environments and third-party tools such as ServiceNow, VersionOne, and Sumo. Define and monitor Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to maintain system reliability. Lead troubleshooting efforts for performance degradation and undefined platform issues, documenting solutions for future reference. Experiment with emerging cloud technologies to enhance infrastructure capabilities and drive innovation. Design and implement CI/CD pipelines for deploying APIs and data processing jobs. Configure monitoring and alerting metrics to ensure proactive issue resolution. Maintain data integrity and security using AWS tools such as IAM, HSM, and key management services. Develop cost monitoring solutions and implement AWS cost optimization strategies. Work with enterprise security architects to design security measures, mitigate vulnerabilities, and enforce compliance. Expertise in infrastructure automation tools such as Terraform, Ansible, OpenShift Cloud Formation, , and Python. Experience with container orchestration platforms (OpenShift, Kubernetes, Docker). Strong understanding of Linux OS, networking, virtualization, load balancers, firewalls, and storage solutions. Experience with CI/CD tools (GitLab, GitHub, Jenkins, Maven, Gradle, Nexus). Familiarity with performance monitoring and alerting tools. Experience working with highly available, mission-critical applications. Preferred Qualifications: BS degree in Computer Science or a related technical field (or equivalent practical experience). 4-6 years of overall experience in DevOps, SysOps, or Cloud Engineering. 2+ years of experience in application development, data streaming, and deployment/monitoring of high-availability systems. 1+ years in a Site Reliability Engineering (SRE) role is preferred. Experience working in large-scale enterprise environments with security, compliance, and scalability requirements -- Keywords: continuous integration continuous deployment information technology card Georgia Hybrid Job Title : Site Reliability Engineer AND [email protected] http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=2253009&uid= |
| [email protected] View All |
| 06:54 PM 13-Mar-25 |