Job Details

Home

Principal AWS Site Reliability Engineer || 100% Remote || Contract at Remote, Remote, USA

http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=1684129&uid=

From:

Raveena Mourya,

DMS Visions Inc

[email protected]

Reply to: [email protected]

H

i,

Hope you are doing well !!

I have an urgent position. Kindly go through the Job description and let me know if this would be of interest to you.

Job Title

:
Principal AWS Site Reliability Engineer

L
ocation:
100% Remote

Duration:

24 months Contract

JOB DESCRIPTION

About the job

Environment: DEVops=SRE

AWS

net

Kubernetes

Gravana

KEY REQUIRED SKILLS

Expert: ansible-terraform kubernetes

Expert; AWS devops pro cert Preferred

Good: AWS EKS

Good APM

Overview

We are looking for an outgoing and dynamic Site Reliability Engineer to manage the successful operation and support of our application environments. This position is responsible for overseeing application policies and procedures to ensure the integrity and availability of applications. The Site Reliability Engineer is responsible for working with the product development teams and DevOps teams, focusing on the consideration for web and applications regarding deployment, performance and availability for all applications being developed.

Responsibilities

Drive focused initiatives that improve operational efficiency and scalability of the platform and applications

Drive standardization efforts across multiple disciplines and services in conjunction with embedded SREs throughout the organization

Identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management and visibility of our services Understand modern software security and secure software systems with cloud-based infrastructure

Provide full-stack diagnostics and determine root cause of internal problems

Analyze operational performance which support delivering improvements to critical related system metrics & KPIs

Examine all areas of infrastructure and applications for improvement and suggest changes, rather than wait for direction

Safeguard application information against accidental or unauthorized damage, modification, or disclosure

Build and maintain redundant systems and procedures for high availability and disaster recovery

Develop integrated workflows for our support teams

Own the customer experience think and act in ways that put our customers first, provide them a great digital experience, and make them promoters of our products and services

Respond to and help troubleshoot incidents

Participate in a 24x7 on-call rotation

Key Skills and Competencies Needed

5+ years of extensive experience with Infrastructure as a Code (IaaC) and Desired State Configuration (DSC) tools such as Terraform and Ansible

5+ years of experience packaging, deploying and managing containerized workloads running in common PaaS solutions (i.e. Docker, Kubernetes)

5+ years expertise in managing AWS infrastructure at scale including expertise in the following services: EC2, S3, Elastic Load Balancing, Lambda, Route 53, ECS, SQS, CloudWatch

Prior experience working in a DevOps or SRE environment

Highly experienced with automation and scripting using languages such as: Power, Python, Bash

Large-scale monitoring and reporting experience using ELK stack, Dynatrace (or other APM)

Experience with MS Windows IIS management, troubleshooting, and performance monitoring

Experience managing web farms in a high-traffic SaaS environment

Strong analytical and problem-solving skills including robust troubleshooting skills with a focus on preventative and proactive actions

Extensive experience with .NET applications architecture components (caching, content delivery, high availability, load balancing, etc.)

Understanding of the Software/Application Development Life Cycle process and experience with implementing and maintaining CI/CD technologies including: TeamCity, Octopus Deploy, GitHub, Jenkins, Codefresh, etc.

Knowledge of or experience with most of the following technologies:

Active Directory, SSL, FTP, Big-IP F5, T-SQL, MongoDB, MySQL, SQL Server, Nagios, Git, TeamCity, Octopus Deploy, Codefresh, Chef, Salt, Docker, Kubernetes, Kafka, Azure, Linux Server Administration, Bash, Apache

Thanks & Regards,

Raveena Mourya

US IT Recruiter, DMS Visions Inc

972-325-9476 |
dmsvisions.com/ |
[email protected]

4645 Avon Lane, Suite 210, Frisco, TX 75033

linkedin.com/in/raveena-mourya-766314250

Keywords: continuous integration continuous deployment sthree information technology golang ffive microsoft Texas
Principal AWS Site Reliability Engineer || 100% Remote || Contract
[email protected]
http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=1684129&uid=

[email protected]
View All

08:26 PM 22-Aug-24

To remove this job post send "job_kill 1684129" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.

Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]

Time Taken: 0

Location: ,