Job Details

Home

Hiring SRE Engineer ,Philadelphia, Pennsylvania (Day 1 Onsite) at Philadelphia, Pennsylvania, USA

http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=938490&uid=

From:

Swati,

SIRI INFO SOLUTIONS

[email protected]

Reply to: [email protected]

Hello,

Have a nice day...!

Greeting from Siri Info Solutions.

I am Swati with Siri Info Solutions. We Siri Info Solutions is a global staff augmentation firm providing a wide-range of talent on-demand and total workforce solutions. We have an immediate opening for the below position with one of our premium clients.

Job Title: SRE Engineer

Location: Philadelphia, Pennsylvania (Day 1 Onsite)

Interview: WebEx Hire

Duration: Long Term

Job Description:

Responsibilities:

Observability and Monitoring:

Develop and implement robust observability strategies, including logging, metrics, and tracing, to gain deep insights into the performance and health of our systems.

Collaborate with cross-functional teams to establish and enforce best practices for instrumentation, logging, and monitoring throughout the software development lifecycle.

Site Reliability Engineering:

Lead initiatives to improve the reliability, availability, and scalability of our applications and infrastructure.

Collaborate with development teams to design and implement systems that are resilient to failures and capable of quick recovery.

Drive the adoption of SRE principles and practices across the organization.

Incident Management:

Develop and refine incident response processes, ensuring timely detection, analysis, and resolution of incidents.

Collaborate with teams to conduct post-incident reviews, identify root causes, and implement preventive measures.

Automation and Tooling:

Build and maintain automation tools for deployment, monitoring, and incident response to streamline operational processes.

Evaluate and integrate third-party tools to enhance observability and SRE capabilities.

Collaboration and Leadership:

Provide technical leadership and mentorship to the engineering team.

Collaborate with product managers, architects, and other stakeholders to align observability and SRE initiatives with business goals.

Qualifications:

Bachelor's or higher degree in Computer Science, Software Engineering, or a related field.

Extensive experience in software engineering with a focus on observability, monitoring, and SRE.

Strong expertise in designing and implementing distributed systems for high availability and reliability.

Proficiency in APM (Application performance monitoring), RUM (Real user monitoring), Synthetics, correlation, alert & incident management will be required. (e.g., OTEL, Jaeger, Kloudfuse, service-now)

Proficiency in one or more programming languages (e.g., Java, Python, Go).

Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).

In-depth knowledge of observability tools and frameworks (e.g., Prometheus, Grafana, ELK stack, Datadog, Aternity) and incident management processes.

In-depth knowledge of ML & AI frameworks (e.g., Anomaly, Outlier, AIOps, LLM )

Excellent communication and collaboration skills.

Demonstrated ability to lead technical initiatives and mentor team members.

Preferred Qualifications:

Certifications in relevant areas such as AWS Certified DevOps Engineer, Certified Kubernetes Administrator (CKA), or equivalent.

Previous experience in a leadership or management role.

Familiarity with Infrastructure as Code (IaC) tools such as Terraform, Packer & C Crossplane

Please share your Employer details, doc (
Visa copy & DL Copy) & share updated resume

Current Location

Visa status

Prefer time to call you

Contact No.

Skype id

LinkedIn URL

Education details both bachelors & Masters (university, Stream, Year of passing)

Keywords: cprogramm artificial intelligence machine learning access management golang Idaho
http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=938490&uid=

[email protected]
View All

09:13 PM 14-Dec-23

To remove this job post send "job_kill 938490" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.

Your reply to [email protected] -

To

Subject
Message -

swati@siriinfo.com wrote:
From:

Swati,

SIRI INFO SOLUTIONS

swati@siriinfo.com

Reply to:   swati@siriinfo.com

Hello,

Have a nice day...!

Greeting from Siri Info Solutions.

I am Swati   with Siri Info Solutions. We Siri Info Solutions is a global staff augmentation firm providing a wide-range of talent on-demand and total workforce solutions. We have an immediate opening for the below position with one of our premium clients.

Job Title: SRE Engineer

Location: Philadelphia, Pennsylvania (Day 1 Onsite)

Interview: WebEx Hire

Duration: Long Term

Job Description:

Responsibilities:

Observability and Monitoring:

Develop and implement robust observability strategies, including logging, metrics, and tracing, to gain deep insights into the performance and health of our systems.

Collaborate with cross-functional teams to establish and enforce best practices for instrumentation, logging, and monitoring throughout the software development lifecycle.

Site Reliability Engineering:

Lead initiatives to improve the reliability, availability, and scalability of our applications and infrastructure.

Collaborate with development teams to design and implement systems that are resilient to failures and capable of quick recovery.

Drive the adoption of SRE principles and practices across the organization.

Incident Management:

Develop and refine incident response processes, ensuring timely detection, analysis, and resolution of incidents.

Collaborate with teams to conduct post-incident reviews, identify root causes, and implement preventive measures.

Automation and Tooling:

Build and maintain automation tools for deployment, monitoring, and incident response to streamline operational processes.

Evaluate and integrate third-party tools to enhance observability and SRE capabilities.

Collaboration and Leadership:

Provide technical leadership and mentorship to the engineering team.

Collaborate with product managers, architects, and other stakeholders to align observability and SRE initiatives with business goals.

Qualifications:

Bachelor's or higher degree in Computer Science, Software Engineering, or a related field.

Extensive experience in software engineering with a focus on observability, monitoring, and SRE.

Strong expertise in designing and implementing distributed systems for high availability and reliability.

Proficiency in APM (Application performance monitoring), RUM (Real user monitoring), Synthetics, correlation, alert & incident management will be required. (e.g., OTEL, Jaeger, Kloudfuse, service-now)

Proficiency in one or more programming languages (e.g., Java, Python, Go).

Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).

In-depth knowledge of observability tools and frameworks (e.g., Prometheus, Grafana, ELK stack, Datadog, Aternity) and incident management processes.

In-depth knowledge of ML & AI frameworks (e.g., Anomaly, Outlier, AIOps, LLM )

Excellent communication and collaboration skills.

Demonstrated ability to lead technical initiatives and mentor team members.

Preferred Qualifications:

Certifications in relevant areas such as AWS Certified DevOps Engineer, Certified Kubernetes Administrator (CKA), or equivalent.

Previous experience in a leadership or management role.

Familiarity with Infrastructure as Code (IaC) tools such as Terraform, Packer & C Crossplane

Please share your Employer details, doc (
Visa copy & DL Copy) & share updated resume

Current Location

Visa status

Prefer time to call you

Contact No.

Skype id

LinkedIn URL

Education details both bachelors & Masters (university, Stream, Year of passing)

Keywords: cprogramm artificial intelligence machine learning access management golang Idaho

Your email id:

Captcha Image:

Captcha Code:

Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]

Time Taken: 5

Location: Philadelphia, Pennsylvania