Site Reliability Engineer (Onsite Role) at Philadelphia, Pennsylvania, USA |
Email: [email protected] |
http://bit.ly/4ey8w48
https://jobs.nvoids.com/job_details.jsp?id=939240&uid=

From: Syed Musharaf, Talent Groups [email protected]
Reply to: [email protected]

Location: Philadelphia, PA
Duration: 8+ Months

Qualifications:
Bachelor's or higher degree in Computer Science, Software Engineering, or a related field.
Extensive experience in software engineering with a focus on observability, monitoring, and SRE.
Strong expertise in designing and implementing distributed systems for high availability and reliability.
Proficiency in APM (application performance monitoring), RUM (real user monitoring), synthetics, correlation, and alert and incident management is required (e.g., OTEL, Jaeger, Kloudfuse, ServiceNow).
Proficiency in one or more programming languages (e.g., Java, Python, Go).
Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).
In-depth knowledge of observability tools and frameworks (e.g., Prometheus, Grafana, ELK stack, Datadog, Aternity) and incident management processes.
In-depth knowledge of ML and AI frameworks and techniques (e.g., anomaly detection, outlier detection, AIOps, LLMs).
Excellent communication and collaboration skills.
Demonstrated ability to lead technical initiatives and mentor team members.

Preferred Qualifications:
Certifications in relevant areas such as AWS Certified DevOps Engineer, Certified Kubernetes Administrator (CKA), or equivalent.
Previous experience in a leadership or management role.
Familiarity with Infrastructure as Code (IaC) tools such as Terraform, Packer, and Crossplane.

Keywords: artificial intelligence, machine learning, Golang, Pennsylvania |
10:38 PM 14-Dec-23 |