Job Details

Home

Urgently Hiring:: Site Reliability Engineer::Hybrid (Santa Clara, California) at Santa Clara, California, USA

From:

Priyabrata pradhan,

Vyze

[email protected]

Reply to: [email protected]

Job Description -

Job title: Site Reliability Engineer

Location: Santa Clara, California

Duration: 6 months to 8 months

Visa: no h1b

Need genuine visa

The chances of the last interview being face-to-face are 50%.

Responsibilities:

-Fleet monitoring & recovery of assets in our private cloud environment that houses several compute servers with NVIDIA GPUs.

-Specific focus on building and stabilizing our virtualization infrastructure of ESXi, KVM and Hyper-V.

-Deploy and maintain a large farm of machines using the latest Configuration Management & Infrastructure Automation tools (Chef, Ansible, Terraform).

-Participate in on-call & rotational L1 support for round-the-clock monitoring and remediation of infrastructure issues (PagerDuty)

-Analyze and Debug operating system, networking, configuration and performance problems.

-Assist in roll-out and deployment of infrastructure configurations to supporting the latest hardware and technologies.

-Contribute to the development of monitoring systems to have fast, reliable and real-time pulse of the various infrastructure subsystems (Zabbix, Big Panda, Grafana)

Apply now!

-Bachelors or Masters Degree in Computer Science or Software Engineering, or equivalent experience.

-Good with system and platform debugging

-Virtualization experience (key match if available) - (vSphere, Hyper-V, KVM, Xen server)

-Familiar with Client Configuration tools (Chef (preferred), Ansible)

-Experience working in large scale enterprise production systems. -5+ years of professional experience required.

-Ability to debug and analyze system issues, code to triage, root cause and resolve issues in the infrastructure. Work closely with the platform engineering team in understanding hardware setups.

-Familiar with maintenance and setup of Linux, Windows hosts

-Scripting experience with any of Python, Go. Unix proficiency.

-Experience with version control systems like Perforce, GIT.

Preferred:

-Familiar with private cloud setups (VMware, Dell, Apple)

Scripting (bash, python, go)

-Experience with VM and hardware virtualization technologies like VMware, KVM, Hyper-V, Docker and Kubernetes.

-Background with automating bare metal and VM provisioning.

-Experience with supporting GPUs, embedded device development, driver development and CUDA/TensorRT applications.

-Development experience in Chef, Ansible and infrastructure orchestration.

Thanks & Regards

Priyabrata Pradhan

Email: [email protected]

Keywords: golang
Urgently Hiring:: Site Reliability Engineer::Hybrid (Santa Clara, California)
[email protected]

[email protected]
View All

08:21 PM 16-Dec-24

To remove this job post send "job_kill 2015644" as subject from [email protected] to [email protected]. Do not write anything extra in the subject line as this is a automatic system which will not work otherwise.

Your reply to [email protected] -

To

Subject
Message -

ppradhan@vyzeinc.com wrote:
From:

Priyabrata pradhan,

Vyze

ppradhan@vyzeinc.com

Reply to:   ppradhan@vyzeinc.com

Job Description -

Job title: Site Reliability Engineer

Location: Santa Clara, California

Duration: 6 months to 8 months

Visa: no h1b

Need genuine visa

The chances of the last interview being face-to-face are 50%.

Responsibilities:

-Fleet monitoring & recovery of assets in our private cloud environment that houses several compute servers with NVIDIA GPUs.

-Specific focus on building and stabilizing our virtualization infrastructure of ESXi, KVM and Hyper-V.

-Deploy and maintain a large farm of machines using the latest Configuration Management & Infrastructure Automation tools (Chef, Ansible, Terraform).

-Participate in on-call & rotational L1 support for round-the-clock monitoring and remediation of infrastructure issues (PagerDuty)

-Analyze and Debug operating system, networking, configuration and performance problems.

-Assist in roll-out and deployment of infrastructure configurations to supporting the latest hardware and technologies.

-Contribute to the development of monitoring systems to have fast, reliable and real-time pulse of the various infrastructure subsystems (Zabbix, Big Panda, Grafana)

Apply now!

-Bachelors or Masters Degree in Computer Science or Software Engineering, or equivalent experience.

-Good with system and platform debugging

-Virtualization experience (key match if available) - (vSphere, Hyper-V, KVM, Xen server)

-Familiar with Client Configuration tools (Chef (preferred), Ansible)

-Experience working in large scale enterprise production systems. -5+ years of professional experience required.

-Ability to debug and analyze system issues, code to triage, root cause and resolve issues in the infrastructure. Work closely with the platform engineering team in understanding hardware setups.

-Familiar with maintenance and setup of Linux, Windows hosts

-Scripting experience with any of Python, Go. Unix  proficiency.

-Experience with version control systems like Perforce, GIT.

Preferred:

-Familiar with private cloud setups (VMware, Dell, Apple)

Scripting (bash, python, go)

-Experience with VM and hardware virtualization technologies like VMware, KVM, Hyper-V, Docker and Kubernetes.

-Background with automating bare metal and VM provisioning.

-Experience with supporting GPUs, embedded device development, driver development and CUDA/TensorRT applications.

-Development experience in Chef, Ansible and infrastructure orchestration.

Thanks & Regards

Priyabrata Pradhan

Email: ppradhan@vyzeinc.com

Keywords: golang 
Urgently Hiring:: Site Reliability Engineer::Hybrid (Santa Clara, California)
ppradhan@vyzeinc.com

Your email id:

Captcha Image:

Captcha Code:

Pages not loading, taking too much time to load, server timeout or unavailable, or any other issues please contact admin at [email protected]

Time Taken: 5

Location: Santa Clara, California