HPC Engineer/Architect at New York, New York, USA |
Email: rishabh.s@e-solutionsinc.com |
http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=2211762&uid= Hi Recruiters, I hope you are doing well!!! We have an urgent requirement for the position of " HPC Engineer/Architect ". Please go through the JD & share your updated resume if you find it interesting. Mention your consultant visa status also Job title: HPC Engineer/Architect Job Location: New York, NY Work Model: Hybrid Desired Start Date: 3/3/2025 Job Summary: You will support day-to-day operations of large-scale parallel file systems, deploy and maintain Linux HPC infrastructure across multiple data centers, and assist HPC engineers and architects with day-to-day operations and tickets. Support day-to-day operations of large-scale parallel file systems Deploy and Maintain Linux HPC infrastructure across multiple datacenters Assist HPC engineers and architects with day-to-day operations and tickets Required Skills: Linux Operating Systems (RHEL/CentOS), Parallel file system (GPFS), Job Scheduler LSF/Slrm Anxible, Python, scripting GPU-based compute infrastructure (including CUDA) CentOS 4.5 HPCC Responsibilities: Design, architect and oversee implementation of Linux based HPC clusters and storage Deploy physical hardware using HPC deployment tools and configuration and orchestration tools (Ansible) Parallel file system (GPFS) performance tuning, monitoring and troubleshooting Perform systems benchmarking, and developing automated tests for the HPC environment, ensuring the reliability and efficiency of our computational infrastructure Infiniband network maintenance and troubleshooting Automate and monitor the HPC user lifecycle process Slurm installation, configuration, performance tuning and troubleshooting Plan, design and implement a transition from the LSF scheduler to Slurm Manage the Slurm scheduler and translate Research policies into scheduler configurations Consult with faculty and students to develop research pipelines for use on the HPC cluster Develop and maintain user lifecycle software suite in Python, implement CI/CD pipeline Test and automate upgrades of critical system applications using Ansible and scripts. The ability to communicate effectively with clinicians, researchers, and other team members to develop technological solutions is key Qualifications: Experience working in a large-scale research based HPC environment Proven experience working with distributed file storage solutions (i.e., GPFS) Experience with deploying and troubleshooting Linux Operating Systems (RHEL/CentOS) Experience with Scripting and Automation (Ansible, Python, Scripting) Solid understanding of job schedulers (LSF/SLURM) Experience with GPU-based compute infrastructure (including CUDA) Thanks and regards, Rishabh Singh E: rishabh.s@e-solutionsinc.com 2N Market St,Suite # 400, San Jose, CA-95113 USA | CANADA | UK | SINGAPORE | MALAYSIA | INDIA www.e-solutionsinc.com Disclaimer: E-Solutions Inc. provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation,gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state and local laws. We especially invite women, minorities, veterans, and individuals with disabilities to apply. EEO/AA/M/F/Vet/Disability. -- Keywords: continuous integration continuous deployment information technology golang California New York HPC Engineer/Architect rishabh.s@e-solutionsinc.com http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=2211762&uid= |
rishabh.s@e-solutionsinc.com View All |
07:54 PM 27-Feb-25 |