We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Linux/HPC Architect

Cognizant North America
life insurance, parental leave, paid time off, paid holidays, 401(k)
United States, New York, New York
Feb 21, 2025
HPC ARCHITECT
Job Description
  • Support day-to-day operations of large-scale parallel file systems
  • Deploy and Maintain Linux HPC infrastructure across multiple datacenters
  • Assist HPC engineers and architects with day-to-day operations and tickets
  • Experience working in a large-scale research based HPC environment
  • Proven experience working with distributed file storage solutions (i.e., GPFS)
  • Experience with deploying and troubleshooting Linux Operating Systems (RHEL/CentOS)
  • Experience with Scripting and Automation (Ansible, Python, Shell Scripting)
  • Solid understanding of job schedulers (LSF/SLURM)
  • Experience with GPU-based compute infrastructure (including CUDA)
Responsibilities:
  • Design, architect and oversee implementation of Linux based HPC clusters and storage
  • Deploy physical hardware using HPC deployment tools and configuration and orchestration tools (Ansible)
  • Parallel file system (GPFS) performance tuning, monitoring and troubleshooting
  • Perform systems benchmarking, and developing automated tests for the HPC environment, ensuring the reliability and efficiency of our computational infrastructure
  • Infiniband network maintenance and troubleshooting
  • Automate and monitor the HPC user lifecycle process
  • Slurm installation, configuration, performance tuning and troubleshooting
  • Plan, design and implement a transition from the LSF scheduler to Slurm
  • Manage the Slurm scheduler and translate Research policies into scheduler configurations
  • Consult with faculty and students to develop research pipelines for use on the HPC cluster
  • Develop and maintain user lifecycle software suite in Python, implement CI/CD pipeline
  • Test and automate upgrades of critical system applications using Ansible and shell scripts.
  • The ability to communicate effectively with clinicians, researchers, and other team members to develop technological solutions is key
**Applications will be accepted until 3/20/24

Salary and Other Compensation:

  • The annual salary for this position is [$88,200 to $139,500] depending on experience and other qualifications of the successful candidate.
  • This position is also eligible for Cognizant's discretionary annual incentive program, based on performance and subject to the terms of Cognizant's applicable plans.

Benefits: Cognizant offers the following benefits for this position, subject to applicable eligibility requirements:

  • Medical/Dental/Vision/Life Insurance
  • Paid holidays plus Paid Time Off
  • 401(k) plan and contributions
  • Long-term/Short-term Disability
  • Paid Parental Leave
  • Employee Stock Purchase Plan

Disclaimer: The benefits information is accurate as of the date of this posting. Cognizant reserves the right to modify this information at any time, subject to applicable law.

Cognizant is recognized as a Military Friendly Employer and is a coalition member of the Veteran Jobs Mission. Our Cognizant Veterans Network encourages Veterans in building and growing a career at Cognizant that allows them to demonstrate the leadership, loyalty, integrity, and dedication to excellence instilled in them through participation in military service.

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform crucial job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Equal Opportunity Employer M/F/D/V
Applied = 0

(web-7d594f9859-2nzb9)