Subscribe to the latest remote jobs:

High-Performance Computing DevOps Architect

🇮🇳 India

Manufacturing

Management

Python

Docker

Ansible

Machine Learning

Design

Devops

High-Performance Computing DevOps Architect

from 🇮🇳 India

Who We Are

Applied Materials is a global leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. We design, build and service cutting-edge equipment that helps our customers manufacture display and semiconductor chips – the brains of devices we use every day. As the foundation of the global electronics industry, Applied enables the exciting technologies that literally connect our world – like AI and IoT. If you want to push the boundaries of materials science and engineering to create next generation technology, join us to deliver material innovation that changes the world. 

What We Offer

Location:

Bangalore,IND, Chennai,IND

You’ll benefit from a supportive work culture that encourages you to learn, develop, and grow your career as you take on challenges and drive innovative solutions for our customers. We empower our team to push the boundaries of what is possible—while learning every day in a supportive leading global company. Visit our Careers website to learn more. 

At Applied Materials, we care about the health and wellbeing of our employees. We’re committed to providing programs and support that encourage personal and professional growth and care for you at work, at home, or wherever you may go. Learn more about ourbenefits. 

 

 

As a Software Engineer at Applied Materials, you’ll dive deep into ground-breaking technologies—like machine learning and AI—to craft novel software solutions that solve our customers’ high-value problems. Our Software Engineers are responsible for designing, prototyping, developing, and debugging software solutions for semiconductor equipment components and devices to ensure quality and functionality. You'll develop software documentation and test procedures, troubleshoot software problems, and communicate with internal customers to understand project requirements. As part of our team, you'll contribute your expertise in intricate systems, deciphering code, and anticipating software behaviors to ensure Applied remains the leader in the semiconductor and display sectors.

 

 

 

Our Team 

Our team is developing ahigh-performance computingsolution for low-latency and high throughput image processingand deep-learningworkloads thatwillenableourChip Manufacturing process controlequipmenttooffer differentiated value to our customers. 

Your Opportunity 

Asan HPC Architect, youwillget the opportunity toarchitecthigh-performancecomputingsolutions from scratch anddesign/optimize all aspects(Compute, Memory, Networking, Storage) for better cost of Ownership. 

Roles and Responsibility 

  • Asan architect, you willbe responsible fordesigning HPC infrastructure solutions, includingcompute, networking, storage, and workload management components. 

  • You will work closely with cross-functional teams, includingHardware, Software, productmanagement, and business stakeholders, to understandcomputeworkload and translatethemintoPlatformarchitecture anddesigns that meet business needs. 

  • You will create andmaintain detailed system architecture diagrams and specifications.  

  • You will evaluate and selectappropriate hardware and software components for HPC environments 

  • You willInstall, configure, and maintain HPC systems, including hardware, software, and networking components 

  • You will develop and implement automation scripts for system management and deployment.  

  • You willbe a subject Matter expert to unblockdependent teamsinthe HPC domain. 

  • You will be expected todevelop systembenchmarks,profile systems to understand bottlenecks,optimize workflows and processes to improvecost of ownership. 

  • Identify and mitigate technical risks and issues throughout theHPC developmentlife cycle. 

  • Ensure thatComputeCluster is resilient,reliable, and maintainable. 

  • You willbeexpected to stay abreast of the latest HPC technologies, including Hardware, Software and Networking Solutions 

  • Your primary focus will beto understand thecomputeworkload and design HPC cluster with right combinationofNodes,CPU/GPU, Memory,Interconnects and storagetohaveoptimum performance at minimum cost of Ownership. 

 

 

Our Ideal Candidate 

Someone who hasthe drive and passion to learn quickly, has the ability tomulti-task and switch contexts based on business needs. 

Qualifications 

  • In-depth experience with Linux Systemadministration and Hardware/Software Configuration. 

  • Strong knowledge of HPC technologies including cluster computing, high speed interconnects (InfiniBand, RoCE), parallel filesystems (Lustre, GPFS,BeeGFSetc) 

  • Experience in creating,maintaining Operating System imageswith different installationand boot schemes 

  • Extremely good with automation tools like Ansible, Chef, Salt-Stack and Scripting languages (Python and Bash) 

  • Experience inCreating,maintaining Storage Solutions with different RAIDconfiguration. 

  • Abilitytodesign storage solution for different IOPS,Access patterns (Random vs SequentialRW) andtune storage andfilesystemsfor better performance. 

  • Goodof knowledgeNetworking concepts including IP addressing, routing,protocols and Switch configuration for RDMA, VLAN configuration, network bonding etc. 

  • Good Knowledge Virtualization,Hardware and Software Hypervisors 

  • Good knowledge of containerization technologies like docker, singularity. 

  • Experience in Software Defined Networking and Storage. 

  • Experience in setting-up remote managementprotocols like IPMI, Redfish etc. 

  • Experience in setting-upand using monitoring systems likePrometheus, Grafana. 

  • ExperienceSystem profiling andcustomtuningfor targetworkloadfor higher performance and low cost of ownership 

  • Very good written and verbal communication skills. 

  • Very goodinTechnical documentationmeant to serve as manuals fornon-experts in thefield. 

 

Additional Qualifications: 

 

  • ExperienceinHPCCluster management andWork-load orchestration software(e.g.SLURM, Torque,LSF) 

  • Experience inSetting-up Deep-learningtraining/inference solutions. 

  • Experience in Private cloudinfrastructure likeKubernetes,OpenStack,CloudStack etc. 

  • Experience inDistributedHigh Performance ComputingandParallel programming frameworks  

  • Good knowledge of Low-latency and high-throughput data transfer technologies(RDMA on RoCE, InfiniBand) 

 

Education: 

Bachelor's Degreeor higherin Computer science or related Disciplines. 

Additional Information

Time Type:

Full time

Employee Type:

Assignee / Regular

Travel:

Relocation Eligible:

No

Applied Materials is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, ancestry, religion, creed, sex, sexual orientation, gender identity, age, disability, veteran or military status, or any other basis prohibited by law.

by @maxrusakovic