Hewlett Packard Enterprise Logo

Hewlett Packard Enterprise

Systems Analyst (/Site Reliability Engineer)

Reposted 10 Days Ago
Remote
Hiring Remotely in Tennessee
120K-275K Annually
Junior
Remote
Hiring Remotely in Tennessee
120K-275K Annually
Junior
The Systems Analyst/Site Reliability Engineer will maintain, optimize, and deploy large-scale HPC systems, ensuring reliability for scientific research while collaborating with technical teams and users.
The summary above was generated by AI
Systems Analyst (/Site Reliability Engineer)

  

This role has been designed as ‘’Onsite’ with an expectation that you will primarily work from an HPE partner/customer office.

Who We Are:

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career our culture will embrace you. Open up opportunities with HPE.

Job Description:

   

We are seeking a skilled Systems Analyst (/Site Reliability Engineer) at HPE to support Oak Ridge National Laboratory (ORNL). This is a unique, on site, customer facing opportunity to work with some of the world's most advanced high-performance computing (HPC) systems, including Frontier, the world’s first exascale supercomputer. As part of our team, you will play a critical role in the deployment, maintenance, and optimization of large-scale computing software infrastructure and hardware, ensuring system reliability for cutting-edge scientific research.

Responsibilities:

  • Maintain and optimize compute infrastructure across multiple large-scale HPC systems.
  • Participate in the deployment, testing, and validation of live high-performance computing clusters.
  • Troubleshoot node failures by analyzing OS internals, compiler behavior, and system logs, coordinating with internal subject-matter experts as needed.
  • Conduct routine and on-demand maintenance, troubleshooting, and performance tuning for large-scale HPC environments.
  • Collaborate with researchers, engineers, and technical staff to open, maintain and close JIRA tickets to ensure system reliability and efficiency for high-stakes, high-performance scientific research.
  • Investigate and document complex software and system-level issues, acting as a bridge between users and HPE internal teams.
  • Develop and implement automation tools, scripts, and monitoring solutions to streamline system management.
  • Stay up-to-date with advancements in HPC technologies, including GPU acceleration (e.g., ROCm), parallel computation (Cray PE, MPI/OpenMP), and performance tuning.

Requirements:

  • Due to the nature of the work, this position requires either U.S. Citizenship or U.S. Lawful Permanent Resident (LPR) status.
  • System Experience: Experience using SLURM-based HPC systems, both as a user and preferably as a system administrator.
  • Technical Skills: Proficient in Linux, Python, and Bash scripting. Familiarity with C++/Fortran-based HPC application development, GPUs, MPI, and high-performance computing tools.
  • Application Build and Configuration Knowledge: Strong understanding of application build processes, including compiler configurations, library integration, and dependency management, to effectively set up environments, perform upgrades, and troubleshoot build and runtime issues.
  • Log analysis: Experience in large-scale log analysis and troubleshooting performance, bugs or system failures.
  • Communication Skills: Strong written and verbal communication skills, with the ability to document and share knowledge effectively with internal teams and end-users.
  • Industry Knowledge: Familiarity with emerging HPC trends, system architectures, and optimization strategies.

Education:

  • Bachelor’s in Computer Science, Computer Engineering, or a related field, with at least 2 years of experience, OR a Master’s in Computer Science or Computer Engineering of a related field.

#unitedstates

Additional Skills:

Accountability, Accountability, Active Learning, Active Listening, Bias, Business Growth, Client Expectations Management, Coaching, Creativity, Critical Thinking, Cross-Functional Teamwork, Customer Centric Solutions, Customer Relationship Management (CRM), Design Thinking, Empathy, Follow-Through, Growth Mindset, Information Technology (IT) Infrastructure, Infrastructure as a Service (IaaS), Intellectual Curiosity (Inactive), Long Term Planning, Managing Ambiguity, Process Improvements, Product Services, Relationship Building {+ 5 more}

What We Can Offer You:

Health & Wellbeing

We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

Personal & Professional Development

We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.

Unconditional Inclusion

We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Let's Stay Connected:

Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.

#unitedstates#highperformancecompute, #servicesandsupport

Job:

Services

Job Level:

TCP_05

    

States with Pay Range Requirement

The expected salary/wage range for a U.S.-based hire filling this position is provided below. Actual offer may vary from this range based upon geographic location, work experience, education/training, and/or skill level. If this is a sales role, then the listed salary range reflects combined base salary and target-level sales compensation pay. If this is a non-sales role, then the listed salary range reflects base salary only. Variable incentives may also be offered. Information about employee benefits offered can be found at https://myhperewards.com/main/new-hire-enrollment.html.

USD Annual Salary: $119,500.00 - $275,000.00

HPE is an Equal Employment Opportunity/ Veterans/Disabled/LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here: Equal Employment Opportunity.

Hewlett Packard Enterprise is EEO Protected Veteran/ Individual with Disabilities.

   

HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories.

Top Skills

Bash
C++
Fortran
Hpc
Linux
Mpi
Openmp
Python
Rocm
Slurm

Similar Jobs

An Hour Ago
Remote
2 Locations
174K-261K Annually
Senior level
174K-261K Annually
Senior level
Artificial Intelligence • Productivity • Software • Automation
As a Sr. Software Engineer at Zapier, you'll build and scale robust backend systems for their automation platform, collaborating on various impactful projects, improving user workflows, and ensuring smooth execution of automations.
Top Skills: Ai ToolingDjangoMySQLNext.JsNode.jsPostgresPythonReactRestful ApisTypescript
An Hour Ago
Remote
2 Locations
144K-261K Annually
Senior level
144K-261K Annually
Senior level
Artificial Intelligence • Productivity • Software • Automation
The role involves leading engineering efforts in backend and full-stack development, working with enterprise customers, and building scalable systems for automation and asset management, while applying AI tool advancements.
Top Skills: DatadogFastifyGrafanaGraylogKafkaNode.jsPythonReactSqsTypescript
An Hour Ago
Remote
2 Locations
174K-261K Annually
Senior level
174K-261K Annually
Senior level
Artificial Intelligence • Productivity • Software • Automation
As a Senior Backend Engineer at Zapier, you'll design and implement scalable APIs, collaborate on architecture, and integrate AI tools to enhance automation.
Top Skills: Node.jsPython

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account