Blockdaemon Logo

Blockdaemon

Site Reliability Engineer (US)

Reposted 6 Days Ago
In-Office
New York City, NY
Senior level
In-Office
New York City, NY
Senior level
The Site Reliability Engineer ensures reliability, scalability, and performance of systems by collaborating to design, implement, and maintain infrastructure solutions in a multi-cloud environment, focusing on automation, incident management, and security.
The summary above was generated by AI

Position Overview

As a Site Reliability Engineer (SRE), you will play a critical role supporting our Blockdaemon team by ensuring the reliability, scalability, and performance of our systems and services. You will collaborate closely with cross-functional teams to design, implement, and maintain robust and resilient infrastructure solutions in a Multi-Cloud environment.

The ideal candidate is passionate about automation, possesses strong analytical skills, and thrives in a fast-paced, dynamic environment.

Blockdaemon is a Blockchain Infrastructure Company operating in a multi-cloud configuration with a global footprint. The expectation for this role is a candidate capable of supporting systems & infrastructure stack across the major clouds, Google Cloud Platform (GCP) and Amazon Web Services (AWS), Azure.

Your Impact

  • System Architecture and Design: Collaborate with software engineering teams to design scalable, highly available, and resilient systems. Drive architectural improvements to enhance system reliability and performance.

  • Implement Infrastructure as Code to manage services and deployments in a multi-cloud, multi-project configuration.

  • Automation and Tooling: Develop automation tools and scripts to streamline deployment, monitoring, and incident response processes. Implement and maintain infrastructure as code frameworks.

  • Monitoring and Alerting: Configure and maintain monitoring systems to detect and mitigate potential issues proactively. Define alerting thresholds and response procedures to ensure timely incident resolution.

  • Incident Management: Respond to and resolve critical incidents, perform root cause analysis, and implement preventive measures to minimize the likelihood of recurrence. Participate in an on-call rotation to provide 24/7 support as needed.

  • Capacity Planning and Performance Optimization: Analyze system performance metrics, identify bottlenecks, and propose optimizations to improve resource utilization and efficiency.

  • Security and Compliance: Work closely with security teams to implement best practices for data protection, access control, and compliance with regulatory requirements. Conduct periodic security audits and vulnerability assessments.

  • Documentation and Knowledge Sharing: Document system configurations, procedures, and troubleshooting steps. Share knowledge and best practices with team members to foster a culture of continuous learning and improvement.

Role Requirements

Must Have:
  • Proven experience in an independent contributor role working with cloud platforms: GCP, AWS, Azure, Infrastructure-as-Code tooling: Terraform, Helm, and CI/CD orchestration platforms: GitlabCI, ArgoCD, Github Actions or similar GitOps workflows.

  • Excellent problem-solving skills and the ability to independently troubleshoot complex issues.

  • Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams.

  • Strong Architectural & Security Mindset.

Should Have:
  • Strong understanding of Linux/Unix systems administration and networking concepts.

  • Hands-on experience with configuring and running monitoring tools like Prometheus, Grafana, etc.

  • 5+ years experience of maintaining infrastructure-as-code on Google Cloud Platform, Amazon Web Services and Azure.

  • Experience working in SOC 2 Type 1 and Type 2 certified companies.

Nice-to-Have:
  • Proficiency in scripting and programming languages such as BASH, Golang, Python and TypeScript.

  • 2+ years hands-on experience operating highly available Kubernetes clusters.

  • Experience being involved in incident management and resolution.

  • Experience with AI development tools and related security considerations.

  • Passion for the Blockchain Industry & Decentralised Systems.

  • Experience with Blockchain Infrastructure, either in a personal or professional capacity.

About Us:


We Power the Blockchain economy.


Blockdaemon powers the blockchain economy with its suite of industry-leading
infrastructure solutions. We are a globally established, ISO-27001 certified partner with extensive protocol coverage, offering technical depth, industry-leading SLAs, 70+ global points of presence through 10+ cloud and bare metal providers, and 24/7 support for an unmatched institutional-grade experience. We provide integrated business solutions to exchanges, custodians, crypto platforms, financial institutions, and developers using our end-to-end suite of blockchain tools, including dedicated nodes, APIs, staking, liquid staking, MPC tech, and more. Blockdaemon provides its customers with the confidence to quickly and easily scale without compromising security or compliance.


We are a globally distributed team.


Blockdaemon is an Equal Opportunity Employer.

Top Skills

Argocd
AWS
Azure
Bash
GCP
Github Actions
Gitlabci
Go
Grafana
Helm
Prometheus
Python
Terraform
Typescript

Similar Jobs

10 Days Ago
Easy Apply
Remote or Hybrid
5 Locations
Easy Apply
127K-249K Annually
Senior level
127K-249K Annually
Senior level
Big Data • Cloud • Software • Database
The role involves maintaining and improving CI/CD infrastructure using Argo Workflows and Kubernetes, ensuring effective deployment for engineering teams.
Top Skills: AWSAzureGoGCPKubernetesPython
An Hour Ago
Hybrid
5 Locations
178K-313K Annually
Senior level
178K-313K Annually
Senior level
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
The Machine Learning Engineer will develop models to drive user and advertiser value, ensure code quality, and collaborate across teams.
Top Skills: Caffe2PyTorchScikit-LearnSpark MlTensorFlow
An Hour Ago
Easy Apply
Hybrid
New York, NY, USA
Easy Apply
36-40 Hourly
Junior
36-40 Hourly
Junior
HR Tech • Payments • Professional Services • Software
The Payroll Tax Amendment Specialist will manage tax filing issues, execute payroll tax amendments, and ensure compliance with tax regulations. Strong communication and organizational skills are essential, along with the ability to adapt in a fast-paced environment.
Top Skills: ExcelGoogle SheetsPayroll Tax Software

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account