Guild Mortgage Logo

Guild Mortgage

Senior Site Reliability Engineer

Posted 16 Days Ago
Remote
Hiring Remotely in United States
95K-136K Annually
Senior level
Remote
Hiring Remotely in United States
95K-136K Annually
Senior level
The Senior Site Reliability Engineer executes reliability strategies, designs and maintains infrastructure, improves monitoring and deployment processes, collaborates with teams for system reliability and performance optimization.
The summary above was generated by AI

Guild Mortgage Company, closing loans and opening doors since 1960. As a mortgage banking firm we are dedicated to serving the homeowner/buyer. Our goal is to provide affordable home financing for our customers, utilizing the best terms available while providing a level of professionalism and service unsurpassed in the lending industry.

Position Summary

The Senior Site Reliability Engineer is responsible for executing the organizational reliability strategy and participating in resiliency design reviews to ensure the reliability, scalability, and performance of our company's software systems and applications meet organizational service level objectives (SLOs) and error budgets. The role is responsible for designing, implementing, and maintaining the infrastructure and tools necessary to support our platforms, as well as improving our monitoring, automation, and deployment processes. This role involves strategic planning, technical leadership, and collaboration with various stakeholders including Guild’s Product Delivery, Data Services, DevOps, DataOps, and Infrastructure teams to support organizational goals.

Compensation

This role is an exempt position with a targeted salary range of $94,882 to $136,096 annually.

Compensation at Guild is influenced by a wide array of factors including but not limited to local and federal minimum wage requirements, education, level of experience, and applicant’s geographical location.

Essential Functions

  • Participate in resiliency design reviews and lead complex problem-solving efforts.
  • Design, implement, and maintain monitoring systems to track the performance, availability, and reliability of services.
  • Respond to incidents promptly, investigate root causes, and coordinate efforts to mitigate and resolve them.
  • Analyze performance data, and plan for scalability and capacity requirements.
  • Identify and optimize performance bottlenecks, both at the infrastructure and application levels.
  • Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
  • Implement and enforce change management practices to ensure safe and controlled changes to the production environment.
  • Design and implement fault-tolerant systems and practices to minimize downtime and ensure service availability.
  • Collaborate with the GRC team on developing and maintaining disaster recovery plans and procedures relevant to the software supported to minimize the impact of catastrophic failures.
  • Work with the Incident Management and other teams to conduct a thorough analysis of incidents, document postmortem reports, and implement improvements based on lessons learned.
  • Work closely with development, operations, and other teams to foster a culture of reliability, and provide feedback on system design and architecture for improved reliability.

Qualifications

  • Bachelors Degree directly related to the position or equivalent, preferred.
  • A combination of education and experience may be considered in lieu of the Bachelor’s degree.
  • Minimum five years experience.
  • Collaborate with stakeholders to define RPO / RTO for Guild’s system footprint.
  • Expert in Cloud-based redundancy, high availability, and reliability strategies.
  • Expert in reliability, scalability, and performance optimization.
  • Expert at maintaining Linux / Unix and Windows systems administration, provisioning, configuration, monitoring, and troubleshooting Web Servers in a 7x24 customer facing environment.
  • Strong Linux and Windows Administration & scripting.
  • Solid Database Administration skills (MySQL, MariaDB, RDS, Sql Server, and Azure Storage services).
  • Deep knowledge of current methodologies in high performance operations and scalable multi-site implementations.
  • Proven Experience with large-scale software implementation (high transaction volume, high-availability concepts).
  • Deep knowledge of software deployment, versioning (GIT) and release management processes.
  • Experienced with infrastructure design, implementation, and support.
  • Proficient at automated provisioning, automated configuration management, and containerization solutions and tools.
  • Experienced in cloud-based hosting solutions (AWS, Azure, GCP).
  • Experienced with Cloud server environments (AWS, Google Cloud, or Azure).
  • Experienced in Agile software development best practices utilizing Continuous Integration & Delivery Pipelines as well as agile tools such as Jira.
  • Excellent written and verbal communication skills.
  • Proficient in communicating to both technical and management levels.
  • Ability to interact with external customers and staff members.
  • Highly adaptable.
  • Ability to work in a fast paced, constantly expanding environment.
  • Excellent verbal and written communication skills required.
  • Highly organized and detail-oriented; ability to work in a fast-paced, metrics-driven environment required.
  • Proficiency in Microsoft Office Suite, Word, Excel, Wiki, collaborative cloud-based programs, and third-party software applications required.
  • Commitment to company values.
  • Customer Service - Proactive attention to each person.
  • Integrity - Do and say what's right.
  • Respect - Treat others with dignity.
  • Collaboration - Listen and work together.
  • Learning - Seek knowledge and strive for improvement.
  • Excellence – Deliver the unexpected.

Supervision 

Job Scope:  Responsible for understanding the department/functional area objectives and goals and how own job contributes to achievement of these goals; may recommend changes and enhancements based on analysis and evaluation of circumstances.

Complexity:  Problems encountered are often complex and may involve significant resource coordination and availability, evaluating and resolving discrepancies with data, analyses, processes, etc. using own expertise and judgment.

Impact:  Decisions and actions primarily impact own work with moderate impact on peers in their area; contributes as team member rather than leader.

Interaction/Supervision:  Works under broad direction with considerable latitude for independent actions; guided by professional standards, desired outcomes and unit/project/program specifications.

Requirements 

  • Work is primarily sedentary; mobility in an office setting.
  • Ability to operate standard office equipment and keyboards.
  • Regularly required to accurately perceive, distinguish and interpret information received visually and through audio; e.g., words, numbers and other data broadcasted aloud/viewed on a screen, as well as print and other media.
  • Office environment – moderate noise, no substantial exposure to adverse environmental conditions.
  • Travel 5% or less.
  • Learn new tasks, remember processes, maintain focus, complete tasks independently, and make timely decisions in the context of a workflow.
  • Work is primarily performed during the business week, Monday - Friday; occasional night or weekend may be necessary.

Guild offers a pleasant work environment, competitive compensation and excellent benefits package; including medical, dental, vision, life insurance, AD&D, LTD and 401(k) with employer match. 

Guild Mortgage Company is an Equal Opportunity Employer.

REQ#: SENIO018160

Equal Opportunity Employer
This employer is required to notify all applicants of their rights pursuant to federal employment laws. For further information, please review the Know Your Rights notice from the Department of Labor.

Similar Jobs

2 Days Ago
Easy Apply
Remote
United States
Easy Apply
130K-140K Annually
Senior level
130K-140K Annually
Senior level
Artificial Intelligence • Consumer Web • Digital Media • Information Technology • Social Impact • Software
Lead SRE work to keep Circle highly available and performant: respond to incidents, own monitoring/alerting/log management, manage and optimize MySQL/Postgres/ClickHouse/Redis databases, maintain server infrastructure and deployment pipelines, collaborate with engineering teams, and build internal SRE tooling and automation.
Top Skills: AWSClickhouseKubernetesLlm-Based Tools (Copilots)MySQLPostgresRedis
3 Days Ago
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Own reliability, automation, and DevOps for Coinbase's corporate IAM platform: on-call/incident response, CI/CD and IaC pipelines, identity lifecycle tooling, observability and disaster recovery, documentation, and cross-team IAM advisement to ensure secure, scalable access for a global workforce.
Top Skills: AbacAuth0AWSAzureC#Ci/CdContainer OrchestrationDuoEntraidGCPGenerative AiGitGoIacJavaMfaOktaPingPythonRbacRubySsoTerraform
3 Days Ago
Easy Apply
Remote
USA
Easy Apply
186K-219K Annually
Senior level
186K-219K Annually
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Senior SRE on the IT Operations team owning reliability, monitoring, and incident response for AI infrastructure. Build automation, CI/CD and Kubernetes tooling, improve observability and documentation, and develop internal full-stack tools using Go or Python. Partner with Infrastructure, Security, and Compliance to scale secure, resilient AI deployment pipelines.
Top Skills: AnsibleAWSBashChefCi/CdDockerEc2GitGoKubernetesLinuxPuppetPythonRubySaltTerraform

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account