Railway Logo

Railway

Senior Product Engineer, Scalability

Posted Yesterday
Be an Early Applicant
In-Office or Remote
Hiring Remotely in United States
Senior level
In-Office or Remote
Hiring Remotely in United States
Senior level
Design, implement, and operate high-throughput backend systems for billing, payments, fraud detection, and pipelines. Architect end-to-end metering, rating, invoicing, payment flows, real-time scoring, and scale Postgres, ClickHouse, Node.js services, and Temporal workflows. Build TypeScript+GraphQL APIs, write engineering requirements, contribute to OSS, and participate in on-call rotation.
The summary above was generated by AI

Our core mission at Railway is to make software engineers higher leverage. We believe that people should be given powerful tools so that they can spend less time setting up to do, and more time doing. Railway now powers workloads for millions of builders, and the systems underneath — usage metering, billing and payments, fraud and abuse protection, background workers, and the data pipelines that feed them all — have to scale every week.

You'll be the person who architects that stack: making it fast, correct, and trustworthy at a volume that keeps growing. If you're looking to scale the backbone of an operating system for builders, we'd love to talk with you!

Want to learn about our work culture? Here is a multi-part blog series that will help you see the unique ways our team works (Parts 1, 2, 3, and 4).

About the role

This is a backend-leaning role focused on scaling systems. Billing and fraud will be a focus, but your remit spans every high-throughput system at Railway — workers, queues, event pipelines, and the databases underneath them. You'll own your work end-to-end, including when a feature reaches the UI.

For this role, you will:

  • Architect and scale the pipelines that turn raw usage into accurate, real-time billing — metering, aggregation, rating, and invoicing across millions of events, from ingestion in ClickHouse to the rating engine.

  • Build payment flows that are correct under concurrency and partial failure: idempotent charges, retries, reconciliation, and clean handling of provider edge cases (Stripe and beyond).

  • Develop fraud and abuse detection — signal collection, real-time scoring, automated mitigation — that protects platform margin without getting in legitimate users' way.

  • Scale the systems everything else depends on: Postgres under heavy write load, Node.js services under pressure, and long-running workflows orchestrated with Temporal where exactly-once semantics and durability actually matter.

  • Build TypeScript + GraphQL APIs where correctness and auditability are non-negotiable.

  • Write Engineering Requirement Documents to take something from idea, to defined tasks, to implementation, to monitoring its success and scaling it further.

  • Contribute to our open-source repositories (CLI, Typescript SDK, Railpack, etc.) — Rust experience, or the desire to learn it, helps here.

  • Be oncall from time to time.

Some projects this team takes on:

  • Re-architect billing end-to-end: per-second usage metering at platform scale, idempotent payment processing that survives provider outages without double-charging, and credit, prepayment, and enterprise-invoicing models that hold up under audit.

  • Stand up a fraud-detection service that scores signups and deployments in real time and automatically throttles abuse (crypto mining, free-tier farming, stolen cards).

  • Scale our Temporal workloads to orchestrate workflows across millions of deployments.

  • Build internal tooling that gives teams across Railway a trustworthy, real-time view into the systems they depend on.

This is a high impact, high agency role with direct effect on company culture, trajectory, and outcome.

About you
  • An ability to autonomously lead, design, and implement backend systems where correctness, consistency, and auditability are first-class requirements.

  • A track record of scaling systems — you've taken a pipeline, service, or database that was falling over and made it handle 10x, and you know which tools to reach for (and when polling stops being enough).

  • Deep expertise in Postgres and relational data modeling — you reach for the right consistency guarantees, understand the cost of getting them wrong, and know how Postgres itself behaves at scale.

  • Strong working knowledge of Node.js internals — the event loop, memory behavior, and what to do when a service degrades under load.

  • Experience managing complex asynchronous and long-running backend jobs, ideally with a workflow engine like Temporal, for things like billing runs or payment reconciliation.

  • Familiarity with the realities of money movement: payment providers, idempotency, retries, reconciliation, and their failure modes. Direct billing, payments, or fraud experience is a strong plus.

  • A security and abuse-aware mindset — you instinctively think about how a system can be gamed, and you design accordingly.

  • A desire to be a part of the entire project development process, from research gathering and planning, to implementation and monitoring.

  • Great written and verbal communication skills for expressing ideas, designs, and potential solutions in a mostly-asynchronous manner.

We value and love to work with diverse persons from all backgrounds.

Things to know

For better or worse, we're a startup; our team dynamics are different from companies of different sizes and stages.

  • We're globally distributed—and getting more so. Stuff is always happening somewhere.

  • We don't expect you to be online all the time, but you'll need to be diligent about your boundaries — your end of day will overlap with someone else's start.

  • We're a small, high-ownership team that cares deeply about doing exceptional work. We're scaling quickly, which means we rely on leverage—systems over coordination, judgment over process. Expect ambiguity and a fast-moving environment.

  • You'll own real outcomes. That means making decisions, not just executing—and owning the success, or failure, that comes with them.

Benefits and perks

At Railway, we provide best in class benefits. Great salary, full health benefits including dependents, strong equity grants, equipment stipend, and much more. For more details, check back on the main careers page.

Beyond compensation, there are a few things that we believe make working at Railway truly unique:

  • Autonomy: We have very few meetings. Just a Monday and a Friday to go over the Company Board. We think your time is sacred, whether it's at work, or outside of work.

  • Ownership: We're a company with a high ownership, high autonomy culture. We hope that you'll come in, help us, and over the course of many years do the best work of your life. When we bring you onboard, we expect you to change the company.

  • Novel problems/solutions: We're a startup that's well funded, with cool problems, which lets us implement novel solutions! We abhor "busywork" and think, whether it's community, engineering, operations, etc there's always opportunity for creative and high leverage solutions.

  • Growth: We want you to grow with us, but we know that talent is loaned, so when you figure out what area you want to grow in next, whether it's at Railway or outside, we'll make sure you land there.

How we hire

No tricks. No surprises. Here's the entire process.

1 — Talk with us about the role

This is completely open ended and we're just trying to see who you are, what you want to do, and where you wanna go.

2 — Work on a small project to discuss in the interview

Asynchronously design a system that scales. You choose the domain — pick whichever shows your thinking best:

  • A usage metering and billing pipeline that meters CPU/RAM for millions of workloads and bills accurately (you may depend on third parties such as Stripe), or

  • A stream-processing system that ingests high-cardinality observability events in real time, or

Whatever you pick, come ready to defend the architecture end-to-end. We'll dig into:

  • Polling vs. stream processing, and how you avoid losing data when streaming

  • Correctness under concurrency and partial failure: idempotency, retries, reconciliation, what happens when a step fails halfway through

  • How you handle cardinality, and which tools you lean on and why

  • The scalability of the things you depend on — what happens when Postgres becomes the bottleneck

  • Interview Structure to expect (60 Minutes):

    • Prework (submitted before your interview): Your design

    • 0–5 minutes: Introductions

    • 5–35 minutes: Walking through the design and how you'd extend it — new failure modes, 10x load, a fraud signal

    • 35–50 minutes: Noodling on technology, data modeling, and how you think about scale, money-movement, and abuse

    • 50–60 minutes: Time for you to ask your interviewers questions

You can, and SHOULD! ask us questions ahead of time. Ask away!

3 — Review your solution with the Team

You'll sit down with someone on the team and go over the above. We'll poke into your solution, as well as get you acquainted with two more members of the team.

Looking for: Learn about your problem solving skills. How you break down a problem and how you present a solution.

4 — Meet the Team

You'll meet the Team, which will be comprised of 4 people from vastly different sections of the company.

Looking for: How you work with the rest of the team and communicate.

5 — Chat with CEO

Sit down with our founder and CEO for 30 minutes. This is a 1:1, open ended conversation.

6 — Offer call

Finally, we will present the offers, hammer out the details about your position, tee up onboarding, and start our journey together.

Final Note: The interview goes both ways. Once again, please ask us things. Many things! Hard things. That's what we're here for.

Similar Jobs

Yesterday
In-Office or Remote
United States
Senior level
Senior level
Software
Design, architect, and scale high-throughput backend systems for billing, payments, fraud detection, workers, queues, and data pipelines. Own end-to-end implementation from ingestion (ClickHouse) through rating, invoicing, and reconciliation, build TypeScript/GraphQL APIs, manage long-running workflows (Temporal), contribute to OSS, and participate in on-call rotations.
Top Skills: ClickhouseGraphQLNode.jsPostgresRustStripeTemporalTypescript
An Hour Ago
Remote
108K-144K Annually
Senior level
108K-144K Annually
Senior level
Food • Retail • Agriculture • Manufacturing
Lead and develop the Ontario field sales team to achieve sales, volume, margin, and market-share goals. Oversee distributor and multi-unit customer relationships, execute regional sales strategies and initiatives, manage trade spend and CRM discipline (Salesforce), and collaborate cross-functionally to drive profitable growth and retention across Ontario.
Top Skills: CRMSalesforce
An Hour Ago
Easy Apply
Remote
Easy Apply
55K-75K Annually
Junior
55K-75K Annually
Junior
Big Data • Fintech • Mobile • Payments • Financial Services
Provide frontline technical and operational support to e-commerce merchant partners. Troubleshoot account and configuration issues, collaborate with engineering and cross-functional teams, manage multiple inquiries, build scalable support processes, communicate feature updates, and aggregate merchant feedback to inform product planning.
Top Skills: ShopifyWeb DevelopmentWoocommerce

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account