Alpaca Logo

Alpaca

Senior Data Engineer

Reposted 11 Days Ago
Remote
Hiring Remotely in USA
Senior level
Remote
Hiring Remotely in USA
Senior level
Design and develop the data management layer, focusing on scalability and integration for extensive data processing, while collaborating with various teams.
The summary above was generated by AI

Who We Are:

Alpaca is a US-headquartered self-clearing broker-dealer and brokerage infrastructure for stocks, ETFs, options, crypto, fixed income, 24/5 trading, and more. Our recent Series D funding round brought our total investment to over $320 million, fueling our ambitious vision.

Amongst our subsidiaries, Alpaca is a licensed financial services company, serving hundreds of financial institutions across 40 countries with our institutional-grade APIs. This includes broker-dealers, investment advisors, wealth managers, hedge funds, and crypto exchanges, totalling over 9 million brokerage accounts.

Our global team is a diverse group of experienced engineers, traders, and brokerage professionals who are working to achieve our mission of opening financial services to everyone on the planet. We're deeply committed to open-source contributions and fostering a vibrant community, continuously enhancing our award-winning, developer-friendly API and the robust infrastructure behind it.

Alpaca is proudly backed by top-tier global investors, including Portage Ventures, Spark Capital, Tribe Capital, Social Leverage, Horizons Ventures, Unbound, SBI Group, Derayah Financial, Elefund, and Y Combinator.


Our Team Members:

We're a dynamic team of 380+ globally distributed members who thrive working from our favorite places around the world, with teammates spanning the USA, Canada, Japan, Hungary, Nigeria, Brazil, the UK, and beyond!
We're searching for passionate individuals eager to contribute to Alpaca's rapid growth. If you align with our core values—Stay Curious, Have Empathy, and Be Accountable—and are ready to make a significant impact, we encourage you to apply.

Your Role: We are seeking a Senior Data Platform Engineer to design and develop the data management layer for our platform to ensure its scalability as we expand to larger customers and new jurisdictions. At Alpaca, data engineering encompasses financial transactions, customer data, API logs, system metrics, augmented data, and third-party systems that impact decision-making for both internal and external users. We process hundreds of millions of events daily, with this number growing as we onboard new customers.

We prioritize open-source solutions in our data management approach, leveraging a Google Cloud Platform (GCP) foundation for our data infrastructure. This includes batch/stream ingestion, transformation, and consumption layers for BI, internal use, and external third-party sinks. Additionally, we oversee data experimentation, cataloging, and monitoring and alerting systems.

Our team is 100% distributed and remote.

Responsibilities:

  • Design and oversee key forward- and reverse-ETL patterns to deliver data to relevant stakeholders.
  • Develop scalable patterns in the transformation layer to ensure repeatable integrations with BI tools across various business verticals.
  • Expand and maintain the Alpaca Data Lakehouse architecture's constantly evolving elements.
  • Collaborate closely with sales, marketing, product, and operations teams to address key data flow needs.
  • Operate the system and manage production issues in a timely manner.

Must-Haves:

  • 7+ years of experience in data engineering, including 2+ years of building scalable, low-latency data platforms capable of handling >100M events/day.
  • Proficiency in at least one programming language, with strong working knowledge of Python and SQL.
  • Experience with cloud-native technologies like Docker, Kubernetes, and Helm.
  • Strong hands-on experience with relational database systems and object storage implementations like Apache Iceberg.
  • Strong hands-on experience with Google Cloud Platform and its various data-related services (Composer, Dataproc, Datastream, etc.)
  • Experience in building scalable transformation layers, preferably through formalized SQL models (e.g., dbt).
  • Ability to work in a fast-paced environment and adapt solutions to changing business needs.
  • Experience with ETL orchestrators / frameworks like Apache Airflow and Airbyte.
  • Production experience with streaming systems like Kafka.
  • Exposure to infrastructure, DevOps, and Infrastructure as Code (IaaC), like Terraform.
  • Deep knowledge of distributed systems, storage, transactions, and query processing utilizing open-source distributed query engines like Trino (formerly PrestoSQL).
  • If you're passionate about data engineering and thrive in a dynamic startup environment, we'd love to hear from you! 
How We Take Care of You:
  • Competitive Salary & Stock Options
  • Health Benefits
  • New Hire Home-Office Setup: One-time USD $500
  • Monthly Stipend: USD $150 per month via a Brex Card

Alpaca is proud to be an equal opportunity workplace dedicated to pursuing and hiring a diverse workforce.

Recruitment Privacy Policy

Similar Jobs

3 Days Ago
In-Office or Remote
97K-97K Annually
Senior level
97K-97K Annually
Senior level
Artificial Intelligence • Machine Learning • Database
Lead architecture and implementation of scalable cloud-native data platforms and medallion-tiered pipelines. Optimize ETL/ELT, enforce data governance, security, and CI/CD/IaC. Mentor engineers, drive adoption of managed/serverless cloud services, integrate BI tools, and lead client-facing technical consulting and architectural decisions.
Top Skills: AWSAzureCi/CdData WarehouseDatabricksDbtFivetranGCPInfrastructure As CodeJavaJavaScriptLakehouseLookerMedallion ArchitectureMicrosoft FabricPower BIPythonSnowflakeSQLTableauTypescript
4 Days Ago
Remote
Senior level
Senior level
Artificial Intelligence • Information Technology • Software
Senior Data Engineer responsible for designing and optimizing scalable Snowflake and dbt pipelines to handle billions of rows. Improve performance, storage, query execution, and cost efficiency; migrate/refine legacy processes; troubleshoot bottlenecks; provide technical leadership, mentorship, and documentation.
Top Skills: DbtPythonSnowflakeSQL
4 Days Ago
Remote
Senior level
Senior level
Edtech • Information Technology • Social Impact • Software • Database • Analytics • Generative AI
Design, build, and operate analytics-ready data systems and production data pipelines. Integrate external APIs, run batch ELT into data warehouses, model data for reporting, implement orchestration (Airflow) and transformations (dbt), ensure monitoring and incident response, mentor engineers, and collaborate closely with analytics stakeholders to improve platform scalability, quality, and documentation.
Top Skills: AirflowAPIsBi ToolsCi/CdData Quality ToolingData WarehouseDbtEltGCPGitObservabilityPythonSQLTerraform

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account