Gemini

Principal Data Engineer

Reposted 5 Hours Ago
In-Office
New York, NY, USA
193K-275K Annually
Expert/Leader
Lead technical strategy for data architecture, build and optimize ETL/ELT and streaming pipelines, establish observability and data quality standards, mentor engineers, drive large-scale cross-team designs, and deliver self-serve data products for analytics and ML.
The summary above was generated by AI

About the Company

Gemini is a global crypto and Web3 platform founded by Cameron and Tyler Winklevoss in 2014, offering a wide range of simple, reliable, and secure crypto products and services to individuals and institutions in over 70 countries. Our mission is to unlock the next era of financial, creative, and personal freedom by providing trusted access to the decentralized future. We envision a world where crypto reshapes the global financial system, internet, and money to create greater choice, independence, and opportunity for all — bridging traditional finance with the emerging cryptoeconomy in a way that is more open, fair, and secure. As a publicly traded company, Gemini is poised to accelerate this vision with greater scale, reach, and impact.

The Department: Data

At Gemini, our Data Team is the engine that powers insight, innovation, and trust across the company. We bring together world-class data engineers, platform engineers, machine learning engineers, analytics engineers, and data scientists — all working in harmony to transform raw information into secure, reliable, and actionable intelligence. From building scalable pipelines and platforms, to enabling cutting-edge machine learning, to ensuring governance and cost efficiency, we deliver the foundation for smarter decisions and breakthrough products. We thrive at the intersection of crypto, technology, and finance, and we’re united by a shared mission: to unlock the full potential of Gemini’s data to drive growth, efficiency, and customer impact.

The Role: Principal Data Engineer

The Data Engineering Team owns the ingestion and transformation of data from production databases, streams, and external data sources into our data warehouse. As a Principal Data Engineer, you will set the technical direction for how data is modeled, processed, and delivered across the organization. You will partner closely with product, analytics, ML, finance, operations, and engineering teams to move, transform, and model data reliably, with observability, resilience, and agility. You’ll lead by example through design excellence, mentoring, and technical leadership, ensuring our data architecture is scalable, governed, and ready for the next generation of analytics and machine learning at Gemini. 

This is a senior individual contributor role — highly technical, strategic, and cross-functional — where you’ll influence the design of data systems that underpin key decisions and customer-facing products across Gemini.

This role requires working in person twice a week at our New York City, NY office.

Responsibilities:

  • Define and drive the long-term vision for data architecture, modeling, and transformation at Gemini
  • Establish standards for data reliability, observability, and quality across all pipelines and data products using languages and frameworks such as Python, SQL, Spark, Flink, Beam, or equivalents
  • Partner with Staff and Senior Data Engineers, Platform Engineers, and Analytics Engineers to unify how data is produced, stored, and consumed
  • Lead large-scale design initiatives that span multiple teams and systems, ensuring maintainability, performance, and security
  • Partner with data scientists, ML engineers, analysts, and product teams to understand data requirements, define SLAs, and deliver coherent data products that others can self-serve
  • Establish data quality, validation, observability, and monitoring frameworks (data auditing, alerting, anomaly detection, data lineage)
  • Investigate and resolve complex production issues: root cause analysis, performance bottlenecks, data integrity, fault tolerance
  • Mentor and guide junior and mid-level data engineers: lead code reviews and design reviews, and promote best practices
  • Help recruit and onboard new talent, shaping the future of Gemini’s data engineering discipline
  • Stay up to date on new tools, technologies, and patterns in the data and cloud space, bringing proposals and proof-of-concepts when appropriate
  • Document data flows, data dictionaries, architecture patterns, and operational runbooks

Minimum Qualifications:

  • 10+ years of experience in data engineering (or similar) roles
  • Strong experience in ETL/ELT pipeline design, implementation, and optimization
  • Deep expertise in Python and SQL, with a track record of writing production-quality, maintainable, testable code
  • Experience with large-scale data warehouses (e.g. Databricks, BigQuery, Snowflake)
  • Solid grounding in software engineering fundamentals, data structures, and systems thinking
  • Hands-on experience in data modeling (dimensional modeling, normalization, schema design)
  • Experience building systems with real-time or streaming data (e.g. Kafka, Kinesis, Flink, Spark Streaming), and familiarity with CDC frameworks
  • Experience with orchestration / workflow frameworks (e.g. Airflow)
  • Familiarity with data governance, lineage, metadata, cataloging, and data quality practices
  • Strong cross-functional communication skills; ability to translate between technical and non-technical stakeholders
  • Proven experience in recruiting, mentoring, leading design discussions, and influencing data-engineering best practices across teams

Preferred Qualifications:

  • Experience with crypto, financial services, trading, markets, or exchange systems
  • Experience with blockchain, crypto, and Web3 data (e.g. blocks, transactions, contract calls, token transfers, UTXO/account models, on-chain indexing, chain APIs)
  • Experience with infrastructure as code, containerization, and CI/CD pipelines
  • Hands-on experience managing and optimizing Databricks on AWS

It Pays to Work Here

The compensation & benefits package for this role includes:
  • Competitive starting salary
  • A discretionary annual bonus
  • Long-term incentive in the form of a new hire equity grant
  • Comprehensive health plans
  • 401(k) with company matching
  • Paid Parental Leave
  • Flexible time off

Salary Range: The base salary range for this role is between $192,500 and $275,000 in the State of New York, the State of California and the State of Washington. This range is not inclusive of our discretionary bonus or equity package. When determining a candidate’s compensation, we consider a number of factors including skillset, experience, job scope, and current market data.

In the United States, we offer a hybrid work approach at our hub offices, balancing the benefits of in-person collaboration with the flexibility of remote work. Expectations may vary by location and role, so candidates are encouraged to connect with their recruiter to learn more about the specific policy for the role. Employees who do not live near one of our hubs are part of our remote workforce.

At Gemini, we strive to build diverse teams that reflect the people we want to empower through our products, and we are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status. Equal Opportunity is the Law, and Gemini is proud to be an equal opportunity workplace. If you have a specific need that requires accommodation, please let a member of the People Team know.

HQ

Gemini New York, New York, USA Office

New York, NY, United States, 10010
