Fusemachines Logo

Fusemachines

Senior Data Engineer

Reposted 10 Days Ago
In-Office
New York, NY, USA
Senior level
In-Office
New York, NY, USA
Senior level
Fusemachines is seeking experienced Senior Data Engineers to design and optimize data systems and develop cloud-native data architectures using modern technologies.
The summary above was generated by AI

About Fusemachines

Founded in 2013, Fusemachines is a global provider of enterprise AI products and services, on a mission to democratize AI. Leveraging proprietary AI Studio and AI Engines, the company helps drive the clients’ AI Enterprise Transformation, regardless of where they are in their Digital AI journeys. With offices in North America, Asia, and Latin America, Fusemachines provides a suite of enterprise AI offerings and specialty services that allow organizations of any size to implement and scale AI. Fusemachines serves companies in industries such as retail,  manufacturing, and government.
Fusemachines continues to actively pursue the mission of democratizing AI for the masses by providing high-quality AI education in underserved communities and helping organizations achieve their full potential with AI.

Senior Data Engineer

Are you an experienced Data Engineering professional with a passion for building scalable, reliable, and high-performance data systems? Do you have hands-on experience designing and optimizing end-to-end real-time and batch pipelines, and developing cloud-native data architectures using modern technologies such as AWS, GCP, Azure, Databricks, and Snowflake?

We are looking for a Senior Data Engineer to architect, design, and implement scalable, high-performance data solutions. The ideal candidate will be an expert in at least one major cloud data ecosystem (AWS, Azure, GCP, Snowflake, or Databricks) and possess a deep understanding of the end-to-end data lifecycle, from ingestion to business intelligence.
Qualification & Skill Set Requirements
Core Technical Competencies
Experience: 5+ years of hands-on data engineering experience in a production environment.
Languages: Strong proficiency in Python, SQL (complex queries, performance tuning), and PySpark/Apache Spark.
Data Modeling: Expert knowledge of data modeling (3NF, Star, Snowflake Schema) and Lakehouse/Warehouse architectures.
ETL/ELT & Orchestration: Proven experience building pipelines using tools like dbt, Airflow, Dagster, or native cloud orchestrators (Glue, Data Factory, Composer).
Integrations: Experienced in integrating data from diverse sources: APIs, RDBMS/NoSQL databases, flat files, and streaming platforms (Kafka, Kinesis, Pub/Sub).
Cloud Platform Expertise (Specialization-Specific)
Candidates should demonstrate deep expertise in anyone of the following:
Snowflake: SnowSQL, Streams, Tasks, Snowpark, and cost optimization.
Databricks: Delta Lake, Unity Catalog, Delta Live Tables (DLT), and Spark optimization.
GCP: BigQuery, Dataflow, Dataproc, Pub/Sub, and Cloud Functions.
Azure: Synapse Analytics, Data Factory, Azure Databricks, and Stream Analytics.
AWS: Redshift, S3, Lake Formation, Glue, and Lambda.
Professional Practices
SDLC & DevOps: Proficient in Git workflows, CI/CD pipelines (GitHub Actions, Azure DevOps, AWS CodePipeline), and IaC (Terraform/CloudFormation).
Data Governance: Strong understanding of data quality, lineage, observability, security (RBAC, encryption), and compliance frameworks.
Agile: Active experience in Agile/Scrum environments using Jira or Azure Boards.
Mentorship: Ability to lead projects and provide technical guidance to junior/mid-level engineers.
Responsibilities
Architecture: Architect, design, and implement scalable, reliable data solutions and pipelines aligned with business analytics needs.
Optimization: Manage and fine-tune cloud resources and workloads for maximum performance, reliability, and cost-efficiency.
Data Transformation: Lead the development of ETL/ELT processes for both batch and real-time data processing.
Collaboration: Partner with Product, Engineering, and Data Science teams to deliver effective, data-driven solutions.
Governance & Quality: Promote and enforce best practices in data governance, security, and data quality frameworks.
Mentorship: Provide technical leadership and mentorship to the team, ensuring architecture quality and best practices.
Documentation: Maintain comprehensive documentation of data architectures, configurations, and workflows.
Fusemachines is an Equal Opportunities Employer, committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or any other characteristic protected by applicable federal, state, or local laws.

 

Important: Immigration Sponsorship Policy

Fusemachines is unable to proceed with candidates who require any form of work authorization or immigration support from the company. This restriction applies to all types of support, including:

  • Direct Company Sponsorship: Such as H-1B, J-1, or TN visas.
  • Employer of Record: Listing Fusemachines as the immigration employer on any government documentation.
  • Written Documentation: Providing letters or other support for any work authorization (e.g., OPT, STEM OPT, CPT).
HQ

Fusemachines New York, New York, USA Office

500 7th Avenue, New York, NY, United States, 10018

Similar Jobs

5 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
120K-201K Annually
Senior level
120K-201K Annually
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Design, build, and maintain scalable Spark-based ETL pipelines and computed tables in a central data lake. Integrate structured and unstructured IoT, sensor, and external data for analytics, model training, and dashboards. Collaborate with Data Science, Analytics, and ML teams to ensure reliable, high-quality customer-facing datasets.
Top Skills: AirflowAWSAzureDagsterData LakeDatabricksDelta LakeETLGCPGitGitPrefectPysparkPythonRest ApisSparksqlSQL
6 Days Ago
In-Office
New York City, NY, USA
180K-225K Annually
Senior level
180K-225K Annually
Senior level
Consumer Web • Healthtech • Professional Services • Social Impact • Software
Design, build, and maintain scalable data pipelines and platform infrastructure. Define data governance, monitoring, and quality standards. Partner cross-functionally with Product, Engineering, Analytics, and Data Science. Drive technical decisions, performance optimization, and infrastructure-as-code for a growing data platform supporting patients, providers, and business teams.
Top Skills: AirflowDbtFivetranPythonSnowflakeSQLTerraform
7 Days Ago
Easy Apply
Remote or Hybrid
New York, NY, USA
Easy Apply
155K-220K Annually
Senior level
155K-220K Annually
Senior level
Fintech • HR Tech
Build and deliver end-to-end, production-grade data solutions: design and maintain scalable ETL pipelines, ingest from diverse sources, implement dbt transformations, ensure data quality and observability, optimize performance and cost, and partner with analytics, product, and engineering teams to drive business impact.
Top Skills: AIAlertingAPIsAutomated TestingAutomationBigQueryCi/CdData ObservabilityDatabricksDbtETLEvent StreamsJavaMonitoringPythonRedshiftScalaSnowflakeSQL

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account