Applico Capital Logo

Applico Capital

Data Engineer

Reposted 16 Days Ago
In-Office
New York, NY, USA
3-6 Annually
Mid level
In-Office
New York, NY, USA
3-6 Annually
Mid level
As a Data Engineer, design and maintain data pipelines, ensure data quality, and support AI workflows using modern tools and cloud environments.
The summary above was generated by AI
About Applico Capital

Applico Capital is the leading venture capital firm focused on the $8 trillion B2B distribution industry. Through our learnings and understanding of the industry, we are building a tech startup, currently in stealth, to solve the industry's biggest problems as it comes to unlocking AI-enabled synergies.

Our mandate is to leverage AI and modern technologies to reimagine the role of the traditional distributor and transform how the entire industry operates.

We are looking for highly technical builders who thrive in entrepreneurial, scrappy, and collaborative environments.

About the Role:

We are looking for a Data Engineer to create the infrastructure, automation, and monitoring that make machine learning reliable, repeatable, and scalable. You will enable our AI Scientists and Engineers to move faster, while ensuring compliance, observability, and cost efficiency.

This is a scrappy, hands-on role in a startup-style team where building durable, automated systems is as important as moving quickly. You’ll ensure that ML becomes a dependable part of daily business operations. You will also extend MLOps practices to support agentic AI systems – managing orchestration, monitoring emergent behavior, and ensuring safe, governed use of AI-augmented workflows.

Key Responsibilities
  • Design, build, and maintain data ingestion and transformation pipelines using modern open-source and cloud-native tools
  • Integrate structured and unstructured data from ERP, CRM, PIM, CMS, and third-party sources
  • Develop and manage data models, staging, and warehouse/lakehouse layers
  • Implement data quality, validation, and observability frameworks to ensure reliability
  • Collaborate with the Head of Data Architecture and Full-Stack Data Engineers to define schema standards and ingestion patterns
  • Automate repeatable workflows (e.g., Airbyte, Dagster, Prefect) to reduce manual work and ensure reproducibility
  • Support analytics, reporting, and AI use cases through well-designed, versioned data products
  • Contribute to infrastructure automation and CI/CD practices for data pipelines
  • Leverage AI tools (LLMs, code generation, enrichment APIs) to accelerate development and improve data coverage

Requirements
  • 3–6 years of professional experience as a Data Engineer, ETL Developer, or Data Platform Engineer
  • Proficiency in Python and SQL for data wrangling, pipeline automation, and transformation
  • Hands-on experience with modern open-source data tooling such as dbt, Airbyte, Meltano, Dagster, or Prefect
  • Familiarity with cloud data environments (AWS, GCP, or Azure) and infrastructure-as-code principles
  • Solid understanding of data modeling, schema design, and relational concepts
  • Experience integrating APIs, flat files, and other external data sources
  • Working knowledge of data quality and observability tools (Great Expectations, Soda, or similar)
  • Exposure to or curiosity about semantic modeling, graph data, and AI enrichment workflows is a plus
  • Comfortable in fast-paced, startup-style environments where iteration, learning, and impact come first
Why Join Us
  • Work on one of the most ambitious AI and data transformations in industrial B2B
  • Build with autonomy in a small, expert team backed by a large, stable business
  • Learn directly from senior data architects and AI engineers
  • Help shape a scalable, open, automation-driven data platform from day one
Preferred Stack
  • Languages: Python, SQL
  • Data Tools: dbt, Airbyte/Meltano, Dagster, Prefect, DuckDB, Delta Lake, Postgres
  • Cloud & Infra: AWS or GCP, Terraform, Docker, GitHub Actions
  • Data Governance: Great Expectations, OpenLineage, Soda
  • APIs & Services: FastAPI, GraphQL
  • AI/Automation (Optional): LangChain, LangGraph, OpenAI APIs, n8n

Top Skills

Airbyte
AWS
Azure
Dagster
Dbt
Docker
Fastapi
GCP
Github Actions
GraphQL
Great Expectations
Langchain
Langgraph
Meltano
N8N
Openai Apis
Openlineage
Prefect
Python
Soda
SQL
Terraform

Similar Jobs

7 Days Ago
In-Office
New York City, NY, USA
150K-200K Annually
Senior level
150K-200K Annually
Senior level
Fintech • Software
As a Data Engineer, you'll build and maintain data pipelines and infrastructure for an AI-driven research platform, ensuring data quality and reliability while collaborating on architectural decisions.
Top Skills: AirflowAWSDockerGoKubernetesPostgresPythonRustS3SQLTemporal
8 Days Ago
Easy Apply
Remote or Hybrid
USA
Easy Apply
175K-225K Annually
Mid level
175K-225K Annually
Mid level
Fintech • Information Technology • Software • Financial Services
The Data Engineer will build real-time data pipelines for pricing algorithms, collaborate with teams, and contribute to batch data workflows.
Top Skills: Cloud-Based Distributed Data InfrastructureFlinkKafkaPythonSQL
8 Days Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
102K-154K Annually
Mid level
102K-154K Annually
Mid level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
As a Data Engineer II, you will build and optimize data pipelines in Databricks, ensure high-quality data delivery, and support generative AI applications. You will collaborate with data scientists and AI engineers to provide reliable data infrastructures and meet business needs.
Top Skills: DatabricksDbtGainsightGongOutreachPythonRdsRedshiftSalesforceSnowflakeSparkSQL

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account