Actively AI is defining a new category: Intelligence-Led Revenue.
Revenue organizations have always been bottlenecked on human capacity. Reps triage which accounts get attention. Context disappears at every handoff. On any given day, the vast majority of accounts have exactly zero people thinking about them.
Actively addresses this at the structural level. Our platform deploys Per-Account AgentsTM across our customers’ TAM, working 24/7 to research, identify opportunities, and advance next steps without being asked. Leading enterprises including Ramp, Ironclad, and Samsara are already making this shift.
Our co-founders are former Stanford AI researchers, and the team comes from Harvard, CMU, Berkeley, Brex, Scale AI, and Google. We've raised $68M from TCV, First Harmonic, Bain Capital Ventures, First Round Capital, and more.
We’re looking for a Senior/Staff Data Platform Engineer to build and scale the foundation of Actively’s data ecosystem; the pipelines, transformations, and infrastructure that power every agent, insight, and workflow across the company.
Actively's agents make decisions in real time, which accounts to prioritize, what actions to take, when to involve a human. All of that reasoning runs on data: CRM records, call transcripts, external signals, and customer-specific context pulled from dozens of sources. When that data is stale, malformed, or missing context, the agents get it wrong.
You'll build and scale the data foundation that every agent, insight, and workflow at Actively depends on; designing pipelines that handle diverse, often messy inputs and turn them into clean, structured, agent-ready representations. At scale, that means millions of accounts, each with their own data shapes, business rules, and edge cases, all needing to stay fresh and reliable.
The challenge isn't just throughput. It's building infrastructure that's opinionated enough to enforce quality and consistency, but flexible enough to adapt as new data sources, customer configurations, and agent capabilities keep evolving.
What You’ll Do- Own the ingestion and transformation layer. Design and scale pipelines that pull structured and unstructured data from CRM systems, call transcripts, and external signals, normalizing and enriching it into representations agents can reason over in real time.
- Build for operational use, not just analytics. The data you produce doesn't power dashboards; it powers decisions. Freshness, accuracy, and low-latency access matter here in ways they don't in a typical data warehouse.
- Keep data current as the world changes. Architect real-time and mini-batch workflows using technologies like Pub/Sub, Kafka, or modern ETL tools to ensure data stays synchronized as customer activity happens.
- Solve for customer-specific variation at scale. Every customer has their own CRM configuration, field naming, and business logic. You'll build transformation systems that stay consistent and correct across all of them without becoming brittle.
- Own reliability end to end. Observability, lineage, schema management, alerting; you define what "trust in the data" means and make sure it holds across thousands of accounts, so agents and other teams can confidently build on top of it.
- Work across the full stack. Python, SQL, DBT, BigQuery, Snowflake and move between layers fluidly, contributing wherever the work needs it.
Who You Are
- Deep roots in data systems, not just data tooling. You have 5+ years designing and operating core data infrastructure from ingestion and transformation to serving and observability in high-growth environments where the data needed to be right, fresh, and fast.
- Built for agents and models, not just reports. You've worked on data systems that power ML models, intelligent workflows, or real-time decisioning. You understand the different demands that put on infrastructure compared to a typical analytics stack.
- Fluent across the modern data stack. Proficient in Python, SQL, and DBT, with hands-on experience in BigQuery or Snowflake, and familiar with orchestration tools like Fivetran, Airflow, or Polytomic.
- Fluent in real-time infrastructure. You've built streaming and mini-batch pipelines using Pub/Sub, Kafka, Dataflow, or similar technologies, and understand the trade-offs between latency, throughput, and operational complexity.
- Startup-proven or product-platform experience. You've either built a data platform from scratch at an early-stage company or worked at a data-focused product company (e.g. Segment, dbt Labs) scaling systems across many customers.
- Self-directed and accountable for quality. You take work from design to production without being managed through it, and you hold yourself responsible for whether the data your systems produce is actually trustworthy.
- Prior experience at a data infrastructure or platform company (e.g. Segment, Databricks, Confluent, Fivetran) or meaningful contributions to open-source data tooling.
- Familiarity with embedding and vector pipelines like chunking strategies, index management, and keeping representations in sync with fast-changing source data.
- Experience building data pipelines where correctness was a hard requirement like financial data, compliance systems, or other domains where bad data has real downstream consequences.
Actively AI provides an estimate of the compensation for roles that may be hired as required by state regulations. Compensation may vary based on (a) location, as Actively AI factors in specific location when benchmarking compensation for most roles; (b) individual candidate skills and qualifications; and (c) individual candidate experience. Additionally, Actively AI leverages current market data to determine compensation, so posted compensation figures are subject to change as new market data becomes available. The salary, other compensation, and benefits information is accurate as of the date of this posting. Actively.ai reserves the right to modify this information at any time, subject to applicable law.
Actively AI is committed to equal treatment and opportunity in all aspects of recruitment, selection, and employment without regard to gender, race, religion, national origin, ethnicity, disability, gender identity/expression, sexual orientation, veteran or military status, or any other category protected under the law. Actively AI is an equal opportunity employer; committed to a community of inclusion, and an environment free from discrimination, harassment, and retaliation.
At Actively, you write playbooks rather than follow them. You own outcomes, not just tasks, and see directly how your work changes what customers can do — across a product and go-to-market motion that is scaling fast.
The team is high-caliber and low-ego: people who work from first principles, move with urgency, and care deeply about building something that drives real value. If that's the kind of challenge you're looking for, Actively is the right place.
Benefits- 🚀 Competitive Early-Stage Equity
- ⚕ Health, Dental, Vision Coverage
- 💡 Unlimited PTO + Recharge Days
- 🍽️ Catered Lunch on Tuesday & Friday, Dinners every day!
- 🍿 Fully Stocked Kitchen
- 💻 Cutting-Edge Tech & Tools
- 🌴 Annual Off-sites & Monthly Events
- 🚆 Commuter Benefits
- 🏢 Cozy Office in NYC
Actively AI New York, New York, USA Office
30 W 21st St, New York, New York, United States, 10010 6905
Similar Jobs
What you need to know about the NYC Tech Scene
Key Facts About NYC Tech
- Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
- Key Industries: Artificial intelligence, Fintech
- Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
- Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory



