27 Big Data Startups in NYC You Need to Know

Meet the companies deriving data-informed insights for a wide array of industries in NYC.

Written by Emerson Dameron
Published on Feb. 28, 2024
27 Big Data Startups in NYC You Need to Know
​ Image: Shutterstock ​

When you think about big New York City industries, real estate or Wall Street might come to mind, but there’s a thriving big data startup ecosystem in the Big Apple. Most businesses proliferate data, but many companies just don’t have time to sift through all the numbers, much less turn it all into a plan of action. That’s where these big data startups come in. Meet the companies deriving data-informed insights for a wide range of industries.

Top Data Science Companies in NYC to Know

  • Garner Health
  • Cherre Reonomy
  • Howl
  • CB Insights
  • mParticle
  • Looker
  • Integral Ad Science
  • Flowcode
  • SPEEDA Edge


Caden allows consumers to take control of and earn money for their personal data. At the same time, the company brings enterprises products and solutions that let them benefit from consumer behavioral data while still preserving consumer privacy. Caden says it collects upwards of 100 million data points each day that cover everything from media consumption to financial transactions.


Enigma is a big data company that gathers business intelligence on small and medium-sized businesses. It uses AI and machine learning to bring together huge quantities of disparate data, ultimately returning analysis for a wide range of business concerns, from B2B marketing to financial services compliance and underwriting for loans. 


ActionIQ is a marketing tech company committed to breaking down data silos so brands can unlock revenue streams and improve customer experiences. Its customer data platform features a composable architecture, which enables marketing teams to directly access and leverage data from warehouses like Databricks, Amazon Redshift, Google BigQuery, Snowflake or Teradata throughout the customer lifecycle.


Thousands of organizations use dbt Labs’ SQL-first transformation workflow to make data development faster and more collaborative. The company says it aims “to empower data practitioners to create and disseminate organizational knowledge.” Its customers have included notable companies like HubSpot, JetBlue and Condé Nast.


Celonis offers process mining technology that’s been used by more than 14,000 companies to sift through large amounts of internal data on business processes and produce intelligence on potential opportunities for improvement. Its solutions have helped customers make changes that have led to faster order fulfillment times, fewer errors in payment collections and other positive business outcomes.


Garner Health offers an advanced search tool to connect patients with medical providers. Using data analytics and over 300 million patient records and medical claims to populate its search tools, Garner Health provides users with detailed insights on every suggested provider, such as locality, quality of care and pricing. The company aims to deliver targeted recommendations based on a prospective patient’s self-reported medical needs. Garner Health is using its data analytics capabilities to ensure patients can find quality care that’s within their budget. It’s also helping to cut down on costs for businesses that offer Garner’s services as part of their employee benefits package. 


SPEEDA Edge has built a comprehensive business intelligence platform with the latest insights. Through emails and articles, SPEEDA Edge provides detailed coverage of over 60 emerging industries. The company also tracks market trends and relays critical information about rising companies in each industry. Customers now gain a holistic perspective of their industries, competitors, and market performances. As a result, companies can compile the most up-to-date information and prepare for future developments with the platform of SPEEDA Edge. 


Flowcode equips businesses with QR codes that are fast, secure and scannable from a distance. Flowcode also enables users to create landing pages and track when consumers navigate to their sites via QR codes. Organizations can now leverage stronger connections between offline audiences and online tools. Flowcode’s QR codes fit both mobile and TV screens, allowing companies to reach more consumers. Plus, analytics tools enable businesses to conduct A/B testing and collect demographic information. With this data, companies can wield Flowcode technology to create more customized content. 


Prescriptive Data is an AI company developing an advanced operating system called Nantum OS that connects to building systems, real-time occupancy and any IoT device. Prescriptive Data’s algorithms enable automated building operations which leads to reduced energy use, utility costs and carbon emissions. Prescriptive’s technology, Nantum OS, analyzes data from building systems as well as third-party sources to suggest adjustments to building operations that enhance both building efficiency and tenant comfort. 


Ribbon Health is creating a world where navigating healthcare is simple through their comprehensive data platform and API layer. In addition to helping users find care options, the platform offers a care provider directory, insurance enrollment support and referral management. Ribbon Health’s platform provides continuously validated data on insurance plans, doctors, costs and care quality.


Knotch is a one-stop shop that combines measurement, competitor research and analysis of branded content. Their product, “The Intelligence Suite for Brands” is comprised of three products: Measurement, Knowledge and Wisdom. Knotch uses these three products to provide marketers with integrated, independent and transparent data. Through the use of simple but beautiful design, Knotch has created an enjoyable way for users to respond to branded content and for marketers to collect valuable sentiment data. Knotch bears some pretty impressive clients, including GE, Warner Brothers, 20th Century Fox and Heineken.


Working in the world of commercial real estate, Reonomy gathers information from an array of sources and proprietary validation algorithms, performing hardcore quantitative analyses with unprecedented speed. Reonomy allows lenders and investors to explore every mix-use building, condo and vacant lot in the city without leaving the office. 


Howl uses machine learning and big data to connect consumers with all of the content created about that product to make for a deeper, more-knowledgeable online buying process. Have you ever searched “best mattress to buy” or “what is the best TV?” Howl might have played a part in your search. The company’s big data connects consumers with fresh, updated content on a product to assist them in making quality decisions. Macy’s, Ulta and Nordstrom all use the company’s platform to reach more customers and provide them with a holistic shopping process.   


Veeva synthesizes big data to provide marketing analytics, measurement and optimization solutions to the healthcare industry. Veeva works across a wide variety of healthcare systems like pharmacies and hospitals to provide quality marketing consultation. The company offers a large suite of marketing solutions across different media and advertising platforms.


The leading venture capital database in New York and perhaps the world, CB Insights gathers and analyzes data for those working in private equity, venture capital, corporate strategy, entrepreneurship, and beyond. CB Insights has become so influential in the venture capital world and the business media that it’s hard to remember it used to answer to ChubbyBrain.


Yext’s mission is to help businesses and organizations around the world deliver official answers everywhere people search. Yext’s Search Experience Cloud organizes a business or organization’s facts in a knowledge graph so that it can provide official answers to questions across approximately 175 digital services, including Google, Siri, Alexa and Meta, as well as the business’s own website.


“Created by marketers, loved by engineers,” mParticle helps mobile developers and enterprise clients figure out what to do with all of their data. It collects customer information with a single API and provides robust options for analysis, segmentation, and allocation. There is a booming demand to help companies make sense of the seemingly chaotic mobile ecosystem. That’s why some of the world’s leading consumer brands (Spotify, NBCUniversal, Airbnb) choose mParticle to create their customer data pipelines in real time. 


Integral Ad Science walks global brands through the ever-shifting landscape of online advertising. This adtech firm offers unique suites of tools for marketers, traders and sellers, while employing leading academics from machine learning and data science backgrounds. It publishes free white papers and case studies online that give a rough idea of how detailed its work can get.


Cherre is one of the most useful tools for analyzing real estate data and making smarter investments in the housing and industrial markets. The platform acts as a hub that connects all real estate data feeds and develops insights that impact entire organizations. Additionally, Cherre’s CorePredict product allows real-time sourcing of real estate opportunities.


Unacast is a location data platform and creator of the Real World Graph®. The data collected by Unacast is beneficial to marketing platforms and companies by improving targeting, monetizing location data, generating analytics and measuring attribution. Unacast provides companies with insight into how consumers are connected to one another and the world.


Knewton builds PaaS tech for adaptive learning and personalizes online educational content for individual students. It analyzes the user’s history and finds what has worked in similar cases, giving educators the opportunity to customize lessons and pave the most efficient route to improvement. Knewton has tailored its recommendations for more than 10 million students. It gets sharper with each new data point, because what is true today may not true tomorrow. 


Looker’s analytics software provides transparency into an organization’s big data. The company’s platform makes cross-departmental sharing of data easy, helping everyone make better data-driven business decisions. Headquartered in Santa Cruz, Looker sports offices in New York, San Francisco, London and Dublin. Looker provides solutions for an array of industries and departments, allowing even non-technical users to benefit from data insights. The software is used by such big names as Amazon, Sony, The Economist, Lyft and Koehler.


Didit takes a philosophical approach to online marketing and SEO. Rather than flushing money into various channels and repackaging strategies that worked for others, it helps small businesses with limited budgets inject marketing and self-awareness into everything they do. With a friendly, humanistic approach, Didit lives by its own ideas. Its strength lies not just in numbers, but in how it crunches them differently in different cases. Those who fear (or hope) that big data will standardize and automate everything may have a lot to learn from its approach.


ListenFirst helps brands know themselves. It tracks 10,000 brands across channels, including sales, social, media impact and digital advertising, and works with a wide array of clients in entertainment, healthcare and consumer goods. The number crunchers at ListenFirst show how the engagement and listening skills that help Victoria Beckham win fashion week can work across disparate industries.


Drawing data from meters, sensors and complex system integration, the Enertiv platform gives real estate clients real-time information on where all that energy is going. Enertiv provides one-stop shopping for real-time energy management, measurement, verification, savings opportunities and even tenant billing. Its sophisticated algorithms spot patterns over time to help customers use energy efficiently. 


StatSocial provides social data to media brands, helping modern publishers find their specific audiences and understand what they care about. Its capabilities are perhaps best illustrated by its consumer taste maps, such as the #BeerMap. Gone are they days when one newspaper editor or talk show host addressed an entire geographical community. Today’s media brands must find increasingly specific niches and meet their audiences on their own turf. Based in the world’s media capital, StatSocial helps upstart media mavens better finesse their existing followers as they court new ones.

This article was originally published in 2016. Rose Velazquez and Margo Steines contributed reporting to this story.

Hiring Now
Basis Technologies
AdTech • Software