Goodie Logo

Goodie

Data Scientist

Posted Yesterday
Remote
Hiring Remotely in USA
Mid level
Remote
Hiring Remotely in USA
Mid level
Turn multi-model signals into measurement, forecasts, and optimizations. Clean and label large datasets, extract structure with LLMs/NLP, define product-grade metrics, run experiments, build predictive models, create evals and monitors, and translate insights into product and business decisions.
The summary above was generated by AI

We are:

Goodie helps leading brands win AI search. As billions of people use ChatGPT, Perplexity, Gemini, and other AI systems to discover products and make buying decisions, brands need a way to understand and influence how they’re represented.

Goodie gives teams a full AI control plane: real-time visibility into how AI models speak about their brand and products, how competitors show up, and an optimization engine to improve visibility and performance. This category didn’t exist two years ago - we were early, and we’re defining it.

We’re backed by strong investors, trusted by category-leading customers, and scaling fast. We’re hiring curious, ambitious builders to help shape the future of AI search.

We are looking for:

Goodie AI is searching for a talented and ambitious Data Scientist to join our growing team! Goodie helps brands win visibility and revenue across AI search, LLMs, and agentic commerce. You will be the point person turning messy multi-model signals into measurement, forecasts, and optimizations that our product can act on. If you enjoy building models that ship and change customer behavior, you will like this seat.

You’ll do:

  • Work with large datasets. Own efficient querying, cleaning, labeling, and taxonomy alignment for brands, SKUs, and categories.

  • Design sampling and classification strategies that turn noisy LLM outputs and crawler logs into reliable brand and product insights.

  • Use LLMs and NLP to extract structure from unstructured text at scale. Topics include query fan-out, sentiment, citation extraction, and entity linking for brands, products, and creators.

  • Define product-grade metrics. Create durable definitions for visibility score, answer coverage, product presence, and agentic checkout readiness.

  • Build and run experimentation frameworks. A/B tests, holdouts, counterfactuals, and uplift modeling to quantify impact on citations, share of voice, and conversions.

  • Develop and refine predictive models that analyze and forecast AI search behavior across models and surfaces.

  • Translate complex findings into clear decisions. Partner with the founding team to inform roadmap, pricing, and customer playbooks.

  • Create evaluation harnesses. Establish automatic evals and human-in-the-loop labeling for model quality, bias, and drift across LLM providers.

  • Detect anomalies. Build monitors for crawler behavior, rankings, and feed health to catch regressions before customers do.

You have:

  • 3 to 7 years in applied analytics or data science within tech, marketing, or ads. Startup or high-growth experience preferred.

  • Strong Python and SQL. Comfortable in notebooks and in code reviews.

  • Skilled with sampling and inference. Stratified sampling, bootstrapping, extrapolation, reweighting, and variance estimation.

  • Solid ML toolkit. Time series, classification, regression, weak supervision, and methods to estimate event frequency from partial observations.

  • Practical LLM knowledge. Strengths in prompt design, structured extraction, embeddings, and an understanding of model limits and failure modes.

  • Curious and current on multi-modal and LLM research. You enjoy reading papers and pressure testing ideas in real data.

  • Builder mindset in a fast team. You value clarity, speed, and ownership.

Nice to have:

  • Experience with large-scale information extraction or search quality

  • Background in causal inference, MMM, or attribution models

  • Hands-on work with product feeds and retail catalogs

  • Contributions to open source or published work we can read

  • Deployed side projects we can click through

Our data and modeling canvas:

  • Problems: AI search measurement, AEO scoring, agentic commerce readiness, product catalog and feed integrity, ranking and citation shifts, attribution for AI traffic

  • Signals: LLM responses, crawler and agent logs, SERP and AI answer snapshots, product feeds, marketplace metadata, GA4 and GSC connectors, CRM data

  • Targets: Share of voice, citation count, answer coverage, SKU presence, conversion lift, time-to-value for optimizations

Tech stack you will touch:

  • Languages: Python, SQL

  • Libraries: pandas, NumPy, scikit-learn, PyTorch or TensorFlow, Hugging Face, spaCy.

  • Data: Postgres or AlloyDB, BigQuery, dbt, DuckDB for local work

  • Production ML/MLOps: model serving (FastAPI/Flyte/Batch jobs), CI/CD, versioning, experiment tracking (MLflow/Weights & Biases), monitoring & alerting for performance/drift.

  • Cloud & data tooling: AWS/GCP/Azure, containers (Docker).

  • Models and providers: OpenAI, Anthropic, Google, Meta, Mistral, Perplexity, together with internal eval harnesses

BEWARE OF FRAUD! Please be aware of potentially fraudulent job postings or suspicious activity by persons that are posing as Goodie team members, recruiters, and HR employees. Our team will contact you regarding job opportunities from email addresses ending in @nogood.io or @higoodie.com. Additionally, we do utilize our ATS- Ashby- to help us schedule initial screening calls. Job seeking is hard- we’re sorry that scammers have added this element to your search for something new. Stay vigilant out there!

Similar Jobs

3 Days Ago
In-Office or Remote
195K-258K Annually
Senior level
195K-258K Annually
Senior level
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
Partner with Digital Assets, Finance, Treasury, and Risk to design measurement frameworks, perform strategic analysis and modeling, build scalable SQL-driven automation and dashboards, analyze on-chain and finance data, create visualizations, and drive data-informed product decisions for USDC and related digital asset offerings.
Top Skills: BlockchainPythonRSQL
7 Days Ago
In-Office or Remote
195K-258K Annually
Expert/Leader
195K-258K Annually
Expert/Leader
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
Lead payments-focused data science work: build foundational datasets, metrics, and scalable analytics; analyze customer and transaction behavior (including onchain data); conduct strategic analyses to identify product and revenue opportunities; build reporting, dashboards, and automation; and influence product and business strategy through data storytelling to senior stakeholders.
Top Skills: BlockchainBusiness Intelligence ToolsOnchain Transaction DataPublic Ledger DatasetsPythonRSQL
9 Days Ago
Remote or Hybrid
District of Columbia, USA
159K-246K Annually
Senior level
159K-246K Annually
Senior level
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Lead product analytics for in-vehicle experiences: define KPIs and instrumentation, perform deep-dive behavioral analyses, build dashboards and self-serve analytics, guide experimentation and causal inference, and mentor teams to scale analytics practices across the Vehicle Product organization.
Top Skills: DatabricksLookerPower BIPythonSQL

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account