Top AI & Machine Learning Jobs in New York City, NY

Yesterday
Easy Apply
Hybrid
New York, NY, USA
182K-220K Annually
Senior level
Healthtech • Pharmaceutical • Telehealth
Own evaluation, measurement, and optimization of production LLM-powered features. Design reproducible evaluation frameworks, run experiments and analyses to identify failure modes and regressions, build metrics and dashboards, partner with engineering to productionize improvements, and mentor teammates on experimental design and measurement best practices.
Top Skills: Braintrust, Causal Inference, Experimentation Platforms, LangSmith, LLMs, OpenAI Evals, Python, SQL
Yesterday
Easy Apply
Hybrid
New York, NY, USA
150K-184K Annually
Mid level
Healthtech • Pharmaceutical • Telehealth
Design and run evaluations for LLM-powered features: build datasets and rubrics, analyze production logs for failures, run experiments, track product/operational metrics, and partner with engineers to productionize improvements and monitoring dashboards.
Top Skills: A/B Testing Frameworks, Dashboards, Evaluation Tooling, LLMs, Model Monitoring, Prompting, Python, Retrieval, SQL
Yesterday
Easy Apply
Hybrid
New York, NY, USA
150K-184K Annually
Mid level
Healthtech • Pharmaceutical • Telehealth
Build and ship LLM-powered features across patient and clinical workflows. Partner with clinical operators, prototype AI workflows (prompts, agents, RAG), write Python/SQL to connect AI systems, define success metrics, run evaluations, and iterate based on user feedback.
Top Skills: Agent Workflows, EHR Systems, Large Language Models (LLMs), Orchestration Frameworks, Prompting, Python, Retrieval-Augmented Generation (RAG), SQL
Yesterday
Easy Apply
Hybrid
New York, NY, USA
182K-220K Annually
Senior level
Healthtech • Pharmaceutical • Telehealth
Embed with clinical and operations teams to identify high-value AI opportunities; build LLM-powered features end-to-end including prompts, orchestration, Python services, evaluation suites, golden datasets, and metrics-driven experiments to measure product impact and prioritize work.
Top Skills: APIs, Evaluation Suites, LLMs, Model Orchestration, Prompt Engineering, Python
Yesterday
Easy Apply
Hybrid
New York, NY, USA
182K-220K Annually
Senior level
Healthtech • Pharmaceutical • Telehealth
Build and ship LLM-powered production features end-to-end: prompt orchestration, tool calling, retrieval/RAG, embeddings, evaluation infrastructure, observability, safety guardrails, and integrations. Collaborate with clinical teams, prototype quickly, measure outcomes, and raise engineering standards through mentorship and technical leadership.
Top Skills: AI APIs and Frameworks, Backend Services, LLMs, Python
Yesterday
Easy Apply
Hybrid
New York, NY, USA
150K-184K Annually
Mid level
Healthtech • Pharmaceutical • Telehealth
Build and ship production LLM-powered features across patient-facing products, clinical workflows, and internal tools. Implement AI application layers (prompt orchestration, tool calling, RAG, agent workflows), connect LLMs to internal APIs and systems, and develop evaluation suites and observability for model quality. Prototype rapidly, collaborate with clinical teams, and contribute to AI infrastructure and engineering best practices.
Top Skills: AgentCore, APIs, Arize Phoenix, Braintrust, EHR, LangChain, LangGraph, LangSmith, LLMs, Python, RAG, Vector Databases