As a Generative AI Engineer at Regard, you’ll work across the full lifecycle of developing and deploying AI-driven features, from ideation and design to prototyping, implementation, evaluation, and iteration. You’ll collaborate closely with product and clinical teams to build systems that transform medical records into structured insights and clinician-ready documentation.
Your work will center on applying modern LLMs to extract, summarize, normalize, and generate clinical information from diverse electronic health record (EHR) data sources. This includes developing robust pipelines, running model and prompt-engineering experiments, integrating models into production services, and ensuring outputs remain factual, safe, and clinically aligned.
You’ll directly contribute to high-priority product initiatives, shape new AI capabilities, and advance our LLM platform. Your work will have a tangible impact on how clinicians understand patient data and how healthcare organizations improve care quality.
About Regard
Our mission is to bring world-class healthcare to everyone. Regard is an AI-powered Proactive Documentation platform that advances how care is delivered by reviewing all patient data in the EHR to recommend diagnoses and surface clinical evidence. Regard drafts a note even before the physician sees the patient, enabling an approach that gets documentation right at the point of care - we call it Proactive Documentation. This improves quality of care, reduces physician burden, and improves hospital finances. We are excited by challenges, mission-oriented work, and meaningful relationships. We work closely with some of the top health systems in the country and are leading the change that healthcare - one of the largest and most inefficient industries in the world - needs. We want you to join us.
Our Tech Stack:
- Frontend: TypeScript, React
- Backend: Python, PostgreSQL, Redis, AWS
- AI: OpenAI, Anthropic, Langfuse
Responsibilities:
- Build and refine LLM-powered systems to extract structured medical concepts, diagnoses, medications, labs, and timelines from unstructured records
- Develop generation pipelines that produce clinically accurate drafts of notes (H&P, progress notes, discharge summaries, etc.) from factual inputs
- Design, prototype, and evaluate prompts, agent workflows, and retrieval-augmented generation (RAG) components
- Benchmark LLM systems to evaluate new models and audit accuracy
- Optimize inference cost, latency, and throughput through batching, caching, and model-selection strategies
Qualifications:
- BS in Computer Science or equivalent experience
- 3+ years of professional experience with software development in one or more programming languages (Python preferred)
- 1+ years of professional experience building generative AI products, such as RAG systems, agents, and chatbots
- Able to participate in on-call operational support for your areas of responsibility
- Able to travel up to 4 weeks a year for company co-working and/or retreat weeks
- Strong verbal and written communication skills
Preferred Qualifications:
- Familiarity with vector databases and embeddings generation
- Experience working on a mature enterprise SaaS technology product
- Exposure to startup and/or high growth environments
Hybrid Work | Location | Work Authorization
- For this role, Regard is currently only considering candidates who are authorized to work in the US without visa sponsorship, and are within the New York City metro area
- We expect our Engineers to be in the office on Tuesdays and Thursdays. We may request more frequent in-office work during the onboarding period
- We will provide relocation assistance to anyone who does not already reside in the NYC metro area
- We prefer hiring people within commuting distance of our NYC office because we value getting together in person regularly
- For those who enjoy working from our Manhattan office on a more regular basis, we offer catered lunches and other fun perks
- Additionally, hybrid employees have the flexibility to work from locations outside of their home office for up to 6 weeks per year
Comp | Perks | Benefits
- Eligible for equity
- 99% employer-paid health benefits (Medical, Dental, and Vision) + One Medical subscription
- 18 PTO days/yr + 1 week holiday break
- Annual $750 learning & development stipend
- Company-sponsored team retreat + social events
- A sabbatical program
Our goal at Regard is to provide and maintain a work environment that fosters mutual respect, professionalism and cooperation. Regard is proud to be an equal opportunity employer that does not discriminate on the basis of actual or perceived race, creed, color, religion, national origin, ancestry, alienage or citizenship status, age, disability or handicap, sex, gender identity, marital status, familial status, veteran status, sexual orientation or any other characteristic protected by applicable federal, state or local laws. We celebrate diversity and are proud of our supportive, inclusive workplace.
All candidates must successfully complete a background check as part of the hiring process.
Regard New York, New York, USA Office
432 Park Avenue South, New York, NY, United States, 10016-8010


