Pinecone

Senior/Staff Software Engineer, Search & Retrieval Infrastructure

Reposted Yesterday
Remote
Hiring Remotely in US
190K-230K Annually
Senior level
Design and build scalable backend components and indexing pipelines for semantic and hybrid retrieval, build retrieval orchestration and knowledge-graph services, improve retrieval quality via evaluation and observability, design APIs, and optimize latency, throughput, cost, reliability, and security for large-scale AI inference and retrieval workloads.
The summary above was generated by AI

About Pinecone

Pinecone is the leading vector database for building accurate and performant AI applications at scale in production. Pinecone's mission is to make AI knowledgeable. More than 9,000 customers across various industries have shipped AI applications faster and more confidently with Pinecone's developer-friendly technology. Pinecone is based in New York and has raised $138M in funding from Andreessen Horowitz, ICONIQ, Menlo Ventures, and Wing Venture Capital.

About the Team and Role:

We are hiring a senior/staff software engineer to help design and build core components of our next-generation knowledge retrieval system built for the AI era: search and retrieval infrastructure that powers high-quality, scalable, enterprise-grade agentic systems. You’ll build the framework that allows our customers to connect knowledge, synthesized from structured and unstructured data, to modern LLM-powered applications, leveraging a world-class vector DB that supports semantic search and hybrid retrieval. This role is ideal for someone who loves backend system architecture, distributed systems, and applied AI infrastructure. It is a high-impact role with significant ownership across architecture, performance, and system reliability.

Responsibilities:

  • Design and build scalable platform components leveraging advanced retrieval via query planning, semantic and hybrid search, metadata-aware search, and LLM generation

  • Design and build optimized indexing pipelines for structured and unstructured data

  • Build backend services for semantic and hybrid retrieval, knowledge graph construction, and retrieval orchestration

  • Improve retrieval quality through evaluation and observability frameworks

  • Design APIs for internal and external user and agentic consumers

  • Optimize latency, throughput, and cost across large-scale inference and retrieval workloads

  • Drive technical direction for reliability and security

What You’ll Bring to the Table:

To thrive in this role, you don't need to check every single box, but you should be deeply passionate about how to turn data into knowledge.

Systems Expertise

  • Architectural Depth: You have a proven track record (typically 6+ years) of shipping production-grade backends for large-scale systems. You don’t just write code; you design for high throughput, low latency, and long-term maintainability.

  • Data Engineering Savvy: You’re comfortable building high-throughput indexing pipelines that handle both the messy world of unstructured data and the rigid world of structured schemas.

AI & Retrieval

  • Retrieval Intuition: You understand that "search" is more than just a keyword match. You have direct experience with (or deep theoretical knowledge of) semantic search, vector databases, hybrid retrieval strategies, or traditional search engines like Elastic or OpenSearch.

  • RAG & Orchestration: You understand the nuances of Retrieval-Augmented Generation (RAG) patterns, from embedding pipelines and hybrid search techniques to how query planning and metadata filtering can make or break an LLM's performance.

Technical

  • Language Fluency: You are an expert in at least one major language like Go, Rust, C++, Java, or Python.

  • Infrastructure: You have hands-on experience with modern infrastructure tools such as Kubernetes, cloud-native architectures, and observability frameworks, as well as infrastructure-as-code tools like Terraform or Pulumi.

Ownership & Impact

  • Product Thinking: You don't just build to spec; you build for the user. You can design clean, intuitive APIs that both human developers and autonomous agents will love.

  • Ambiguity Navigator: You’re comfortable in a high-growth environment. You prefer "owning a problem" over "executing a ticket."

Bonus Points

  • Experience building multi-tenant SaaS platforms.

  • Experience with retrieval evaluation frameworks—knowing how to actually measure "good" search results.

  • Experience with query planning or agentic reasoning loops (e.g., teaching a system how to break down a complex prompt into multiple specific steps).

Perks & Benefits:

  • Comprehensive health coverage including medical, dental, vision, and mental health resources

  • 401(k) Plan

  • Equity award

  • Flexible time off

  • Paid parental leave

  • Annual Company Retreat

  • WFH Equipment Stipend

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, disability, marital status, familial status, sexual orientation, pregnancy, gender identity, gender expression, national origin, ancestry, citizenship status, veteran status, and any other legally protected status under federal, state, or local anti-discrimination laws.

Top Skills

C++
Elastic
Embeddings
Go
Hybrid Retrieval
Java
Knowledge Graph
Kubernetes
LLMs
Observability Frameworks
OpenSearch
Pinecone
Pulumi
Python
RAG
Rust
Semantic Search
Terraform
Vector Databases
HQ

Pinecone New York, New York, USA Office

Nestled in Midtown, steps away from Times Square & Bryant Park. A bustling neighborhood, perfect for professionals seeking a vibrant work environment with easy access to iconic landmarks, dining, and excellent transportation options. Join us in the heart of Manhattan!
