Top Tech Jobs & Startup Jobs in NYC, NY

Reposted 2 Hours AgoSaved
Remote
United States
167K-286K Annually
Senior level
167K-286K Annually
Senior level
Artificial Intelligence • Software
The Backend Engineer will build a multi-cloud inference platform and manage distributed systems for AI model deployment while ensuring operational excellence and low toil services.
Top Skills: AWSAzureBackend EngineeringCloud ProvidersGCPGoKubernetes
Reposted 2 Hours AgoSaved
Remote
United States
135K-242K Annually
Mid level
135K-242K Annually
Mid level
Artificial Intelligence • Software
The Mojo Compiler Engineer will design and implement features for the Mojo language within an MLIR-based compiler, optimize performance for AI systems, and enhance the developer experience.
Top Skills: C++LlvmMlirPython
Reposted 2 Hours AgoSaved
Remote
United States
270K-375K Annually
Expert/Leader
270K-375K Annually
Expert/Leader
Artificial Intelligence • Software
As Head of Hardware Partnerships, you will define and scale Modular's hardware ecosystem strategy by managing relationships with hardware vendors, developing partnership models, and supporting partnership economics.
Top Skills: Ai InfrastructureAsicsGpusMl SystemsTpus
Reposted 2 Hours AgoSaved
In-Office or Remote
United States
167K-242K Annually
Senior level
167K-242K Annually
Senior level
Artificial Intelligence • Software
As a GenAI Systems Engineer, you will architect scalable frameworks, optimize distributed systems, and develop APIs, enhancing AI model deployment efficiency.
Top Skills: AsyncioDaskKubernetesMicroservicesPythonSpark
Reposted 2 Hours AgoSaved
Remote
United States
167K-273K Annually
Senior level
167K-273K Annually
Senior level
Artificial Intelligence • Software
The Cloud Inference Engineer is responsible for building and maintaining a scalable LLM inference platform, focusing on distributed systems and machine learning technologies while collaborating across teams to optimize performance.
Top Skills: GoHigh Performance ComputingKubernetesMachine Learning
New

Cut your apply time in half.

Use ourAI Assistantto automatically fill your job applications.

Use For Free
Application Tracker Preview
Reposted 2 Hours AgoSaved
Remote
United States
234K-286K Annually
Expert/Leader
234K-286K Annually
Expert/Leader
Artificial Intelligence • Software
Drive the product vision for Modular's cloud compute platform, collaborating with teams to enhance AI service capabilities and customer engagement.
Top Skills: CudaKubernetesPyTorchSagemakerVertex AiVllm
Reposted 2 Hours AgoSaved
Remote
United States
198K-286K Annually
Senior level
198K-286K Annually
Senior level
Artificial Intelligence • Software
As a Senior AI Kernel Engineer, you will design and optimize high-performance kernels for AI inference on GPUs and custom accelerators, improving performance and collaborating with various teams to influence architecture and implementation.
Top Skills: C/C++CudaGpu ArchitectureHipPtx
Reposted 2 Hours AgoSaved
In-Office or Remote
United States
198K-242K Annually
Senior level
198K-242K Annually
Senior level
Artificial Intelligence • Software
The role involves optimizing hardware support within a software stack for AI model deployment, collaborating with teams and hardware vendors to enhance performance and portability across platforms.
Top Skills: C++CudaMojoOpenclSycl
2 Days AgoSaved
Remote
United States
229K-286K Annually
Senior level
229K-286K Annually
Senior level
Artificial Intelligence • Software
Lead a team building an inference optimization platform for LLMs across GPUs and ASICs. Define technical direction, partner with GTM and engineering to tune customer workloads, create scalable optimization systems, and grow a high-impact performance engineering team.
2 Days AgoSaved
Remote
United States
194K-286K Annually
Senior level
194K-286K Annually
Senior level
Artificial Intelligence • Software
Profile customer LLM inference workloads end-to-end and apply optimizations across GPU kernels, inference engine, and distributed systems. Build tooling and a repeatable automated optimization platform, partner with GTM and engineering to deploy tuned inference in production, and publish technical insights to shape industry best practices.
Top Skills: AsicCloud NativeDistributed InferenceDistributed SystemsGpuGpu Kernel ProgrammingInference EngineKubernetesLlm Architectures
All Filters
JobType
New Jobs
Job Category
Experience
Industry
Company Name
Company Size

Sign up now Access later

Create Free Account