Top Artificial Intelligence DevOps & Platform Engineering Jobs in New York City, NY
Artificial Intelligence • Cloud • Security • Software • Cybersecurity
Lead design and implementation of LLM observability features: prototype and scale product capabilities for tracing, evaluating, and debugging generative AI systems. Work cross-functionally to influence architecture, mentor engineers, prioritize customer pain points, and drive product and engineering decisions for reliable, high-performance AI observability.
Top Skills:
Distributed Systems, Generative AI, Inference Pipelines, Large Language Models (LLMs), Observability Tools/Platforms, Prompt Engineering, Scalable Backend Architectures
Reposted 6 Hours Ago
Artificial Intelligence • Cloud • Security • Software • Cybersecurity
Partner with Field and Product teams to design and implement LLM observability architectures, build proofs-of-concept, produce technical collateral, and advise customers to drive adoption and product feedback.
Top Skills:
Datadog, JavaScript, LLM, Logging, Metrics, Observability, Python, Tracing, TypeScript
Artificial Intelligence • Software
Build and maintain production infrastructure platforms (Rubix, Apollo, environment PaaS, observability) across fleets and clusters. Own full product lifecycle, collaborate with cross-functional teams, deploy secure solutions for civil and defense customers, and participate in field exercises to validate deployments.
Top Skills:
Apollo, C++, Cilium, Cline, Envoy, Foundry, Git, Go, Gotham, Gradle, Grafana, Java, JavaScript, Kubernetes, Python, React, Redux, Rubix, TypeScript, Windsurf
Artificial Intelligence • Information Technology • Software
The role involves designing and managing multi-cloud infrastructure, implementing CI/CD pipelines, ensuring platform reliability, scalability, and security, while optimizing performance for a SaaS platform used by enterprise customers.
Top Skills:
Argo, AWS, Azure, Datadog, Docker, GCP, GitHub Actions, Go, Kubernetes, Python, Terraform
Reposted Yesterday
Artificial Intelligence • Cloud • Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
The role involves managing and engineering platform standards and certified images across Linux and Windows environments, deploying Terraform modules, and implementing security compliance measures. Responsibilities include patch management, vulnerability resolution, monitoring, and cross-team collaboration to ensure successful implementation of OS standards.
Top Skills:
Ansible, AWS, Docker, Linux, Packer, Terraform, Windows
Artificial Intelligence • Machine Learning • Mobile • Social Impact • Software • App development
Lead the Platform team in defining the roadmap, focusing on cloud infrastructure, automation, and improving developer experience while mentoring and fostering technical excellence.
Top Skills:
AWS, Cloud Infrastructure, Distributed Systems
Artificial Intelligence • Fintech • Payments • Social Impact • Analytics • Financial Services • Automation
As a Senior SRE, you'll ensure reliable and scalable systems, develop observability solutions and infrastructure as code, and lead incident response efforts.
Top Skills:
AWS, CloudFormation, Datadog, ELK, Prometheus, Terraform
Artificial Intelligence • Information Technology • Internet of Things • Marketing Tech • Social Media • Software • SEO
Develop automated infrastructures, optimize application performance, implement monitoring tools, and resolve issues in a primarily Linux environment.
Top Skills:
Ansible, AWS, Azure, Bitbucket, Docker, GCP, Git, Go, Java, Kubernetes, Linux, Python, Terraform, Terragrunt
Artificial Intelligence • Fintech • Machine Learning • Mobile • Payments • Retail • Software
Lead and manage, in a hands-on capacity, the Infrastructure team that owns Upside's cloud platform: drive AWS architecture (cell-based), AI-assisted deployment tooling, security, CI/CD, observability, operational excellence, and cross-functional coordination to enable reliable, scalable, and secure platform services.
Top Skills:
AI, AWS, CI/CD, DynamoDB, EC2, Encryption, GCP, GitHub Actions, IAM, Lambda, Observability, RDS, Secrets Management, Terraform
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead teams to deliver AWS-based solution architecture, manage cloud migration initiatives, and innovate processes while engaging senior clients. Foster team growth and operational excellence.
Top Skills:
Argo CD, AWS, CloudFormation, Flux, GitHub Actions, Go, Kubernetes, Python, Terraform
Artificial Intelligence • Healthtech • Machine Learning • Natural Language Processing • Real Estate
The Platform Operations Engineer drives technical projects to enhance operational scalability, collaborating across teams to optimize workflows and implement new technologies.
Top Skills:
AWS, Python, SQL





















