CAST AI Logo

CAST AI

Senior Software Engineer - AI Enabler

Reposted An Hour Ago
Be an Early Applicant
Remote
5 Locations
7K-9K Annually
Senior level
Remote
5 Locations
7K-9K Annually
Senior level
The Senior Software Engineer will optimize LLM management in Kubernetes, enhance performance, and build software for cost efficiency across cloud platforms.
The summary above was generated by AI
Why Cast AI?

Cast AI is the leading Application Performance Automation (APA) platform, enabling customers to cut cloud costs, improve performance, and boost productivity – automatically.
Built originally for Kubernetes, Cast AI goes beyond cost and observability by delivering real-time, autonomous optimization across any cloud environment. The platform continuously analyzes workloads, rightsizes resources, and rebalances clusters without manual intervention, ensuring applications run faster, more reliably, and more efficiently.
Headquartered in Miami, Florida, Cast AI has employees in more than 32 countries worldwide and supports some of the world’s most innovative teams running their applications on all major cloud, hybrid, and on-premises environments. Over 2,100 companies already rely on Cast - from BMW and Akamai to Hugging Face and NielsenIQ.
What’s next? Backed by our $108M Series C, we’re doubling down on making APA the new standard for DevOps and MLOps, and everything in between.

About the role

AI Enabler – Helps customers deploying and managing LLMs in their Kubernetes cluster and optimizes their workloads by providing cost visibility and intelligent routing for LLM requests to the most cost-effective compute resources (e.g., Grok, self-hosted LLAMA models).

Here are some of the tools we use daily:
  • Languages: GoLang (primary), Python (secondary for some cases)
  • Cloud & Orchestration: Kubernetes, AWS, GCP, Azure
  • Databases & Storage: PostgreSQL, Cloud Object Storage
  • PostgreSQL and Cloud Object Storage for persistence
  • Messaging & APIs: GCP Pub/Sub, gRPC for internal communication, REST for public APIs
  • Observability: Prometheus, Grafana, Loki, Tempo
  • CI/CD & GitOps: GitLab CI with ArgoCD.
Requirements:
  • Strong software engineering skills with experience in distributed systems and backend development (ideally GoLang, but not a hard requirement as long as you’re willing to transition to it)
  • Strong debugging, optimization, and performance-tuning skills
  • Deep understanding of cloud platforms: hands-on experience with cloud platforms like AWS, Google Cloud Platform (GCP), Microsoft Azure, and tools such as Kubernetes for containerization and orchestration
  • CI/CD and DevOps practices experience
  • Strong English skills, both verbal and written
  • Ability to work independently and collaboratively within a team
  • Startup mindset: adaptable, proactive, and comfortable with ambiguity
  • A proactive, problem-solving mindset with a "yes we can" attitude.
What’s in it for you?
  • Competitive salary (€6,500 - €9,000 gross, depending on the level of experience)
  • Enjoy a flexible, remote-first global environment.
  • Collaborate with a global team of cloud experts and innovators, passionate about pushing the boundaries of Kubernetes technology.
  • Enjoy a flexible, remote-first global environment.
  • Equity options.
  • Private health insurance.
  • Get quick feedback with a fast-paced workflow. Most feature projects are completed in 1 to 4 weeks.
  • Spend 10% of your work time on personal projects or self-improvement. 
  • Learning budget for professional and personal development - including access to international conferences and courses that elevate your skills.
  • Annual hackathon to spark new ideas and strengthen team bonds.
  • Team-building budget and company events to connect with your colleagues.
  • Equipment budget to ensure you have everything you need.
  • Extra days off to help maintain a healthy work-life balance.
    #LI-Remote

#LI-Remote

Top Skills

Argocd
AWS
Azure
Cloud Object Storage
GCP
Gcp Pub/Sub
Gitlab Ci
Go
Grafana
Grpc
Kubernetes
Loki
Postgres
Prometheus
Python
Rest
Tempo

Similar Jobs

An Hour Ago
Remote
3 Locations
Senior level
Senior level
Information Technology • Software
Develop user interfaces and backend services for consumer products using modern technologies. Optimize UI for performance and collaborate with teams for integrated solutions.
Top Skills: CSSHTMLNest.JsNext.JsNode.jsReactSQLTailwind CssTypescript
An Hour Ago
Remote
EU
Mid level
Mid level
Cryptocurrency
The Technical Support Analyst handles customer-reported issues for a SaaS solution, collaborating with internal teams and maintaining high customer satisfaction.
Top Skills: AWSDatabasesHmacJIRAMfaRest ApiSaaSSQLZendesk
An Hour Ago
Remote
EU
Senior level
Senior level
Artificial Intelligence • Software • Biotech • Pharmaceutical
The Lead MES Consultant will direct client projects, drive digital transformation, analyze and enhance pharmaceutical manufacturing processes, and lead a team of consultants while collaborating with stakeholders across the industry.
Top Skills: ErpIiotLimsMesScada

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account