Nativelink

Data Scientist - AI/ML

Reposted 12 Days Ago
Remote
Hiring Remotely in United States
Mid level
The Data Scientist will develop machine learning models to optimize software build processes, analyze large datasets, and collaborate with engineering teams on improvements.
The summary above was generated by AI

About Trace Machina:
Trace Machina is transforming the software development lifecycle with NativeLink, a high-performance build caching and remote execution system. NativeLink accelerates software compilation and testing processes while reducing infrastructure costs, enabling organizations to optimize their build workflows. We work with clients of all sizes to help them scale and streamline their build systems efficiently and effectively.

We are looking for an innovative and driven Data Scientist with a focus on AI/ML to join our team. In this role, you will apply your data science expertise to enhance NativeLink’s capabilities, from optimizing build processes to developing machine learning models that improve performance, scalability, and efficiency.

Job Description:
As a Data Scientist focusing on AI/ML at Trace Machina, you will develop, test, and implement machine learning models and algorithms to solve complex problems in software build optimization and testing. You will work closely with engineering teams to improve the performance of the NativeLink platform and collaborate with data engineers to ensure robust data pipelines and infrastructure. Your work will help build intelligent systems that make software development faster, more reliable, and more cost-efficient.

Job Responsibilities:

  • Design, implement, and deploy machine learning models to optimize software build systems, including caching, task distribution, and execution workflows

  • Work with large datasets to identify patterns, anomalies, and insights that inform decisions for improving build processes and remote execution

  • Develop predictive models to optimize build times, cache hit rates, and system resource utilization

  • Conduct experiments to improve the efficiency of build systems through data-driven decisions, leveraging AI/ML techniques such as reinforcement learning and optimization

  • Collaborate with cross-functional teams (engineering, product, and operations) to translate business problems into AI/ML-driven solutions

  • Analyze customer usage data to identify opportunities for feature improvements and innovations within the NativeLink platform

  • Develop custom algorithms for performance monitoring, anomaly detection, and optimization of CI/CD pipelines

  • Build, test, and validate machine learning models using a variety of techniques, ensuring they are scalable, robust, and interpretable

  • Build and maintain data pipelines to support model training, testing, and deployment in production environments

  • Communicate findings and insights to both technical and non-technical stakeholders in a clear and actionable way

Required Skills and Experience:

  • 3+ years of experience as a Data Scientist, with a strong focus on AI and machine learning

  • Expertise in machine learning algorithms, data analysis, and statistical modeling techniques

  • Proficiency in Python, R, or other data science programming languages, with experience using libraries such as TensorFlow, PyTorch, Scikit-learn, and Pandas

  • Strong knowledge of deep learning, reinforcement learning, or other advanced AI techniques

  • Experience with large-scale data processing, including working with big data technologies (e.g., Spark, Hadoop)

  • Familiarity with cloud infrastructure (AWS, GCP, Azure) and deploying machine learning models in production

  • Strong understanding of data wrangling, feature engineering, and building predictive models

  • Experience with version control (Git) and working in collaborative environments

  • Excellent problem-solving skills and ability to generate actionable insights from data

  • Ability to communicate complex AI/ML concepts effectively to both technical and non-technical teams

Nice to Have:

  • Experience with build systems or CI/CD pipeline optimization

  • Background in natural language processing (NLP) or time-series forecasting for predictive analytics

  • Familiarity with containerization tools like Docker and Kubernetes for deploying AI models

  • Experience in AI model explainability and interpretability

  • Published research or contributions to open-source machine learning projects

Why Join Trace Machina?

  • Work with cutting-edge AI and machine learning technologies to optimize high-performance build systems

  • Collaborate with a talented and innovative team of engineers, data scientists, and product managers

  • Shape the future of software build processes for leading companies around the world

  • Competitive salary and benefits package

  • Opportunities for career growth, professional development, and continuous learning

If you're passionate about applying AI/ML to solve real-world problems in software development, we’d love to hear from you!

Top Skills

AWS
Azure
Docker
GCP
Hadoop
Kubernetes
Pandas
Python
PyTorch
R
Scikit-Learn
Spark
TensorFlow
