DomainTools Logo

DomainTools

Data Engineering Intern

Posted 6 Hours Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in New York, NY, USA
Internship
In-Office or Remote
Hiring Remotely in New York, NY, USA
Internship
As a Data Engineering Intern, you'll assist in data hygiene projects, automate processes, and support machine learning pipelines, gaining hands-on experience in data analysis.
The summary above was generated by AI

DomainTools is seeking a R&D Data Engineering Intern. This role is intended for those seeking to hone their development and data analysis skills as they begin their career.  A successful candidate will be well organized, collaborative, and experienced in working with remote teams. 

As our intern, you will be part of a critical team supporting production machine learning pipelines, providing development and ad-hoc support to our business. Responsibilities include: assisting in data hygiene projects and documentation to maintain data integrity, automating processes, and supporting the R&D team on other special projects as needed. These valuable opportunities provide hands-on experience, allowing you to put your educational knowledge into action and lay a solid foundation for your future.  The role provides an excellent learning opportunity specifically for those interested in internet security, machine learning, and production development patterns.

The right person for us will have a support oriented mentality, wanting to enable organizations to make better business decisions and improve efficiency.  

Key Responsibilities:

  • Data cleaning and preparation to ensure our machine learning pipelines remain accurate and reliable. 
  • Update and maintain code to ensure our system stays compatible with evolving data sources. 
  • Develop and improve tools to monitor data health and help the team explore new ways to use our datasets. 

Requirements

Qualifications & Requirements

  • Strong Organizational Skills: Ability to manage your tasks and schedule effectively. 
  • A Strong Attention to Detail: A sharp eye for spotting inconsistencies in data and a commitment to high-quality documentation. 
  • Clear and Precise Communication Skills: The ability to share updates and collaborate effectively with a remote team. 
  • Development Skills: Familiarity with python, git. Ideally also familiar with Spark/PySpark.
  • Preferred: Knowledge of computer networks, including DNS, domain names, and IP addresses

Time Commitment

  • Hours Per Week: 5-15 hours, depending on availability 
  • Working Hours: Can be flexible but a regular check-ins are required during business hours, US/Eastern time

Benefits

This is an unpaid internship offered for academic credit only; monetary compensation is not available. The primary goal is the intern's education and training, not to generate immediate advantage for the employer. The experience is designed to provide valuable, hands-on learning similar to an educational environment, and there is no guarantee of a paid position at the conclusion of the internship. The intern will work under close supervision of existing staff and will not displace regular employees. 

Top Skills

Git
Pyspark
Python
Spark

Similar Jobs

9 Days Ago
Remote
United States
30-50 Hourly
Internship
30-50 Hourly
Internship
Artificial Intelligence • Machine Learning • Software • Analytics
Interns will work on AI agents, data reporting pipelines, and product features in a hands-on environment, collaborating with senior engineers.
Top Skills: AngularCi/CdCloud EnvironmentsContainersGoJavaPythonReactVue
9 Days Ago
Remote
US
27-27 Hourly
Internship
27-27 Hourly
Internship
News + Entertainment • Software
Software engineering intern role building data extraction, export, and delivery pipelines. Gather pipeline requirements, design and implement ETL/ELT processes, document pipeline architecture, and learn cloud, container, and data warehousing technologies under mentorship.
Top Skills: AWSC#Containerized ApplicationsData VisualizationData WarehousingDatabasesEtl/EltJavaPythonReporting
31 Minutes Ago
Remote or Hybrid
136K-231K Annually
Senior level
136K-231K Annually
Senior level
Aerospace • Hardware • Information Technology • Security • Software • Cybersecurity • Defense
The Engine Systems Program Manager II manages complex aerospace development programs, ensuring compliance with budget, schedule, and customer commitments while leading a cross-functional team and interfacing with stakeholders.
Top Skills: Earned Value ManagementProject Management

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account