Senior Data Engineer
At Arthur, we are building the first platform for Responsible AI and work with leaders in finance, self-driving transportation, and other industries at the forefront of AI. We are backed by the best investors in enterprise software and are growing the top startup team in enterprise tech. We are led by industry veterans with deep expertise in ML. We are looking for a driven Senior Data Engineer to join our diverse and collaborative team!
As a Senior Data Engineer, you will:
Design & build a high-throughput SaaS platform with particular emphasis on data engineering components
Be responsible for production delivery of your components, including integration & coordination with your teammates responsible for the user-facing web application, CI/CD pipelines, and SRE infrastructure
Be forward-thinking in designing for future petabyte scale, ensuring a performant & resilient architecture
Exhibit continuous curiosity in understanding emerging technology that could solve our challenges
Mentor others on data engineering best practices & guide decision-making
Qualifications:
4+ years of software engineering experience on a SaaS platform with emphasis on large-scale data systems
Experience building large-scale data systems using distributed file storage technologies such as HDFS and S3 and distributed processing frameworks such as Spark and EMR
Experience with event processing and streaming data technologies, including message queues such as Kafka and stream processors such as Spark Streaming, Storm, and Kinesis
Experience with multiple RDBMS & NoSQL technologies
Proficiency with Python (preferred) or other commonly used data processing languages such as Java or Scala
Understanding of multi-tenant platforms, best practices for managing multiple organizations on a shared platform, providing secure & controlled access to data, and role-based access control (RBAC)
Experience working with cloud environments such as AWS and GCP
CS (preferred) or other technical degree, or equivalent practical experience
Preferences:
1+ years of experience as a technical lead in data engineering
Experience with machine learning & AI and related tools such as Airflow, TensorFlow, and scikit-learn
Experience with analytics or data visualization architectures
Experience with on-prem deployment architectures
Experience running a 24x7 SaaS platform with an SLA
We offer:
A small, fast-growing team with plenty of opportunity to take ownership and run with projects
The opportunity to get in on the ground floor of a rapidly growing startup
Generous equity
A culture that empowers great people to accomplish great things
Full benefits package
Flexibility to work out of our NYC or DC offices