Data Scientist, Machine Learning at Codecademy
We are NYC based but remote friendly!
Codecademy was started in 2011 by two college students in a dorm room at Columbia that were frustrated by the huge gap between education and employment. Almost a decade later, we are a rapidly growing, diverse team of about 200 distributed through North America and headquartered in SoHo, NYC. We’ve raised over $87.5m in venture capital funding from top investors including Kleiner Perkins, Naspers, Owl Ventures, Union Square Ventures, Y Combinator, and more.
If you want to help build a business that impacts tens of millions of people each year and helps them lead better lives, join us!WHAT YOU'LL DO
As a Data Scientist focused on Machine Learning, you will work on an impactful team to analyze our millions of learners. We capture terabytes of data on how users engage with our platform. As Codecademy continues our rapid growth, we want to build a data-informed culture that uses hypothesis testing, experimentation, and exploratory analysis to guide our decision-making process.
You will join a small but growing team of Data Scientists. Our work is in high-demand from all corners of Codecademy. We work on a variety of problems and have a real impact on the business and product. If you have a proven background in data and you are excited about making code education accessible, we want to hear from you!
- Apply exploratory data analysis and causal inference to answer complex questions about our users.
- Collaborate across teams to help scope out analyses, through a combination of experimentation (A/B testing) and quantitative user research.
- Design experiments and evaluate results to test and iterate on new product ideas.
- Perform deep dives into our data to build understanding around our business.
- Work with our data science and engineering teams to maintain data integrity.
- Mentor and consult with a cross-functional team of data scientists, engineers, and product managers.
- 3+ years of industry experience in a data science, analytics, or research role, with 1+ years of Machine Learning experience. You have strong data intuition and knowledge of using data science best practices to drive impact.
- Expert SQL - we use Redshift. Able to write clean and efficient queries on massive datasets.
- Applied experience with statistical programming languages - R or Python preferred.
- Familiarity with multiple Machine Learning Frameworks
- Experience working with different algorithms (regressions, gradient boosting, random forest)
- Understanding of statistical methods and when to use them (hypothesis testing, experiment design, sampling).
- Strong written and verbal communicator. Comfortable working with loosely defined research problems.
- Background in Advanced Machine Learning/Deep Learning.
- Knowledge of Scala and dbt.
- A workflow involving reproducible methods and version control - Github, Docker.
- Experience automating dashboards with business intelligence tools - Looker, Tableau.
- Passionate about teaching the world to code. Empathy for our learners, such as a background in education or past experience using our site.
- How can we estimate learning based on a user's journey?
- What are some different types of patterns in user behavior (i.e. predict churn, retention, LTV)?
- What do we do when we are unable to conduct an experiment?
- How do we improve the relevance of our course recommendations?
- How can we scale our existing processes? (experiment reporting framework, forecasting)
At Codecademy, we are committed to teaching people the skills they need to upgrade their careers. Codecademy aims to educate a richly diverse demographic of learners with our product and in order to accomplish this, we believe our team should reflect that rich diversity. Our company celebrates diversity in all of its forms-- race, gender, color, national origin, marital status, sexuality, religion, veteran status, age, ability, disability status-- and works to create an inclusive workplace where people of all backgrounds and beliefs are empowered to better their futures.#LI-Remote