Principal Data Scientist
Principal Data Scientist
at LeafLink
New York, New York, United States
About LeafLink
LeafLink is the regulated cannabis industry's largest wholesale marketplace, providing licensed dispensaries the ability to order from their favorite brands, as well as a suite of software tools for those brands to manage and scale their operations.
With thousands of retailers and thousands of brands across 26 territories in the US and Canada, we are setting the industry standard for how cannabis businesses grow together. LeafLink annually processes and manages more than $3 Billion in annualized wholesale cannabis orders - you can learn more about our history and path to $1B here http://firstbillion.leaflink.com/.
Our team, backed by funding from leading VC's, is poised to define the cannabis supply chain through technology. LeafLink was named one of Built In NYC's 'Best Places to Work in 2021', as well as one of Fast Company's 'Top 10 Most Innovative Companies in Enterprise for 2020', joining the ranks of Amazon, Slack, and VMWare - and we're just getting started!
The Role
LeafLink is seeking a proven leader and expert in applied data science to join our New York team. The ideal candidate will possess experience and acumen of understanding business opportunities and challenges in a marketplace ecosystem and applying data science tools to solve them. You have built and deployed into production machine learning models. You are passionate about keeping up with the trends in the data science space and introducing industry leading concepts to keep LeafLink ahead of the curve. You love breaking down complex technical solutions into easy to understand concepts for a non-technical audience and showcase business value. You deeply value frequent and detailed communication to foster alignment and cross-functional collaboration to ensure data science solutions are grounded in full context of the business and align with stakeholder expectations. Above all, you firmly believe that data science applications should drive business value for all stakeholders.
Responsibilities:
- Serve as subject matter experts in developing and deploying advanced data science workflows and technologies for marketplace use cases
- Build and deploy rapid prototypes as concept demonstrators to business stakeholders and product managers
- Propose compelling solutions and intelligent on-platform features driving stickiness and value for LeafLink customers
- Partner with product managers on enabling data-driven features on platform through application of advanced analytics and ML
- Drive a test and learn culture at LeafLink by developing processes for conducting feature experiments and inferring impact on key performance metrics
- Communicate results and business value of experimentation and ML work to a non-technical audience
- Develop patterns and framework for ML Operations and experimentation at LeafLink in partnership with DevOps and Data Engineers
- Mentor and coach other data scientists and analysts on the team to develop data science expertise
- Test and recommend right fit technology choices to augment the data stack at LeafLink
Qualifications:
- Expert level skills in SQL & Python
- Significant experience in feature engineering on structured and unstructured data
- Strong familiarity building ML Pipelines using popular Python based frameworks such as sklearn, keras, spark mlib, pytorch
- Well versed working with at least one ML Ops management platforms such as ML Flow, Databricks, AWS SAgemaker, h2o.ai
- Accomplished in experiment design, causal inference and inferential statistics is a must have
- Qualified working in a SaaS or product based technology environment working alongside Developers, DevOps, Data Engineers and Analytics Engineers.
- Experience using Airflow to build workflows incorporating feature extraction, running model predictions and persisting output of model to data stores
- Driver in deploying models in high velocity streaming data environments using Spark or other frameworks
- A pro at working within AWS environment; Redshift preferred
- Proven background managing or leading data scientists on projects
- Comfortable using Git to share and manage code
- Ability to work in a fast-paced growth business with many collaborators
- Strong knowledge working across the stack when need arises
Benefits:
- Flexible PTO to give our employees a little extra R&R when they need it
- Competitive compensation and 401k
- Comprehensive health coverage (medical, dental, vision)
- Company sponsored Annual Citibike Membership for NYC employees
- Commuter Benefits through a Flexible Spending Account
- A robust stock option plan to give our employees a direct stake in LeafLink’s success
LeafLink’s employee-centric culture has earned us a coveted spot on BuiltInNYC’s Best Places to Work for in 2021 list. Learn more about LeafLink’s history and the path to our First Billion in Wholesale Cannabis Orders here.