Wynd Labs Logo

Wynd Labs

Web Scraping Specialist

Reposted 24 Days Ago
Easy Apply
Remote
70K-140K Annually
Mid level
Easy Apply
Remote
70K-140K Annually
Mid level
The Web Scraping Specialist will extract data from websites, optimize scraping processes, and manage data integrity, ensuring high-quality outputs.
The summary above was generated by AI
Web Scraping Specialist

$70k – $140k

Who We Are.

Wynd Labs is an early-stage startup that is on a mission to make public web data accessible for AI through contributions to Grass.

Grass is a network sharing application that allows users to share their unused bandwidth. Effectively, this is a residential proxy network that directly rewards individual residential IPs for the bandwidth they provide. Grass will route traffic equitably among its network and meter the amount of data that each node provides to fairly distribute rewards.

In non-technical terms: Grass unlocks everyone's ability to earn rewards by simply sharing their unused internet bandwidth on personal devices (laptops, smartphones).

This project is for those who lead with initiative and seek to challenge themselves and thrive on curiosity.

We operate with a lean, highly motivated team who revel in the responsibility that comes with autonomy. We have a flat organizational structure, the people making decisions are also the ones implementing them. We are driven by ambitious goals and a strong sense of urgency. Leadership is given to those who show initiative, consistently deliver excellence and bring the best out of those around them. Join us if you want to set the tone for a fair and equitable internet.

The Role.

We are seeking a Web Scraping Specialist who is proficient and brings significant experience in data extraction and web scraping techniques. You will join a small, specialized team and lead efforts to gather and analyze data, optimize scraping processes, and support our vision for a future where Grass plays a crucial role in transforming internet data accessibility.

Who You Are.

  • Demonstrated ability to extract data from complex websites with minimal supervision, with a portfolio or examples of past projects.
  • Proficiency in languages such as Python or JavaScript, with strong skills in libraries and frameworks like BeautifulSoup, Scrapy, or Selenium.
  • Knowledge of asynchronous programming, multithreading, and distributed scraping.
  • In-depth knowledge of HTML, CSS, JavaScript, and the Document Object Model (DOM).
  • Experience with NoSQL databases (MongoDB, Cassandra), capable of designing efficient storage solutions and managing data integrity.
  • Ability to apply machine learning algorithms for data cleaning, categorization, or predictive analysis adds significant value.
  • Experience with cloud services (AWS, Google Cloud, Azure) for deploying and managing scraping jobs at scale.
  • Active participation in open-source projects related to web scraping, data processing, or similar fields.

What You'll Be Doing.

  • Write, test, and refine code that extracts data from various online sources, ensuring reliability and efficiency.
  • Perform data retrieval tasks, handling complexities such as pagination and dynamic content loaded with AJAX.
  • Clean and format extracted data, ensuring it meets quality standards for further analysis or processing.
  • Database management: Store and manage the scraped data in appropriate databases, optimizing for access speed and data integrity.
  • Regularly monitor the scraping processes, identify and resolve any issues to maintain continuous data flow.

Why Work With Us.

  • Opportunity. We are at at the forefront of developing a web-scale crawler and knowledge graph that allows ordinary people to participate in the process, and share in the benefits of AI development.
  • Culture. We’re a lean team working together to achieve a very ambitious goal of improving access to public web data and distributing the value of AI to the people. We prioritize low ego and high output.
  • Compensation. You’ll receive a competitive salary and equity package.

Top Skills

AWS
Azure
Beautifulsoup
Cassandra
GCP
JavaScript
MongoDB
NoSQL
Python
Scrapy
Selenium

Similar Jobs

3 Hours Ago
Remote or Hybrid
Chicago, IL, USA
146K-256K Annually
Senior level
146K-256K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Design and implement complex ServiceNow integrations and data architectures using Workflow Data Fabric. Serve as the technical lead, build asynchronous messaging and ETL solutions, debug integration issues, and advise customers on best practices to drive adoption and business outcomes. Mentor teams and coordinate development to ensure timely delivery.
Top Skills: Servicenow,Workflow Data Fabric,Rest,Soap,Json,Middleware,Ldap,Sso,Saml,Jdbc,Import Sets,Export Sets,Idr,Remote Tables,Remote Process,Saas,Studio Ide,Automated Test Framework,Delegated Development,Flow Designer,Source Control,Apis,Javascript,Etl,Relational Databases,Nosql,Vpn,Ssl,Connect Chat,Agent Chat,Virtual Agent,Domain Separation,Servicenow Cis,Servicenow Cad,Ai
3 Hours Ago
Remote or Hybrid
Santa Clara, CA, USA
134K-178K Annually
Senior level
134K-178K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Partner with Sales to design and validate identity and security solutions, deliver tailored demos and POCs, support complex enterprise deals, translate technical capabilities into business outcomes, and collaborate with product and enablement teams to drive win rates.
Top Skills: Servicenow,Identity And Access Management (Iam),Identity Governance And Administration (Iga),Security Operations Platforms,Saas,Ai
3 Hours Ago
Remote or Hybrid
Santa Clara, CA, USA
139K-230K Annually
Senior level
139K-230K Annually
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead territory strategy and sales for ServiceNow CRM in the Energy & Utilities vertical. Support account planning, coach account teams, envision digital transformation value, align solutions with Now Value, and drive full-cycle deals from demand generation through negotiation and close while collaborating with specialists and partners.
Top Skills: Servicenow,Crm,Crm Saas,Ai,Ai-Powered Tools

What you need to know about the NYC Tech Scene

As the undisputed financial capital of the world, New York City is an epicenter of startup funding activity. The city has a thriving fintech scene and is a major player in verticals ranging from AI to biotech, cybersecurity and digital media. It also has universities like NYU, Columbia and Cornell Tech attracting students and researchers from across the globe, providing the ecosystem with a constant influx of world-class talent. And its East Coast location and three international airports make it a perfect spot for European companies establishing a foothold in the United States.

Key Facts About NYC Tech

  • Number of Tech Workers: 549,200; 6% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Capgemini, Bloomberg, IBM, Spotify
  • Key Industries: Artificial intelligence, Fintech
  • Funding Landscape: $25.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Greycroft, Thrive Capital, Union Square Ventures, FirstMark Capital, Tiger Global Management, Tribeca Venture Partners, Insight Partners, Two Sigma Ventures
  • Research Centers and Universities: Columbia University, New York University, Fordham University, CUNY, AI Now Institute, Flatiron Institute, C.N. Yang Institute for Theoretical Physics, NASA Space Radiation Laboratory

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account