Web Scraping Engineer (Data Extraction) at Treehouse Holdings


  Remote | Semi Senior | Full time | Programming

Gross salary $2400 - 4000 USD/month

59 applications
Replies the same day
Applications must be submitted in English

We are Treehouse Holdings, building a large-scale, high-impact data extraction platform using Apify Actors technology. The mission is to provide reliable, one-click access to data from every useful website on the planet. By 2027, we aim to launch 1,000 high-quality Apify Actors that power data products influencing billions of users worldwide. This role is part of a passionate team focused on creating scalable, efficient web scraping solutions that enable new data-driven insights for research, AI training, and market intelligence.


What You Will Do

  • Design, build, and release 10+ production-ready Apify Actors each month using Node.js/TypeScript or Python.
  • Select the optimal crawling technology (Cheerio vs. Playwright/Puppeteer) and fine-tune concurrency for maximum throughput and efficiency.
  • Implement advanced proxy usage, session management, and fingerprint rotation to avoid blocks and CAPTCHAs.
  • Write streamlined Dockerfiles, automate the build process, and deploy images to Apify Cloud.
  • Create clear input/output schemas, automated tests, and comprehensive README documentation to ensure user success on first run.
  • Monitor system logs, troubleshoot and resolve customer issues promptly while continuously improving performance and reliability.
  • Use AI coding assistants such as Cursor and Claude Code daily to scaffold, refactor, and document code faster.

Must-Have Skills

  • 2+ years of experience building production-grade web scrapers.
  • Strong expertise in Node.js with TypeScript or Python development.
  • Deep knowledge of Crawlee plus Playwright or Puppeteer for building robust scraping actors.
  • Proven skills in designing proxy pools, managing sessions, and rotating fingerprints to bypass anti-scraping measures.
  • Comfortable working with Docker containers, Git version control, and asynchronous/concurrent programming paradigms.
  • Experience as a daily user of AI coding tools such as Cursor and Claude Code to enhance coding productivity.
  • Good command of English (B2+), able to write clear documentation and communicate asynchronously with a remote team.

We seek a thoughtful engineer who thrives on building scalable systems, solving complex problems quickly, and delivering reliable data extraction products that support multiple downstream applications.

Nice-to-Have Skills

  • Published or commercially deployed Apify Actors demonstrating real-world scraping impact.
  • Experience working with Continuous Integration/Continuous Deployment (CI/CD) pipelines and automated testing suites.
  • Familiarity with REST or GraphQL API design and development.

Compensation & Benefits

  • Competitive salary paid monthly in USD.
  • Learning budget available for courses and conference attendance to support professional growth.
  • Paid time off in addition to public holidays based on your local region.
  • Clear career progression opportunities: Senior Engineer → Staff Engineer → Tech Lead.
  • Fully remote work environment with core hours between 10 a.m. – 4 p.m. U.S. Eastern Time (14:00–20:00 UTC during daylight saving time).
  • Async-first culture with minimal meetings and strong ownership of your roadmap supported by the team.

GETONBRD Job ID: 55293

Fully remote: The position can be performed from anywhere in the world.
Pets allowed: Pets are welcome in the office.
Flexible hours: Flexible start and end times, with freedom to handle personal or family errands.
Informal dress code: Treehouse Holdings does not require any dress code.
Extra vacation: Treehouse Holdings grants paid vacation beyond the legal minimum.

Remote Work Policy

Fully remote

The work is 100% remote from any country.


About Treehouse Holdings

Treehouse Holdings starts, buys, and accelerates tech-enabled service brands. We recruit self-directed engineers and operators worldwide, hand them real ownership over revenue-critical projects, and arm them with the resources of a U.S. company.
