YipitData Logo

YipitData

Web Scraping Engineer II

Posted 3 Days Ago
Be an Early Applicant
Easy Apply
Remote
Hiring Remotely in India
Mid level
Easy Apply
Remote
Hiring Remotely in India
Mid level
As a Web Scraping Engineer II, you'll design and maintain web scrapers, optimize data extraction processes, and collaborate with teams to ensure data quality and resilience.
The summary above was generated by AI

About YipitData:

YipitData is the leading market research and analytics firm for the disruptive economy and recently raised up to $475M from The Carlyle Group at a valuation over $1B.

We analyze billions of alternative data points every day to provide accurate, detailed insights on ridesharing, e-commerce marketplaces, payments, and more. Our on-demand insights team uses proprietary technology to identify, license, clean, and analyze the data many of the world’s largest investment funds and corporations depend on.

For three years and counting, we have been recognized as one of Inc’s Best Workplaces. We are a fast-growing technology company backed by The Carlyle Group and Norwest Venture Partners. Our offices are located in NYC, Austin, Miami, Denver, Mountain View, Seattle, Hong Kong, Shanghai, Beijing, Guangzhou, and Singapore. We cultivate a people-centric culture focused on mastery, ownership, and transparency.

Why You Should Apply NOW:

  • High Impact: Your work will directly influence key reports and strategic decisions across multiple business units.
  • Exciting Challenges: Tackle the design of resilient web scrapers, navigate dynamic website structures, and optimize large-scale data extraction.
  • Growth Opportunities: As an early member of our expanding Web Scraping Engineering team, you will have significant input on our strategies, processes, and team culture.

About The Role:

We are seeking a Web Scraping Engineer to join our growing engineering team. In this hands-on role, you’ll take ownership of designing, building, and maintaining robust web scrapers that power critical reports and customer experiences across our organization. You will work on complex, high-impact scraping challenges and collaborate closely with cross-functional teams to ensure our data ingestion processes are resilient, efficient, and scalable, while delivering high-quality data to our products and stakeholders.

As Our Web Scraping Engineer You Will:

Refactor and Maintain Web Scrapers

  • Overhaul existing scraping scripts to improve reliability, maintainability, and efficiency.
  • Implement best coding practices (clean code, modular architecture, code reviews, etc.) to ensure quality and sustainability.

Implement Advanced Scraping Techniques

  • Utilize sophisticated fingerprinting methods (cookies, headers, user-agent rotation, proxies) to avoid detection and blocking.
  • Handle dynamic content, navigate complex DOM structures, and manage session/cookie lifecycles effectively.

Collaborate with Cross-Functional Teams

  • Work closely with analysts and other stakeholders to gather requirements, align on targets, and ensure data quality.
  • Provide support, documentation, and best practices to internal stakeholders to ensure effective use of our web scraped data in critical reporting workflows.

Monitor and Troubleshoot

  • Develop robust monitoring solutions, alerting frameworks  to quickly identify and address failures.
  • Continuously evaluate scraper performance, proactively diagnosing bottlenecks and scaling issues.

Drive Continuous Improvement

  • Propose new tooling, methodologies, and technologies to enhance our scraping capabilities and processes.
  • Stay up to date with industry trends, evolving bot-detection tactics, and novel approaches to web data extraction.

This is a fully-remote opportunity based in India. Standard work hours are from 11am to 8pm IST, but there is flexibility here.

You Are Likely To Succeed If:

  • Effective communication in English with both technical and non-technical stakeholders.
  • 3+ years of experience with web scraping frameworks (e.g., Selenium, Playwright, or Puppeteer).
  • Strong understanding of HTTP, RESTful APIs, HTML parsing, browser rendering, and TLS/SSL mechanics.
  • Expertise in advanced fingerprinting and evasion strategies (e.g., browser fingerprint spoofing, request signature manipulation).
  • Deep experience managing cookies, headers, session states, and proxy rotations, including the deployment of both residential and data center proxies.
  • Experience with logging, metrics, and alerting to ensure high availability.
  • Troubleshooting skills to optimize scraper performance for efficiency, reliability, and scalability.

What We Offer:

Our compensation package includes comprehensive benefits, perks, and a competitive salary: 

  • We care about your personal life and we mean it. We offer vacation time, parental leave, team events, learning reimbursement, and more!
  • Your growth at YipitData is determined by the impact that you are making, not by tenure, unnecessary facetime, or office politics. Everyone at YipitData is empowered to learn, self-improve, and master their skills in an environment focused on ownership, respect, and trust.

We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, marital status, disability, gender, gender identity or expression, or veteran status. We are proud to be an equal-opportunity employer.

Job Applicant Privacy Notice

ht="1" width="1" alt="" src="https://px.ads.linkedin.com/collect/?pid=4341228&conversionId=10486642&fmt=gif" />

Top Skills

HTML
HTTP
Playwright
Puppeteer
Restful Apis
Selenium
Ssl
Tls

Similar Jobs

5 Hours Ago
Easy Apply
Remote or Hybrid
Bangalore, Bengaluru Urban, Karnataka, IND
Easy Apply
Junior
Junior
Cloud • Information Technology • Security • Software
Outbound BDR responsible for prospecting target accounts via cold calling, email, and LinkedIn; conducting market research; qualifying leads into SQLs; and handing off pipeline to Account Executives while collaborating to refine outreach.
Top Skills: Linkedin,Email Campaigns,Prospecting Tools,Jumpcloud
5 Hours Ago
Remote
India
Mid level
Mid level
Cloud • Information Technology • Productivity • Software • Automation
Design modern, reusable UI components, templates, and branded design systems for demos and POCs. Build front-end components and flows, maintain a shared UI asset library, collaborate with presales and field teams, and define best practices to accelerate POC delivery and improve visual consistency.
Top Skills: Boomi Flow,Boomi,Html,Css,Javascript,React,Low-Code Platforms,Flow-Based Environments
7 Hours Ago
Remote
India
Mid level
Mid level
Cloud • Information Technology • Productivity • Software • Automation
Design, develop, execute, and maintain automated functional and integration tests for Boomi runtime using Java, Selenium, TestNG and API frameworks. Validate REST/SOAP services, perform regression and impact analysis, improve QA processes, collaborate in Agile teams, and use defect management tools to ensure product quality.
Top Skills: Java,Selenium,Testng,Restassured,Readyapi,Selenium Webdriver,Selenium Grid,Rest,Soap,Wsdl,Jmeter,Blazemeter,Jira,Zephyr,Hp Alm,Perl,Shell,Linux,Unix,Intellij,Eclipse,Git,Bitbucket,Sql,Hibernate,Aws

What you need to know about the Mumbai Tech Scene

From haggling for the best price at Chor Bazaar to the bustle of Crawford Market, the energy of Mumbai's traditional markets is a key part of the city's charm. And while these markets will always have their place, the city also boasts a thriving e-commerce scene, ranking among the largest in the region. Driven by online sales in everything from snacks to licensed sports merchandise to children's apparel, the local industry is worth billions, with companies actively recruiting to meet the demands of continued growth.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account