TypeScript web-scraping

Open-source TypeScript projects categorized as web-scraping

Top 6 TypeScript web-scraping Projects

  • crawlee

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

  • Project mention: Crawlee: Crawlee–build reliable crawlers. Works with Puppeteer, Playwright, Ch | news.ycombinator.com | 2024-05-24
  • ayakashi

    :zap: Ayakashi.io - The next generation web scraping framework

  • SurveyJS

    Open-Source JSON Form Builder to Create Dynamic Forms Right in Your App. With SurveyJS form UI libraries, you can build and style forms in a fully-integrated drag & drop form builder, render them in your JS app, and store form submission data in any backend, inc. PHP, ASP.NET Core, and Node.js.

    SurveyJS logo
  • LeMondeRssReader

    :newspaper: Read RSS feed from LeMonde.fr and display news inside the App

  • scrapyteer

    Web crawling & scraping framework for Node.js on top of headless Chrome browser

  • Project mention: Low-code Node.js web scraping tool | /r/webscraping | 2023-07-07

    Hi guys, I've created an open-source low-code Node.js web scraping tool on top of the Puppeteer - https://github.com/miroshnikov/scrapyteer. It offers a small set of functions that are combined in pipelines to define a crawling workflow and a shape of output data. Maybe somebody will find it useful.

  • botasaurus-starter

    🚀 OFFICIAL STARTER TEMPLATE FOR BOTASAURUS SCRAPING FRAMEWORK 🤖

  • Project mention: Meet Bose Framework: 🚀 Your Swiss Army Knife as a Ninja Scraper ✨ | dev.to | 2023-06-12
  • monster-hunter-now-events

    A tool that auto-generates calendar events for Monster Hunter Now by scraping web news articles, processing them with AI, and creating a convenient calendar subscription.

  • Project mention: I created a calendar feed for in game events | /r/MHNowGame | 2023-11-02
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

TypeScript web-scraping related posts

  • Crawlee: Crawlee–build reliable crawlers. Works with Puppeteer, Playwright, Ch

    1 project | news.ycombinator.com | 24 May 2024
  • Crawlee · Build reliable crawlers. Fast

    1 project | news.ycombinator.com | 8 May 2024
  • Launching Crawlee Blog: Your Node.js resource hub for web scraping and automation.

    1 project | dev.to | 26 Feb 2024
  • Anything like scrapy in other languages?

    1 project | /r/webscraping | 10 Dec 2023
  • Best web scraping framework to learn

    1 project | /r/webscraping | 12 Jul 2023
  • Deep diving into Apify world

    1 project | /r/thewebscrapingclub | 2 Apr 2023
  • Build and run your Python web scrapers in the cloud with Apify SDK for Python

    2 projects | /r/webscraping | 14 Mar 2023
  • A note from our sponsor - SaaSHub
    www.saashub.com | 3 Jun 2024
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source web-scraping projects in TypeScript? This list will help you:

Project Stars
1 crawlee 12,621
2 ayakashi 198
3 LeMondeRssReader 27
4 scrapyteer 19
5 botasaurus-starter 15
6 monster-hunter-now-events 8

Sponsored
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com