Skip to content
@apify

Apify

We're making the web more programmable.

Pinned

  1. crawlee crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

    TypeScript 12.7k 556

  2. proxy-chain proxy-chain Public

    Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.

    JavaScript 800 136

  3. apify-client-js apify-client-js Public

    Apify API client for JavaScript / Node.js.

    TypeScript 61 24

  4. apify-sdk-js apify-sdk-js Public

    Apify SDK monorepo

    TypeScript 108 29

  5. got-scraping got-scraping Public

    HTTP client made for scraping based on got.

    TypeScript 415 30

  6. fingerprint-suite fingerprint-suite Public

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    TypeScript 763 80

Repositories

Showing 10 of 122 repositories
  • actor-vector-database-integrations Public

    Transfer data from Apify Actors to vector databases (Pinecone, Chroma)

    Python 0 Apache-2.0 1 0 1 Updated Jun 12, 2024
  • crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

    TypeScript 12,726 Apache-2.0 556 106 (1 issue needs help) 5 Updated Jun 12, 2024
  • apify-sdk-python Public

    The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.

    Python 110 Apache-2.0 8 19 3 Updated Jun 12, 2024
  • workflows Public

    Apify's reusable github workflows

    6 2 2 1 Updated Jun 12, 2024
  • apify-shared-js Public

    Utilities and constants shared across Apify projects.

    TypeScript 11 Apache-2.0 9 5 7 Updated Jun 12, 2024
  • apify-docs Public

    This project is the home of Apify's documentation.

    API Blueprint 22 Apache-2.0 67 66 19 Updated Jun 12, 2024
  • apify-cli Public

    Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.

    TypeScript 116 17 26 (1 issue needs help) 6 Updated Jun 12, 2024
  • openapi Public

    An OpenAPI specification for the Apify API.

    0 MIT 0 11 2 Updated Jun 11, 2024
  • fingerprint-suite Public

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    TypeScript 763 Apache-2.0 80 18 12 Updated Jun 10, 2024
  • actor-web-automation-agent Public

    This is the experimental version of Web Automation Agent. The agent uses natural language instructions to browse the web and extract data.

    TypeScript 16 14 9 3 Updated Jun 10, 2024