Titan AI LogoTitan AI

crawlee-python

5,774
394
Python

项目描述

Crawlee is a Python library for web scraping and browser automation, designed to build reliable crawlers that can extract data for AI applications. It supports various file types, works with BeautifulSoup, Playwright, and HTTP, and offers both headful and headless modes with proxy rotation.

项目信息

创建于 1/10/2024
更新于 7/2/2025

分类

ai-development-platform
data-science
search-and-retrieval

标签

development-tools
data-processing
open-source-community
automation
cloud-native

主题

apify
automation
beautifulsoup
crawler
crawling
hacktoberfest
headless
headless-chrome
pip
playwright
python
scraper
scraping
web-crawler
web-crawling
web-scraping