Error 1015: how to solve rate limiting from Cloudflare when web scraping
Apify Blog
by Natasha Lekh
2d ago
The Apify platform is designed for web scraping. Check us out. Coming across a Cloudflare error 1015 can disrupt your web scraping, but read on to learn how to overcome this hurdle and get on with your data extraction. What is error 1015 from Cloudflare? HTTP code error 1015, or "You are being rate limited", is a server-side error normally associated with Cloudflare. Error 1015 most often occurs when your crawler hits a website with too many requests in a short time. This detection triggers Cloudflare's security measures, leading to a denial of access to the website. 1015 error for web s ..read more
Visit website
Error 1009: how to solve it when web scraping
Apify Blog
by Natasha Lekh
5d ago
Encountering Cloudflare error 1009 (or 1007 or 1008 for that matter) may seem like hitting a roadblock in your web scraping journey, but understanding its causes and implementing effective solutions can keep you on track toward your data collection goals. What is Cloudflare error 1009? Cloudflare error 1009 occurs when the website you're trying to access through Cloudflare's network denies you entry due to an IP address block. This block can result from various reasons, including suspicious activity, security concerns, or geographic restrictions. 1009 error means a geographical access restric ..read more
Visit website
HTTP headers with Axios: a comprehensive guide
Apify Blog
by Brian Wachira
6d ago
The internet is a vast, interconnected network of computers that are in constant communication. At its core, this communication relies on the exchange of HTTP messages between different clients and servers. These messages are the fundamental blocks that allow client applications to access and retrieve various resources from servers—be it webpages, data files, or other digital resources. These applications utilize HTTP clients like Axios (which provide the capability to handle HTTP requests and responses) to send HTTP messages to servers, specifying the required resources. HTTP messages consis ..read more
Visit website
How to collect data from a website: a comprehensive guide
Apify Blog
by Daniela Ryplová
1w ago
You must have already faced this issue. You want to collect data from a website to use it for in-depth analysis but are stuck because you don’t have an idea how to actually get the data. Let me tell you directly that copy-pasting is not a clever way. But no worries! Fortunately, there is data scraping, also known as web scraping, the automated process of web data extraction. In this comprehensive guide, you’ll learn step-by-step how to collect data from a website with web scraping using both your own code and no-code scrapers. Side note: For more than 1,500 user-friendly and easy-to-deploy we ..read more
Visit website
How to handle forms and authentication in Scrapy
Apify Blog
by Theo Vasilis
2w ago
Hi! We're Apify, a full-stack web scraping and browser automation platform. If you're interested in using Python for web scraping, this article shows when and how to use Scrapy to deal with complex login forms and token authentication. When scraping a website, you'll likely encounter pages that require submitting a login form. Scrapy - the most powerful framework for web scraping in Python - provides ways to handle forms and authentication. While Web scraping with Scrapy 101 shows you how to use Scrapy's features in general, this short guide equips you with some Scrap ..read more
Visit website
How to handle timeouts in Python Requests
Apify Blog
by Satyam Tripathi
2w ago
Hi! We're Apify, a full-stack web scraping and browser automation platform. If you're interested in using Python for web scraping, this tutorial shows how to use timeouts effectively in the Python Requests library for any HTTP request. Timeouts are crucial for making requests in Python. They prevent your program from freezing indefinitely while waiting for a response from external servers or APIs. In this article, you’ll learn how to use timeouts effectively in the Python Requests library for any HTTP request. You'll learn: how to set connect and read timeouts handle timeout except ..read more
Visit website
MechanicalSoup: a good Python tool for web scraping?
Apify Blog
by Theo Vasilis
2w ago
Hi! We're Apify, a full-stack web scraping and browser automation platform. If you're interested in using Python for web scraping, this article shows when and how to use MechanicalSoup. Developers love coming up with weird names for things. 'Python', 'Selenium', 'BeautifulSoup', 'MechanicalSoup'. There's nearly always a story behind these unusual names, and MechanicalSoup is no exception. The weirdly named MechanicalSoup is the Python library we'll be exploring here, focusing on its utility for web scraping. We'll also compare it with BeautifulSoup and Selenium. The reason for ..read more
Visit website
Playwright vs. Cypress
Apify Blog
by Theo Vasilis
2w ago
? Web testing automation: Playwright or Cypress? Playwright's versatility and multi-browser support make it a popular choice for comprehensive cross-browser testing across multiple domains. In comparison, Cypress excels in front-end testing, offers exceptional documentation, and provides interactive debugging. Once upon a time, Selenium reigned supreme in E2E web testing. Then Cypress and Playwright came along. So, following my article on Cypress vs. Selenium and Playwright end-to-end testing, it's time to navigate the differences between Cypress and Playwright. By the end, you should know ..read more
Visit website
Scaling email marketing and lead generation with itrinity and Apify
Apify Blog
by Theo Vasilis
2w ago
It all began in August 2023, when Fabian Maume, co-founder of Tetris.io, joined a unique startup studio as the CMO for  and Warmup Inbox. These are two tools designed by the software company itrinity to improve the deliverability and integrity of email lists for email marketing and lead generation. Over time, all cold email inboxes suffer from deliverability issues. Warmup Inbox actively helps to reverse this process by keeping you out of spam folders and blacklists. The studio (itrinity), a hub for six SaaS companies, was on a mission to streamline its ..read more
Visit website
ScraperAPI vs. Apify
Apify Blog
by Percival Villalva
3w ago
Following our review of ScrapingBee, let's now shift our focus to ScraperAPI. However, before diving into the review, let's address the elephant in the room: Apify, being a web scraping service, might raise concerns about bias when reviewing another service. While we hold Apify in high regard and take pride in the value it offers our users, the purpose of this review is to provide an impartial assessment of other services, highlighting their strengths and weaknesses. It's important to recognize that Apify may not be the perfect fit for everyone, and that's perfectly okay. Our top priority is ..read more
Visit website

Follow Apify Blog on FeedSpot

Continue with Google
Continue with Apple
OR