Explore the New Web Data Extract Summit Site, Submit Speaker Proposals & Grab Early Bird Tickets!

Hello Web Data Developers

I am excited to share that we recently launched a new website for Web Data Extract Summit-do check it out, and submit your speaker proposals. And, for the first time, Extract Summit is happening in the US(Austin, Texas). Take advantage of our early bird special on Extract Summit tickets! Purchase your tickets and book your flights soon!

We have a lot to catch up this week. So, let's get started. In This Week's Edition,

1. Call for Proposals For Extract Summit Speakers Open

2. All about page objects

What are page objects?
Why page objects are used in Web Scraping?
In what way does it relate to Web Scraping using the Zyte API?
Watch the webinar on page objects
Recommended blogs to read on page objects

3. Latest Zyte blogs.

4. Upcoming events- How to eliminate setup and maintenance on your headless browser fleet for web scraping

5. Join Extract Data Community on Discord

Happy Scraping Stay tuned and enjoy this newsletter :)

Web Data Extract Summit 2024: Call for Speaker Proposals

Join Us For Two Days in Austin, Texas on October 9-10, 2024— Extract Labs(Developer Day) and The Main Event.

We invite you to apply as a speaker at the first-ever U.S.-based Web Data Extract Summit.

Extract Data Community is one of the fastest-growing communities in the Web Data Extraction Industry. Speaking at Extract Summit is a huge opportunity to reach and influence a diverse audience, Share your expertise and innovative ideas, network with industry leaders, and boost your visibility.

We are looking for dynamic speakers who can offer creative solutions and innovative perspectives on web data extraction. Whether you are an established expert or an emerging talent, we encourage you to apply and share your unique insights.

Apply now to speak at Web Data Extract Summit 2024! We look forward to your proposals and seeing you in Austin! Submit Your Proposal-here

All about page objects

What are page objects?

The page object pattern is used to encapsulate the user interface details and provide a higher-level API for tests to interact with the application. This makes the application more robust to changes in the UI. The page objects abstract and organizes the code related to web page interactions, which enhances the maintainability and clarity of automated tests.

Why page objects are used in web scraping?

Page objects are used in web scraping to separate the structure and layout of web pages from the scraping logic, making the code cleaner and more maintainable. They enhance code reusability and readability by encapsulating web page elements and interactions, allowing for easy updates when web page structures change. This approach simplifies maintenance, improves scalability for complex projects, and provides a clear and organized representation of web interactions, making web scraping projects more efficient and manageable.

In what way does it relate to web scraping using the Zyte API?

The biggest application of page objects is the Zyte Spider Templates Library (zyte-spider-templates). You can get started

1. Start a Scrapy project by cloning the Ecommerce template. Run spider with URLs of your using the scrapy crawl e-commerce command.

2. Customize the template: If needed, you can customize the template by extending the Ecommerce template to include additional data fields or modify the crawling behaviour.

Note: Implement page objects to override parsing logic for all or some websites, both for navigation and item detail data.

Watch the webinar on page objects

Umair discusses Page Objects, a web scraping design pattern that improves code maintainability and scalability. The webinar covers integrating a generic scraper with Page Objects as Python packages for Scrapy or Python projects. It emphasizes the benefits of reusing Page Objects across multiple projects and introduces web-poet, an open-source Python package by Zyte for efficient web data extraction using Page Objects.

Recommended blogs to read on page objects

Latest Blogs by Zyte

Upcoming Events- How to eliminate setup and maintenance on your headless browser fleet for web scraping

Join our webinar to discover a new approach to deploying headless browsers for web scraping using Zyte API's fully hosted solution.

Traditional headless browser setups, while effective for small projects, can be complex and hinder scalability for larger ones, often requiring full-time developer teams.

Learn from Daniel Cave and Fernando Tadao Ito why these traditional methods are slowing down developers when to opt for hosted solutions, and how to integrate Zyte API Headless Browser with your current spiders.

Don't miss this chance to optimize your data extraction process—register now!

Join Extract Data Community on Discord

We’ve established a vibrant Discord community of 1300+ web scraping enthusiasts like yourself, dedicated to sharing insights, learning new technologies, and advancing in web scraping.

If you have an interesting story, a use-case, or a recent web scraping project you worked on to share with the community members. You can apply here ⬇️

Apply as Extract Community Webinar Speaker

Until next time

Neha .

🥑, Zyte

Explore the New Web Data Extract Summit Site, Submit Speaker Proposals & Grab Early Bird Tickets!

Zyte

Home of the all-in-one, AI-powered Web Scraping API, and a world-class data delivery team.

Web Data Extract Summit 2024: Call for Speaker Proposals

All about page objects

Watch the webinar on page objects

Latest Blogs by Zyte

Upcoming Events- How to eliminate setup and maintenance on your headless browser fleet for web scraping

Join Extract Data Community on Discord

Extract Web Data Newsletter

10,691 followers

More articles by this author

Explore topics

Web Data Extract Summit 2024: Call for Speaker Proposals

All about page objects

Watch the webinar on page objects

Latest Blogs by Zyte

Upcoming Events- How to eliminate setup and maintenance on your headless browser fleet for web scraping

Join Extract Data Community on Discord

Extract Web Data Newsletter

10,691 followers

Extract Summit Spotlight: Proxy Tech Future and Legal Landscape, Plus Major Court Win for Web Scraping

Jul 30, 2024

Global retailer enlists Zyte for data-driven, AI-powered pricing intelligence

May 1, 2024

AI Scraping for product data now available in Zyte API

Mar 19, 2024

Exploring the Frontier of AI Scraping: A Fireside Chat with Zyte's Tech Leaders- Kevin Magee and Konstantin Lopukhin

Feb 29, 2024

Recap of Zyte API and Reflections on Traditional web Scraping Systems

Feb 15, 2024

🚀A Month of Milestones: Expert Talks on Anti-bots, Community Growth, Web Scraping Projects and More!

Feb 5, 2024

Apply as a Speaker, 2023 Legal Wrap-Up from Zyte and Dive Into Our ChatGPT Web Scraping Workshop Recap!🚀

Jan 18, 2024

New Year, New Learnings: Stay Ahead with Extract Data's Community Digest!

Jan 4, 2024

Dive into Web Scraping Wisdom: Weekly Events with Industry Experts 🚀

Dec 8, 2023

The EWDCI Certification: A Milestone in Ethical Web Data Collection

Nov 29, 2023

Explore topics