Fetch-MCP: Playwright-Based MCP Server with Batch URL Fetching Support

Sulfide6416 · 2025-03-20T03:40:41 1742442041

Fetch-MCP is an MCP server built on Playwright, designed for efficient web page content fetching. It retrieves content from both static and dynamic websites by using Playwright's headless browser capabilities. Key features include `fetch_url` for single-page retrieval and `fetch_urls` for high-performance batch fetching of multiple URLs in parallel. Fetch-MCP extracts the main content of a page, supports Markdown conversion, and is easily configurable, making it a good fit for developers who need robust and scalable web scraping.
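
For readers wondering what fetching multiple URLs in parallel can look like, here is a minimal sketch in JavaScript. It is not Fetch-MCP's actual implementation: `fetchUrlsSketch`, its `fetchOne` callback, and the `limit` parameter are hypothetical names chosen for illustration. The idea is a small worker pool that keeps at most `limit` requests in flight and records per-URL failures instead of aborting the batch.

```javascript
// Hypothetical helper (not part of Fetch-MCP): run `fetchOne` over `urls`
// with at most `limit` requests in flight at any time.
async function fetchUrlsSketch(urls, fetchOne, limit = 4) {
  const results = new Array(urls.length);
  let next = 0; // index of the next URL that no worker has claimed yet

  // Each worker repeatedly claims the next unclaimed index until none remain.
  async function worker() {
    while (next < urls.length) {
      const i = next++;
      try {
        const content = await fetchOne(urls[i]);
        results[i] = { url: urls[i], ok: true, content };
      } catch (err) {
        // One failed page should not abort the whole batch.
        results[i] = { url: urls[i], ok: false, error: String(err) };
      }
    }
  }

  // Start up to `limit` workers and wait for all of them to drain the queue.
  const workerCount = Math.min(limit, urls.length);
  await Promise.all(Array.from({ length: workerCount }, () => worker()));
  return results; // same order as the input `urls`
}
```

In a browser-backed server, `fetchOne` would presumably open a page, navigate, and extract content; here it is left as a parameter so the concurrency logic stands alone.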

andrethegiant · 2025-03-20T04:58:55 1742446735

Check out https://pure.md for a REST API version of this

wejick · 2025-03-20T06:19:56 1742451596

Is there any example of how an agent can interact with MCP? I imagine it will replace or complement the Tools interface.

tuananh · 2025-03-20T06:25:12 1742451912

The transport can be either stdio or SSE.
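
To make the stdio case concrete, here is a rough sketch of the message an agent would write to the server's stdin to invoke a tool. MCP messages are JSON-RPC 2.0, and the stdio transport sends one JSON message per line. The `tools/call` method and the `name`/`arguments` params come from the MCP specification; the `url` argument name is an assumption about Fetch-MCP's `fetch_url` schema, and the helper names are made up for illustration. A real session would first perform the `initialize` handshake and could use `tools/list` to discover the available tools.

```javascript
// Build a JSON-RPC 2.0 request asking an MCP server to run a tool.
function buildToolCall(id, toolName, args) {
  return {
    jsonrpc: '2.0',
    id, // used to match the server's response to this request
    method: 'tools/call',
    params: { name: toolName, arguments: args },
  };
}

// The stdio transport is newline-delimited: one JSON message per line.
// JSON.stringify escapes newlines inside strings, so the frame stays on one line.
function frameForStdio(message) {
  return JSON.stringify(message) + '\n';
}

// ASSUMPTION: `url` is the argument name; check the server's tool schema.
const toolCall = buildToolCall(1, 'fetch_url', { url: 'https://example.com' });
const wireLine = frameForStdio(toolCall);
```

The agent would write `wireLine` to the server process's stdin and read the matching response (same `id`) from its stdout.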

tomjen3 · 2025-03-20T15:39:28 1742485168

Cool, but Playwright doesn’t use your cookies.

Increasingly I want to stop spending time on Twitter, but it’s also where the AI news drops first, and I can’t just scrape the data without being logged in.

If there were a way to have the AI go ahead and gather the data for me, that would be great.

omneity · 2025-03-21T03:03:12 1742526192

This is something I am building. Herd[0] gives you a Puppeteer-like API over your own browser, in effect allowing you to use your session seamlessly for automation and data extraction (and avoid bot detection as a nice side effect).

0: https://herd.garden

aschobel · 2025-03-22T18:17:56 1742667476

Playwright can actually use your existing browser cookies if you connect it through Chrome's debugging protocol. Launch Chrome with the flag:

--remote-debugging-port=9222

Then connect via CDP in Playwright like this:

import { chromium } from 'playwright';
const browser = await chromium.connectOverCDP('http://localhost:9222');
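
A slightly fuller sketch of that approach follows, under some assumptions: Chrome is already running with the debugging flag above, and the page's text is all you want. When connected over CDP, the browser's default context is available as `browser.contexts()[0]`, and that context carries the profile's cookies. The function name `fetchWithExistingSession` is hypothetical, and `chromium` is passed in as a parameter (it would be Playwright's `chromium` object) so the logic does not depend on how Playwright is imported.

```javascript
// Hypothetical helper: read a page's text using an already-running Chrome.
// `chromium` is expected to be Playwright's `chromium` browser type.
async function fetchWithExistingSession(chromium, url, endpoint = 'http://localhost:9222') {
  const browser = await chromium.connectOverCDP(endpoint);
  try {
    // Over CDP, the first context is the browser's default one (with its cookies).
    const context = browser.contexts()[0];
    if (!context) throw new Error('No default browser context found');

    const page = await context.newPage();
    try {
      await page.goto(url, { waitUntil: 'domcontentloaded' });
      return await page.locator('body').innerText();
    } finally {
      await page.close(); // close only the tab this function opened
    }
  } finally {
    // For a browser attached via connectOverCDP, this ends the connection.
    await browser.close();
  }
}
```

Called as `fetchWithExistingSession(chromium, 'https://example.com')`, it opens a new tab in the running browser, reads the body text, and closes the tab again.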

yonl · 2025-03-20T15:58:15 1742486295

I agree with this point as well.

Speaking of implementation, I don’t mind if a browser extension forwards cookies from my browser to the automation (privacy and security are an issue, of course, and I’d ideally want the cookies not to leave my device, but personally I’m okay with some trade-off).

dd36 · 2025-03-25T03:57:55 1742875075

Can’t you just have it log in?

chazeon · 2025-03-20T04:32:11 1742445131

What's MCP?

DogRunner · 2025-03-20T08:40:02 1742460002

A simple explanation can be seen here: https://www.youtube.com/watch?v=7j_NE6Pjv-E

pizza · 2025-03-20T04:40:55 1742445655

Model Context Protocol

dSebastien · 2025-03-20T04:53:40 1742446420

I shared some notes about it here. Well worth exploring right now: https://notes.dsebastien.net/30+Areas/33+Permanent+notes/33....

hi_hi · 2025-03-20T06:18:01 1742451481

Thanks for this. I'm not familiar with MCP, but having (briefly) read your link, it appears to enable a use case I've been expecting: a chat window that replaces the entire website experience (probably better suited to larger, enterprise-style websites) and provides tailored information about a company or product.

Would you know if it's possible to use this approach to constrain an LLM to a specific context of information? For example, on the Microsoft site, any question related to CRMs would be answered with information about Dynamics but never Salesforce.