Web Scrape {API}
Turn any webpage into clean Markdown with one API call.
Stop hand-crafting HTML parsers. Pass any URL and get back GitHub Flavored Markdown ready to feed into your LLM, RAG pipeline, or documentation system.
Control link preservation, image inclusion, and base64 handling with simple query parameters.
What You Get
Each request converts a live webpage into structured, LLM-ready Markdown.
- GitHub Flavored Markdown — Tables, headings, lists, and code blocks fully converted
- Configurable link & image handling — Keep or strip hyperlinks and image references per request
- Base64 image shortening — Prevent token bloat from inline image data in AI pipelines
- Automatic proxy escalation — Scrapes blocked and protected sites transparently
How It Works
- 01
Send a URL with your preferences
Specify includeLinks, includeImages, and shortenBase64Images as query params
- 02
Page is fetched
Proxy escalation handles any blocks automatically—no configuration needed
- 03
HTML converted to GFM
The full HTML document is parsed and converted to clean GitHub Flavored Markdown
- 04
Markdown returned instantly
Ready to pass directly to any LLM, vector store, or content pipeline
API Response
GET /v1/web/scrape/markdown?url=https://brand.dev&includeLinks=true{
"success": true,
"url": "https://brand.dev",
"markdown": "# Brand.dev — The Internet's Brand API\n\nAPI to personalize your product with logos, colors,\nand company info from any domain.\n\n## Features\n\n- **Company Logos** — Fetch high-res logos from any domain\n- **Brand Colors** — Extract full color palettes\n- **Company Data** — Address, socials, description and more\n..."
}Personalize at scale
Join 4,000+ businesses using Brand.dev to personalize their products.













