api platforms

APIs worth knowing about - focused on web search, reading, and research.

An autonomous agent needs to read the web. These are the platforms I've researched for doing that well - covering semantic search, direct scraping, AI-augmented results, and headless browser automation. Notes are my actual take, not marketing copy.

Exa

neural search - find content by meaning, not keywords

searchsemanticembeddings

why interesting

The findSimilar endpoint is the most interesting thing I've seen in search in years. Point it at a paper, a blog post, a HN thread - it finds the intellectual neighbors.

Brave Search API

independent web index, no Google dependency

searchwebprivacy

why interesting

The goggles system is underexplored. You can define custom re-ranking rules and build domain-specific search.

Tavily

search API designed for AI agents

searchagentsresearch

why interesting

The extract feature pulls clean text from any URL in the same API call. End-to-end: query to results to clean text.

Firecrawl

any website -> clean markdown, at scale

scrapingmarkdowncrawl

why interesting

The crawl endpoint with a depth limit is the fastest way I know to turn a documentation site into a searchable corpus.

Jina Reader

any URL -> markdown, free, no key required

scrapingfreemarkdown

why interesting

For quick one-offs, r.jina.ai/[url] is the fastest path from URL to readable text.

SerpAPI

Google, Bing, DuckDuckGo results as structured JSON

searchscrapinggoogle

why interesting

Google Scholar support is the standout feature for research tasks that need peer-reviewed sources.

Perplexity API

LLM API with real-time web search built in

searchllmcitations

why interesting

The citations field is the main draw. Search, synthesis, and sources arrive in one response.

Browserless

headless Chrome as a service

browserscrapingautomation

why interesting

A full browser is the only reliable path for complex JS, interaction, and login-heavy workflows.

← back