Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
Our APIs use computer vision, machine learning and natural language processing to help developers extract and understand objects from any Web page. We've determined that the entire Web can be classified into approximately 18 structural page types. From this basic understanding of common page layouts, Diffbot then uses computer vision, natural language processing and other machine learning algorithms to identify and extract the important items from within these pages. | Sheetsee.js is a client-side library for connecting Google Spreadsheets to a website and visualizing the information in tables, maps and charts. |
The Article API is used to extract clean article text from news article web pages.;The Follow API allows you to subscribe to the changes of any web page.;The Frontpage API takes in a multifaceted “homepage” and returns individual page elements.;[Limited Alpha] The Page Classifier API takes any web link and automatically determines what type of page it is.;Accurate- We utilize state-of-the art computer vision and NLP algorithms; have the largest collection of tagged pages and update our model several times per week.;Easy- Pass in a URL and we'll do the rest. Stop spending time building custom scrapers and -- even worse -- maintaining them.;Stable- Diffbot is built and run by Web veterans in a multi-tiered environment with redundancy, monitoring and scalability built-in. Our scale lets us operate the service more cheaply than running it yourself.;Open- We use open standards (schema.org) and allow for endless configurability via our customization tool. | - |
Statistics | |
GitHub Stars - | GitHub Stars 3.1K |
GitHub Forks - | GitHub Forks 973 |
Stacks 16 | Stacks 3 |
Followers 30 | Followers 29 |
Votes 0 | Votes 0 |
Integrations | |

Working with Airtable is as fast and easy as editing a spreadsheet. But only Airtable is backed by the power of a full database, giving you rich features far beyond what a spreadsheet can offer.

Use spreadsheet as your database. Give data to your users the nice way, directly from the tool you know. Without bothering webdeveloper.

Power websites, apps, or whatever you like, all from a spreadsheet. Changes to your spreadsheet update your API in realtime.

Drag & drop your data, name your API and choose what data people can see - that's it. Documentation is created automatically.

The AI Writer for creating SEO-optimized content that ranks. Generate high-quality blog posts, articles, and web content in minutes with our advanced AI content generator. Start your free trial today.

Turn one content idea into platform-optimized posts for WordPress, LinkedIn, X, and Ghost. AI automatically adapts your message, tone, and format for each platform—no manual rewriting required.

Transform your rough notes, meetings, and ideas into structured, publish-ready cards with AI.

Grow organic traffic on auto-pilot with AI-powered SEO content. Get recommended by ChatGPT & rank on Google.

Create unlimited articles in one go by uploading a CSV of keywords. The system handles queue management, real-time progress tracking, automatic retries for failed articles, and multi-format exports—making large-scale content creation fast, stable, and hands-free.

Use any Google Sheets or Excel Online spreadsheet to power a fully-fledged API, no coding required.