Our APIs use computer vision, machine learning and natural language processing to help developers extract and understand objects from any Web page. We've determined that the entire Web can be classified into approximately 18 structural page types. From this basic understanding of common page layouts, Diffbot then uses computer vision, natural language processing and other machine learning algorithms to identify and extract the important items from within these pages. | Build powerful news applications with NewsMesh. Access real-time news, trending articles, and search across 10+ categories. |
The Article API extracts clean article text from news article web pages.;The Follow API lets you subscribe to changes on any web page.;The Frontpage API takes in a multifaceted “homepage” and returns individual page elements.;[Limited Alpha] The Page Classifier API takes any web link and automatically determines what type of page it is.;Accurate: We use state-of-the-art computer vision and NLP algorithms, have the largest collection of tagged pages, and update our model several times per week.;Easy: Pass in a URL and we'll do the rest. Stop spending time building custom scrapers and, even worse, maintaining them.;Stable: Diffbot is built and run by Web veterans in a multi-tiered environment with redundancy, monitoring and scalability built in. Our scale lets us operate the service more cheaply than running it yourself.;Open: We use open standards (schema.org) and allow for endless configurability via our customization tool. | api, news |
Statistics | |
Stacks 16 | Stacks 0 |
Followers 30 | Followers 1 |
Votes 0 | Votes 1 |
Integrations | |
| No integrations available | |
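
The "pass in a URL" workflow described for the Article API can be sketched roughly as follows. This is a minimal illustration, not Diffbot's official client: the endpoint `https://api.diffbot.com/v3/article` and the `token` and `url` query parameters are assumptions that do not appear on this page and should be checked against the current Diffbot documentation. The helper name `build_article_request` is hypothetical. The sketch only builds the request URL and makes no network call.

```python
from urllib.parse import urlencode

# Assumed endpoint (not stated on this page); verify against Diffbot's docs.
ARTICLE_ENDPOINT = "https://api.diffbot.com/v3/article"


def build_article_request(token: str, page_url: str) -> str:
    """Build the GET URL for an Article API call (hypothetical helper).

    `token` is your API token and `page_url` is the article to extract.
    No request is sent; the caller decides how to fetch the URL.
    """
    if not page_url.startswith(("http://", "https://")):
        raise ValueError("page_url must be an absolute http(s) URL")
    # urlencode percent-encodes the target URL so its own query string
    # cannot be confused with the API's parameters.
    query = urlencode({"token": token, "url": page_url})
    return f"{ARTICLE_ENDPOINT}?{query}"


if __name__ == "__main__":
    print(build_article_request("YOUR_TOKEN", "https://example.com/news/story?id=7"))
```

Fetching the resulting URL with any HTTP client would, under these assumptions, return a JSON description of the article. Percent-encoding the target URL matters because news links often carry their own query parameters.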

Elasticsearch is a distributed, RESTful search and analytics engine capable of storing data and searching it in near real time. Elasticsearch, Kibana, Beats and Logstash are the Elastic Stack (sometimes called the ELK Stack).

Our mission is to make you a search expert. Push data to our API to make it searchable in real time. Build your dream front end with one of our web or mobile UI libraries. Tune relevance and get analytics right from your dashboard.

It is an open-source, typo-tolerant search engine that delivers fast and relevant results out of the box. It has been built from scratch to offer a delightful, out-of-the-box search experience. From instant search to autosuggest to faceted search, it has you covered.

Amazon CloudSearch enables you to search large collections of data such as web pages, document files, forum posts, or product information. With a few clicks in the AWS Management Console, you can create a search domain, upload the data you want to make searchable to Amazon CloudSearch, and the search service automatically provisions the required technology resources and deploys a highly tuned search index.

Amazon Elasticsearch Service is a fully managed service that makes it easy for you to deploy, secure, and operate Elasticsearch at scale with zero downtime.

It is a full-text search engine written in C++ and a fork of Sphinx Search. It's designed to be simple to use, light and fast, while allowing advanced full-text searching. Connectivity is provided via a MySQL compatible protocol or HTTP, making it easy to integrate.

Azure Search makes it easy to add powerful and sophisticated search capabilities to your website or application. Quickly and easily tune search results and construct rich, fine-tuned ranking models to tie search results to business goals. Reliable throughput and storage provide fast search indexing and querying to support time-sensitive search scenarios.

Swiftype is the easiest way to add great search to your website or mobile application.

It is a powerful, fast, open-source search engine that is easy to use and deploy. Search and indexing are fully customizable, and it handles features like typo tolerance, filters, and synonyms.

It is the next-gen search & analytics engine built for logs. It is designed from the ground up to offer cost-efficiency and high reliability on large data sets. Its benefits are most apparent in multi-tenancy or multi-index settings.