STREAM

Explore billions of conversations with a single API.

 

Uncover unique audience insights from human data

Human data encompasses many things: the voice of your customers, the activities of your competitors, and the preferences of the marketplace. Discover audience insights from across multiple social networks to inform your product and marketing strategies.


Unified data format
Unified data format

Analyze across the world’s social networks, news and other data sources with a single API.

Historic and real-time access
Historic and real-time access

Access licensed and compliant historic and real-time data streams with our direct partnerships with data providers.

Smarter data
Smarter data

Focus on building valuable insights and leave data processing to us with our advanced data enrichment, filtering language and classification engine.

 

The world’s largest selection of human data sources

The power of the DataSift platform is its ability to deliver across a wide variety of Human Data sources. Below is a list of all the available sources for real-time and historic data access with a single API.

bitly

As the world’s most popular link-sharing platform, Bitly is used to share millions of new links every day, providing unparalleled insight into what the social world is paying attention to.

Access

Full fidelity access to real-time HTTP streaming connections with sub-second latency.

Enrichments

  • Links

Learn More 

Blogs

The Blog data source combines material from a wide variety of sites, ranging from well-known hosts such as Blogger with very large numbers of active users to small, single-user sites that run as blogs or incorporate a blog.

Access

Aggregated feeds provide broad coverage delivered as we receive it. Delivery times vary, 24 hours or less, many within minutes.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction
  • Gender

Learn More 

Boards

A data source that provides content from a variety of message boards around the world. We collect posts from the lower-volume message boards and bundle them up into a single feed to provide a broader coverage.

Access

Aggregated feeds provide broad coverage delivered as we receive it. Delivery times vary, 24 hours or less, many within minutes.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction
  • Gender

Learn More 

DailyMotion

DailyMotion.com is a major video-sharing site, attracting millions of unique monthly visitors and over a billion views worldwide. There are more than 30 localized versions.

Access

Aggregated feeds provide broad coverage delivered as we receive it. Delivery times vary, 24 hours or less, many within minutes.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction
  • Gender

Learn More 

Disqus

Disqus is a free commenting service that enables great online communities. As the web’s most popular discussion system, Disqus is used by millions of websites that cover pretty much any topic imaginable, including many of the web’s best-known news and blog sites.

Access

Full fidelity access to real-time HTTP streaming connections with sub-second latency.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction

Learn More 

Facebook

Facebook helps its members connect and share with the people in their lives. It allows users to like, share, and comment on pictures, videos, websites, articles, and more. Facebook currently has over a billion monthly active users and hundreds of millions of daily active users.

Access

Managed APIs accessed with user credentials supply full access (subject to API limitations) at near real-time. No data licensing fees apply to managed sources.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction
  • Gender

IMDb

The Internet Movie Database (IMDb) has a wealth of detail about movies, TV, and news. With a huge collection of user reviews, this is a primary source for movie related opinions.

Access

Aggregated feeds provide broad coverage delivered as we receive it. Delivery times vary, 24 hours or less, many within minutes.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction
  • Gender

Learn More 

Intense Debate

IntenseDebate is a robust commenting platform which powers discussions on Wordpress, Blogger, Tumblr and other content management platforms. IntenseDebate powers millions of comments per day and is used by thousands of bloggers around the world.

Access

Full fidelity access to real-time HTTP streaming connections with sub-second latency.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction
  • Gender

Learn More 

LexisNexis

The single, most powerful, global news and business information service. LexisNexis provides the breadth and depth of information to put your social insight in context.

Access

Aggregated feeds provide broad coverage delivered as we receive it. Delivery times vary, 24 hours or less, many within minutes.

Enrichments

  • Language Detection
  • Sentiment
  • Links

Learn More 

NewsCred

NewsCred licenses, curates and syndicates full text news articles, images and videos from thousands of the world's highest-quality publishers, including leading financial and entertainment publications in a fully license-compliant way.

Access

Aggregated feeds provide broad coverage delivered as we receive it. Delivery times vary, 24 hours or less, many within minutes.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction

Learn More 

Reddit

A social news sharing site where users submit posts in the form of either a link or a text "self" post. Other users then vote the submission "up" or "down," which determines the rank of each post and prioritises its position accordingly.

Access

Aggregated feeds provide broad coverage delivered as we receive it. Delivery times vary, 24 hours or less, many within minutes.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction
  • Gender

Learn More 

Topix

Topix is the leading news community on the Web, connecting millions of people to the information and discussions that matter to them in every US town and city.

Access

Aggregated feeds provide broad coverage delivered as we receive it. Delivery times vary, 24 hours or less, many within minutes.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction
  • Gender

Learn More 

Tumblr

As one of the world’s fastest growing social network, Tumblr is a platform for people to share content they love. By providing a simple-to-use blog to share content, Tumblr has grown to a massive global audience.

Access

Full fidelity access to real-time HTTP streaming connections with sub-second latency.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction

Learn More 

Videos

There are many video hosting sites beside YouTube and, collectively, they hold a massive corpus of content. The Videos data source collects content from many of the lesser-known video hosting sites. You can use this data source in conjunction with the YouTube and DailyMotion sources for maximum coverage.

Access

Aggregated feeds provide broad coverage delivered as we receive it. Delivery times vary, 24 hours or less, many within minutes.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction
  • Gender

Learn More 

Wikipedia

This data feed monitors editorial changes at Wikipedia such as the creation of new pages and updates to existing ones. Major news events tend to increase attention and content changes to related Wikipedia pages, and we allow you to include topical and significant Wikipedia content into your streams.

Access

Aggregated feeds provide broad coverage delivered as we receive it. Delivery times vary, 24 hours or less, many within minutes.

Enrichments

  • Links
  • Gender

Learn More 

WordPress

WordPress is widely considered the world’s most popular content management system. Considering the breadth of consumer and B2B blogs and sites built on WordPress, it’s a key data source when measuring and understanding the web presence of your brand or industry.

Access

Full fidelity access to real-time HTTP streaming connections with sub-second latency.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction

Learn More 

YouTube

YouTube is the world's most popular video hosting site. This data source offers new YouTube content, including the title, duration, the username of the author, and a link to the video itself, plus comments on existing videos.

Access

Aggregated feeds provide broad coverage delivered as we receive it. Delivery times vary, 24 hours or less, many within minutes.

Enrichments

  • Language Detection
  • Links
  • Sentiment
  • Topic Detection
  • Entity Extraction
  • Gender

Learn More 

 

How can I use STREAM?

STREAM for Human Data use cases are essentially limitless, ranging from brand management to operational improvements.


Share of voice
BRAND HEALTH

Analyze what customers are thinking and saying in real-time about your company, competitors and industry.

Audience research
CUSTOMER EXPERIENCE

Measure satisfaction, track churn risks, fix service problems and make things right with unhappy customers.

Market research
CONTENT OPTIMIZATION

Target with compelling, personalized content with insights into audiences and behavioral traits.

Campaign effectiveness
PRODUCT INNOVATION

Uncover what consumers are raving about, ranting about and wishing and inform ideas.

 
 

STREAM offers an array of features to help you achieve actionable insight beyond simple data aggregation.



NORMALIZATION

niversal schema for data from different unstructured human data sources, allowing developers to search across different sources with a single body of code.



ENRICHMENT

40+ additional fields of metadata real-time enrichment from sentiment analysis to language and gender detection.



PRECISION FILTERING

Precision filtering language to filter against data and metadata to extract high quality insight from vast volumes of Human Data.



CATEGORIZATION

Advanced classification engine to deliver tagged and structured data for use based on industry insight or end client requirements.



DELIVERY

Robust and secure data connectors to a variety of data destinations from ad hoc streams for analysis to building powerful apps using our PUSH API.

 

Our data destinations

Pull Connector

The Pull connector enables you to retrieve your output at your own pace through a REST API. It’s a great option for on-premise integrations.

Learn More 

HTTP

The HTTP connector allows you to transfer data to any endpoint with Webhook requests at regular intervals.

Learn More 

Redis

Redis is an open source, in-memory, key-value data store. It supports lists of strings, a feature well suited to real-time social data.

Learn More 

MongoDB

MongoDB is an open source, NoSQL database. Utilizing dynamic schemas for greater flexibility and leveraged by organizations like eBay and Craiglist, it’s a popular choice for large-scale data management projects.

Learn More 

CouchDB

CouchDB is a NoSQL database which stores data in JSON format. Geared toward document storage, it supports multi-version concurrency control and comes with a web-based administration console.

Learn More 

Amazon Dynamo DB

Amazon DynamoDB is a fast, fully managed NoSQL database service. Available globally in all of Amazon Web Services availability zones, it reduces the administrative costs of maintaining a distributed database cluster.

Learn More 

Google Big Query

Google BigQuery is Google's proprietary analytics service in the cloud, specifically designed to provide interactive querying capability on massive amounts of data (petabytes and billions of rows). It is based on battle tested Dremel and makes use of Google's Data Center Infrastructure. Simply move your data into Google BigQuery and start querying that data using SQL-like queries or RESTful APIs. BigQuery supports realtime data ingestion and querying on BigData at scale.

Learn More 

Amazon S3

Amazon S3 (Simple Storage Service) is a scalable file system in the Amazon Web Services cloud. It’s easy to configure and allows users to organizes files into multiple buckets.

Learn More 

FTP

File Transfer Protocol (FTP) is a standard protocol for transmitting files across the Internet. DataSift allows you to receive your output via FTP using intervals and maximum file sizes you specify.

Learn More 

SFTP

Secure File Transfer Protocol (SFTP) allows you to receive your data via FTP with Secure Shell (SSH) protection for additional security.

Learn More 

ZoomData

ZoomData is a real-time analytics platform. It offers both a pre-configured visualization library and a studio for building customized visualizations.

Learn More 

ElasticSearch

ElasticSearch is a distributed, RESTful, real-time search server. It’s great for use cases that rely on scalable document search.

Learn More 

Splunk Enterprise

Splunk Enterprise is an operational intelligence platform. Integrate social data for analysis alongside your real-time transactional and customer experience metrics.

Learn More 

MySQL

The MySQL Push connectors allow you to send data to any MySQL database, the worlds most popular open source database, either on your own hardware or one of the database-as-a-service offerings such as Amazon Relational Database Service.

Learn More 

PostgreSQL

The PostgreSQL Push connectors allow you to send data to any PostgreSQL database, the powerful, open source object-relational database system, either on your own hardware or one of the database-as-a-service offerings such as Amazon Relational Database Service.

Learn More 

Get actionable insights from STREAM

Contact us