Volume Streams provide streaming realtime access to unfiltered data. The content delivered in Firehose streams is pre-defined, and is not based on rules or keywords defined by the customer. However, there are several different types of Volume streams that may be used by Gnip customers - see the comprehensive list below along with a brief description:

Decahose 

The Decahose delivers a 10% random sample of the realtime Twitter Firehose through a streaming connection. This is accomplished via a realtime sampling algorithm which randomly selects the data, while still allowing for the expected low-latency delivery of data as it is sent through the firehose by Twitter.

Below are some of the new features available with Decahose:

  • Enhanced URL enrichment: - provides additional metadata (page title and description)
  • Stream partitioning - 2 partitions, each containing 50% of volume of the Decahose stream
  • Enhanced reliability - geographic diversity of backend systems

Note: This data is delivered in bulk, and does not support additional filtering (e.g. for keywords).


Firehose 

The Twitter Firehose delivers 100% of Tweets in realtime through a streaming connection. Full firehose streams provide 100% of the publisher’s realtime firehose to your app, with no additional limitations.

Below are some of the new features available with Firehose:

  • Access to Gnip enrichments - URL expansion, Klout, and profile geodata
  • Stream partitioning - 20 partitions, each containing 5% of volume of the full Firehose
  • Enhanced reliability - geographic diversity of backend systems

Note: This data is delivered in bulk, and does not support additional filtering (e.g. for keywords).


User Mention 

User Mention Stream provides a realtime stream of every Tweet in the Twitter firehose that contains a “mention” of a Twitter user, such as @replies and retweets.

Below are some of the new features available with User Mention:

  • Enhanced URL enrichment: - provides additional metadata (page title and description)
  • Stream partitioning - 8 partitions, each containing 12.5% of volume of the User Mention stream
  • Enhanced reliability - geographic diversity of backend systems

Note: This data is delivered in bulk, and does not support additional filtering (e.g. for keywords).