How to Create Tableau Hyper Extract

Cody Schneider8 min read

Dragging your feet while a Tableau dashboard loads is a frustratingly common experience. If you’re working with large datasets or connecting to a slow database, you’ve probably spent more time staring at a spinning wheel than actually analyzing data. The key to speeding things up and unlocking instant interactivity is the Tableau Hyper extract.

GraphedGraphed

Build AI Agents for Marketing

Build virtual employees that run your go to market. Connect your data sources, deploy autonomous agents, and grow your company.

Watch Graphed demo video

This tutorial will walk you through exactly what a Hyper file is, why you should use it, and how to create and manage one step-by-step. Get ready to make your dashboards faster and your life easier.

What is a Tableau Hyper Extract?

Think of a Tableau Hyper extract (a file ending in .hyper) as a highly compressed snapshot of your data. Instead of querying your original data source (like a big SQL database, a Google Sheet, or Salesforce) every time you click a filter, Tableau queries this small, optimized, and localized file that lives right alongside your workbook.

This is different from a live connection, which queries the source database in real-time for every interaction. While live connections are great for a few specific use cases where up-to-the-second data is critical, they are often the source of slow-loading dashboards.

Tableau's Hyper technology is an in-memory data engine designed for fast data ingestion and analytical querying. It replaced the older TDE (Tableau Data Extract) format a few years ago, offering significant improvements in query speed, extract creation time, and the ability to handle much larger datasets. For you, this means your dashboards built on Hyper extracts will feel snappy and responsive, even with millions or billions of rows of data.

Free PDF · the crash course

AI Agents for Marketing Crash Course

Learn how to deploy AI marketing agents across your go-to-market — the best tools, prompts, and workflows to turn your data into autonomous execution without writing code.

Why and When You Should Use Hyper Extracts

Creating an extract might seem like an extra step, but the benefits are massive and can completely change your experience using Tableau.

  • Blazing-Fast Performance: This is the number one reason. When you filter, drill down, or interact with a chart, Tableau reads from the hyper-optimized local extract file instead of sending a query over a network to a potentially overloaded database. This translates to near-instantaneous load times, allowing for a much smoother flow of analysis.
  • Offline Analysis: Need to work on a dashboard while on a plane or at a coffee shop with spotty Wi-Fi? With a Hyper extract, a copy of your data is saved locally. You can build visualizations, analyze trends, and tweak your dashboard without needing any connection to the original data source.
  • Reduced Database Load: If you and your entire team are running complex queries on a live production database all day, it can slow down the critical business operations that rely on that database. Using extracts takes that analytical workload off the main system, keeping your database administrators (and your app users) happy.
  • Enhanced Data Source Capabilities: Some data sources have limited functionality. By pulling the data into a Hyper extract, you unlock Tableau’s full range of functions, including advanced calculations like COUNTD (Count Distinct) that might not be supported by the original source.

How to Create a Tableau Hyper Extract: A Step-by-Step Guide

Creating your first Hyper extract is surprisingly straightforward. Let's walk through the process inside Tableau Desktop.

Step 1: Connect to Your Data Source

Open Tableau and, from the Connect pane on the left, choose your data source. This could be a static file like an Excel spreadsheet or CSV, or a connection to a server like Microsoft SQL Server, Google Analytics, or Salesforce.

For this example, we’ll connect to a PostgreSQL database, but the process is nearly identical for any source.

Step 2: Choose Your Tables and Define Your Data Model

Once connected, you’ll be on the Data Source tab. Drag the tables you need onto the canvas to create your data model. You can set up your joins or relationships here just as you normally would.

Step 3: Select the "Extract" Connection Option

In the top-right corner of the Data Source tab, you’ll see two options under "Connection": Live and Extract. By default, Tableau might be set to Live.

Simply click the radio button to select Extract. You've just told Tableau that when it's time to build, you want it to pull the data into a high-performance .hyper file instead of maintaining a live connection.

GraphedGraphed

Build AI Agents for Marketing

Build virtual employees that run your go to market. Connect your data sources, deploy autonomous agents, and grow your company.

Watch Graphed demo video

Step 4: Configure Your Extract (Highly Recommended!)

Don’t just stop there! Directly next to the "Extract" option is an "Edit..." link. This is where you can apply filters and aggregations to make your extract even smaller and faster. Clicking it opens the "Edit Extract" dialog box.

Add Filters to Reduce Data Size

This is one of the most important best practices. If you don't need all the data from your source, filter it out before creating the extract. Click "Add..." to open the filters dialog.

A common example is filtering by date. If your database has 15 years of historical sales data, but your analysis only requires the last two years, create a filter on your date field to exclude everything older. This alone can dramatically reduce your extract size and improve performance.

Aggregate Data for Speed

The "Aggregate" option allows you to roll up your data to a higher level of granularity. For instance, if your data is recorded every second but you only analyze it at a daily level, you can aggregate the data to "Roll up dates to" Day and check the "Aggregate data for visible dimensions" box.

Warning: This is a powerful feature, but it’s destructive. Once aggregated, you cannot drill back down to the more granular level (e.g., hourly or by the minute) in your worksheet. Only use this if you are certain you won’t need the row-level detail in your analysis.

Step 5: Create and Save the Extract

After configuring your connection, navigate to a new worksheet (e.g., "Sheet 1"). Because you selected the "Extract" option, Tableau will immediately prompt you to save the .hyper file. Choose a memorable name and a location on your computer.

Tableau will then show a dialog box reading "Creating extract..." This may take a few seconds or several minutes, depending on the amount of data you're pulling. Once it's finished, you're ready to start building! Notice how much faster every drag-and-drop action feels.

Free PDF · the crash course

AI Agents for Marketing Crash Course

Learn how to deploy AI marketing agents across your go-to-market — the best tools, prompts, and workflows to turn your data into autonomous execution without writing code.

Managing and Refreshing Your Hyper Extracts

A static extract is great, but your data is always changing. Keeping your extract up-to-date is critical for making accurate, timely decisions. You have two main ways to do this.

Manual Refresh in Tableau Desktop

If you just need a quick, one-time update, you can do it right inside your workbook.

  1. In the Data pane, find the data source you want to refresh.
  2. Right-click on the data source name.
  3. Navigate to Extract > Refresh.

Tableau will reconnect to the original data source and rebuild the extract from scratch. It will honor any filters you set up during the initial creation.

Scheduled Refreshes (Tableau Server/Cloud)

For most business dashboards, you'll want to automate the refresh process. This is done using Tableau Server or Tableau Cloud (formerly Tableau Online).

  1. Publish Your Data Source: First, you must publish the data source (and workbook) to your Tableau Server/Cloud environment. When publishing, ensure you embed the database credentials if necessary so the server can connect on its own.
  2. Set a Refresh Schedule: Once published, you can navigate to the data source on the server, go to the "Extract Refreshes" tab, and set a schedule. You can have it refresh every hour, every day at 3 AM, or every Monday morning, ensuring the business is always looking at recent data.

Full vs. Incremental Refresh

When setting up a scheduled refresh, you have an important choice:

  • Full Refresh: This option deletes the existing extract and replaces it with all new data from the source. It’s simple and reliable.
  • Incremental Refresh: This is much more efficient for large, ever-growing datasets. Instead of replacing the whole extract, it only adds new rows based on a specified column (like an Order ID or a Timestamp). This can cut the refresh time from hours to just minutes. You'll need to specify a column in your table that Tableau can use to identify new rows.

Final Thoughts

Learning how to properly create and manage Tableau Hyper extracts is a fundamental skill that separates frustrating dashboard experiences from fluid, insightful ones. By transforming your slow live connection into a lightning-fast local snapshot, you not only improve performance but also reduce the load on your source systems and enable powerful offline analysis.

While mastering tools like Tableau is incredibly powerful, we know the learning curve can be steep. That’s why we built Graphed to simplify the entire process. Instead of managing data connections, extracts, and visualization types, you can simply connect your marketing and sales sources in a few clicks and ask for what you need in plain English. Graphed automatically handles the data heavy-lifting in the background to build real-time, interactive dashboards instantly, letting you get straight to the insights.

Related Articles