The following sections describe how to use data from Google BigQuery with the BigQuery data source:
- Overview
- Configure BigQuery in Google Cloud
- Add the BigQuery data source to your AppSheet app
- Add a BigQuery table to your AppSheet app
Note: You can use data from Google BigQuery with the BigQuery data source (as described in this topic) or with Connected Sheets. To compare the two options, see Use data from Google BigQuery with AppSheet.
Overview
The BigQuery data source provides app creators with read-write access to BigQuery datasets.
Watch the following video for an introduction to using data from BigQuery with the BigQuery data source.
(Note: Since the date this video was created, the BigQuery data source now provides app creators with read-write access to BiqQuery datasets. The video incorrectly states access is read-only.)
Connecting AppSheet to BigQuery
Licensing requirements for a BigQuery data source
BigQuery is a Google Cloud service that many enterprise customers use for data reporting. Because the volume of data that must be processed can be significant, BigQuery is classified as an Advanced Data Connector and requires an Enterprise-level plan.
The following AppSheet subscription plans include access to a BigQuery data source:
- AppSheet Enterprise Standard
- AppSheet Enterprise Plus
Limits when using a BigQuery data source
Because BigQuery has the potential to have massive amounts of data (potentially tens of millions of rows or more and up to 10,000 columns), we have put some product limits in place to ensure your apps will be able to load data and perform well.
AppSheet enforces the following limits for a BigQuery data source:
- Maximum of 100,000 rows - BigQuery datasets exceeding the maximum number of rows will be truncated.
- Partitioned tables are not supported - Any BigQuery tables that have partitioned columns will report an error during configuration in AppSheet. Use a BigQuery View that omits the partitioned column to work around this limitation.
- Google Cloud access required - Service accounts are the only access method AppSheet supports. You will need to have access to create service accounts and keys in your Google Cloud project, or you will need to request assistance from an IT administrator with the appropriate level of access. The IT administrator can create the data source as a team data source and share it with you.
In addition, there are complex types that are not supported for writing to BiqQuery. These types are Array
, Bytes
, Geography
, and Struct
.
The next section provides best practices for setting up your app to work within the product limits.
Best practices for setting up your app to use a BigQuery data source
1. Use security filters.
Include AppSheet expressions in the security filters, which will be translated to BigQuery SQL and executed on the server before the data is downloaded to the app.
Security filters are pushed to the SQL Provider only for AND()
, IN()
, NOT()
, and OR()
expressions. Other expressions aren't supported for Provider-level querying before the row limit is applied. For BigQuery, the row limit of 100K is applied on that Provider query, so you can get too few rows if you use a security filter that isn't purely composed of these expression.
Examples:
- Filter by
USEREMAIL()
to get rows owned or viewable by the currently logged-in app user. - Filter by
DateTime
values in the last week or month to get only the most recent data. For example, assuming there is a column in the dataset calledDateCreated
, you could use security filter expressions like:- For the last week:
[DateCreated] >= TODAY() - 7
- For the last month:
[DateCreated] >= TODAY() - 30
- For the last week:
-
Use a combination of user settings and security filters to allow your app user to limit the data. For example, using the user setting column named
Year
and a column in the dataset calledStartTime
, you can get data from the year 2018 or later (assuming the user settingYear
is set to2018
), as follows:[StartTime] >= DATETIME("1/1/" & USERSETTINGS("YEAR"))
2. Use BigQuery Views (read only)
You can create specific views for AppSheet in BigQuery that have potentially complex SQL queries to create a limited number of rows from the dataset. When setting-up the BigQuery data source as a table in your app, select from the list of views in your project as well as from the base datasets.
3. Create a new BigQuery Table
BigQuery allows you to set up a copy of a table that can be re-created periodically based on a scheduled query. This is similar to a view, but it is a physical table that contains the subset of data. This can also be used to reduce the number of columns in order to simplify the table structure.
Configure BigQuery in Google Cloud
In order to connect AppSheet to BigQuery, a service account must be created in Google Cloud with the correct role to access BigQuery data. A service account is a special kind of account that is used to grant system-to-system level access, rather than grant access to individual users. AppSheet (a system) will use the service account user to access BigQuery (a system) which will establish the system-to-system communications. Additional user-level access can be configured in AppSheet through security filters on the data source.
Required roles
Accessing a public dataset only requires the BigQuery Job User role.
Accessing a private dataset requires the BigQuery Job User and BigQuery Data Editor roles. These roles encompass all required permissions.
bigquery.datasets.get
bigquery.jobs.create
bigquery.routines.get
bigquery.routines.list
bigquery.tables.get
bigquery.tables.getData
bigquery.tables.list
resourcemanager.projects.get
Create a service account
The following steps describe how to create a service account using the Google Cloud console.
-
Go to the Google Cloud console.
-
In the navigation menu, select IAM & Admin > Service Accounts to access the Service accounts page.
-
Click + Create Service Account.
-
Create service account with the BigQuery Job User role
-
Enter the a Service account name and Service account description and click Create.
-
Type in BigQuery Job in the text box to filter to the list of roles and select BigQuery Job User.
- If connecting to a private dataset, repeat this step to add the BigQuery Data Editor role. Alternatively, if you created a custom role, add that role instead of the BigQuery Data Editor role.
-
Click Done.
-
- Alternatively, see the Create a service account in the IAM documentation for instructions on how to create service accounts using the gcloud command-line interface.
- See the BigQuery documentation for additional details about the BigQuery Job User and BigQuery Data Editor roles
- Setting the BigQuery Data Editor role as described above gives the service account access to all datasets in the project. If more restrictive permissions are required, the BigQuery Data Editor role can be set at the dataset or table level instead. See Grant access to a resource in the BigQuery documentation.
Create a JSON key for the service account
-
Click the More menu (three dots) in the Actions column of the new service account you created, and select Manage keys.
-
Click Add Key and select Create new key.
-
Accept the default selection of JSON and click Create.
-
Note the file name. You will find this in your browser’s download folder. Depending on your browser type, you may see the downloaded file in the bottom bar of your browser window. Click Close.
-
Open the file with your favorite file editor on your desktop so that you can cut and paste contents when Add a BigQuery data source to your AppSheet app.
Add the BigQuery data source to an AppSheet app
- Sign in to AppSheet.
- Go to the My Account > Sources.
- Click + New Data Source.
- Enter the data source name to use for your new data source in the text box, such as BigQuery-NYC-Citibikes.
- Select Cloud Database.
-
Choose BigQuery from the drop-down list of available database types.
-
Next, you need to obtain the following three values from the Google Cloud Console and the key file you downloaded previously: the BigQuery DataSet ID, the Google Cloud Project ID, and the Service Account key
-
To obtain the BigQueryDataSet ID, go to the Google Cloud Console and select BigQuery under the Big Data section in the navigation menu. You may need to scroll down to find it.
-
Find the BigQuery dataset that you’d like to use. We’ll use a public dataset called bigquery-public-data:new_york_citibike in our demo, but you should select the dataset that you would like to use in your app. If you don’t see the dataset you want, click + Add Data and add it.
-
Open the dataset by clicking on the More menu (three vertical dots) to the right of the dataset name. Make sure you are at the top level. That is, the dataset level, indicated by this icon:
Not on a table level, indicated by this icon:
-
Copy the Dataset ID so that you can paste it into AppSheet.
-
Paste the Dataset ID into the DataSet ID field in AppSheet's Add database connection information window.
Notes:- The Dataset ID has the following format:
<project_id>.<dataset_id>
. You must replace the . (period) with a : (colon). The resulting format will be this:<project_id>:<dataset_id>
- The
<project_id>
component is optional if the dataset is in the same project as the service account. You must always include the<project_id>
component when connecting to a public dataset. This is because public datasets are stored in thebigquery-public-data
project, which is different from the project where you created your service account
For example:
- The Dataset ID has the following format:
-
To obtain the Google Cloud Project ID, go back to the Google Cloud Console. Click on the project name in the top navigation bar, and then copy the Project ID in the Select a project dialog so that you can paste it into AppSheet.
-
Paste the Project Id into the Google Cloud Project ID field in AppSheet's Add database connection information window.
-
To obtain the Service Account key, go to the JSON file that was downloaded when you created the key on the Service Account in Google Cloud Console (IAM & Admin). Find the file that was downloaded (it will be in your browser’s download directory) and open it with any text editor (if it is not already open). Copy the entire contents of the file.
-
Paste the contents of the file into the Service Account key field in AppSheet's Add database connection information window.
-
Click Test and then Authorize Access.
Add a BigQuery table to your AppSheet app
- In AppSheet, go to Data and click + in the top header of the Data panel.
We've made some improvements to the app editor.
You are opted in to the new editor by default, but you can switch back to the legacy editor at any time.If you are using the legacy navigation
Go to Data > Tables and click + New Table. - Select the data store you created (like big query NYC citibikes).
- Select Tables or Views. (In this demo, we select Tables.) Then, select the table that you want for your App. (In this demo, we select citibike_stations.)
- Mark the table as Read-Only to improve performance as the app will not try to recache as often. Click Add This Table.
- Now, your app will have access to this data as if it were any other table.
- Important: Don’t forget to add security filters to limit the number of rows that will be used with AppSheet.