Skip to main content
The GCS connector exposes objects in a Google Cloud Storage bucket as datasets. Summation reads Parquet, CSV, JSON, JSONL, and TSV files directly from the bucket.

What you’ll need

Grant the service account access only to the bucket that Summation should read.

Form fields

FieldRequiredStored asNotes
Base PathYesConfigA gcs:// prefix that scopes the connection, e.g. gcs://my-bucket/path. gs:// paths are also accepted. Browsing starts from this prefix.
Service Account KeyYesSecretThe full service account JSON key, including the surrounding { and }.
HMAC Access IDOptionalSecretCloud Storage HMAC access ID. If set, HMAC Secret must also be set.
HMAC SecretOptionalSecretCloud Storage HMAC secret. If set, HMAC Access ID must also be set.

Grant read access

Grant the service account object viewer access to the bucket you want Summation to read.
PROJECT_ID=my-gcp-project-123
BUCKET_NAME=my-bucket
SERVICE_ACCOUNT=summation@$PROJECT_ID.iam.gserviceaccount.com

gcloud storage buckets add-iam-policy-binding "gs://$BUCKET_NAME" \
  --member="serviceAccount:$SERVICE_ACCOUNT" \
  --role="roles/storage.objectViewer"
Generate a JSON key for the service account and paste the full file contents into Service Account Key.
gcloud iam service-accounts keys create summation-gcs-key.json \
  --iam-account="$SERVICE_ACCOUNT"
The downloaded summation-gcs-key.json is a credential. Paste it into Summation, then delete the local copy. See Best practices for managing service account keys.

Adding datasets

Each dataset is a single file or folder in the bucket. Supported file formats are parquet, csv, json, jsonl, and tsv. Source references use the form:
gcs://bucket/path/to/data/

Common problems

Error or symptomLikely cause
GCS base path is requiredBase Path is blank or doesn’t include a bucket.
Invalid GCS service account keyThe pasted JSON is incomplete or malformed. Paste the full key file contents.
GCS authentication failedThe service account key is invalid, disabled, or lacks bucket access.
GCS bucket does not existThe bucket name in Base Path is wrong, or the service account cannot see it.
HMAC validation errorOnly one HMAC field is filled. Provide both HMAC Access ID and HMAC Secret, or leave both blank.