The S3 connector exposes objects in an AWS S3 bucket as datasets. Summation reads Parquet, CSV, and JSON files directly, so no warehouse is needed.

What you’ll need

Form fields

| Field | Required | Stored as | Notes |
| --- | --- | --- | --- |
| Region | Yes | Config | AWS region of the bucket, e.g. us-west-2. See Bucket regions. |
| Base Path | Optional | Config | An s3:// prefix to scope the connection to, e.g. s3://my-bucket/path. Browsing starts from this prefix. |
| Access Key ID | Yes | Secret | AWS access key ID. See Managing access keys for IAM users. |
| Secret Access Key | Yes | Secret | AWS secret access key matching the key ID. |

IAM policy

Minimum policy for read-only access to a single prefix. Replace my-bucket and the path/ prefix with your own. For more, see Bucket policy examples and Identity-based policies for Amazon S3.
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Effect": "Allow",
      "Action": ["s3:ListBucket"],
      "Resource": "arn:aws:s3:::my-bucket",
      "Condition": {
        "StringLike": { "s3:prefix": ["path/*"] }
      }
    },
    {
      "Effect": "Allow",
      "Action": ["s3:GetObject"],
      "Resource": "arn:aws:s3:::my-bucket/path/*"
    }
  ]
}
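
If you manage several prefixes, it can help to generate this policy rather than edit it by hand. The following is a minimal sketch, not part of Summation: the function name `build_read_only_policy` is hypothetical, and it assumes you always scope access to a non-empty prefix. It reproduces the two statements shown above for a given bucket and prefix.

```python
import json


def build_read_only_policy(bucket: str, prefix: str) -> dict:
    """Build the read-only policy shown above for one bucket and prefix.

    Hypothetical helper for illustration; assumes a non-empty prefix.
    """
    # Normalize "path", "/path", and "path/" to the same value.
    prefix = prefix.strip("/")
    if not bucket or not prefix:
        raise ValueError("bucket and prefix must both be non-empty")

    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # Listing is granted on the bucket ARN, limited by s3:prefix.
                "Effect": "Allow",
                "Action": ["s3:ListBucket"],
                "Resource": f"arn:aws:s3:::{bucket}",
                "Condition": {
                    "StringLike": {"s3:prefix": [f"{prefix}/*"]}
                },
            },
            {
                # Reading is granted on the object ARNs under the prefix.
                "Effect": "Allow",
                "Action": ["s3:GetObject"],
                "Resource": f"arn:aws:s3:::{bucket}/{prefix}/*",
            },
        ],
    }


if __name__ == "__main__":
    print(json.dumps(build_read_only_policy("my-bucket", "path"), indent=2))
```

Note that `s3:ListBucket` applies to the bucket resource while `s3:GetObject` applies to object resources, which is why the two statements use different ARNs.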

Adding datasets

Each dataset is a single file or a glob within the bucket. Summation infers the schema from the file format. Source references use the form:
s3://bucket/path/to/data/
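
To check a source reference before entering it, you can split it into bucket and key and preview which object keys a glob would select. This is a rough sketch under stated assumptions: `split_s3_uri` and `matching_keys` are hypothetical helpers, and the matching uses Python's `fnmatch` rules (where `*` also matches `/`), which may differ from the glob semantics Summation applies.

```python
from fnmatch import fnmatchcase


def split_s3_uri(uri: str) -> tuple[str, str]:
    """Split 's3://bucket/key' into (bucket, key). Hypothetical helper."""
    scheme = "s3://"
    if not uri.startswith(scheme):
        raise ValueError(f"not an s3:// reference: {uri!r}")
    bucket, _, key = uri[len(scheme):].partition("/")
    if not bucket:
        raise ValueError(f"missing bucket name: {uri!r}")
    return bucket, key


def matching_keys(pattern_uri: str, keys: list[str]) -> list[str]:
    """Return the keys that match the glob in pattern_uri.

    Assumption: fnmatch-style matching, which may not be what
    Summation uses internally.
    """
    _, pattern = split_s3_uri(pattern_uri)
    return [key for key in keys if fnmatchcase(key, pattern)]


if __name__ == "__main__":
    print(split_s3_uri("s3://bucket/path/to/data/"))
    sample_keys = ["path/a.parquet", "path/b.csv", "other/c.parquet"]
    print(matching_keys("s3://my-bucket/path/*.parquet", sample_keys))
```

The key list here is made up for the example; in practice it would come from listing the bucket under your Base Path.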

Common problems

| Error or symptom | Likely cause |
| --- | --- |
| AccessDenied | The IAM user/role lacks s3:GetObject on the key. Check both the IAM policy and the bucket policy. |
| NoSuchBucket | Wrong region. Region must match the bucket’s region exactly. |
| CSV columns inferred wrong | Re-check the file’s encoding and delimiter. Parquet is the most reliable format for repeatable analyses. |
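
For the CSV case, a quick local check of a file's first few lines can narrow down the cause. The sketch below is a heuristic, not Summation's inference logic: `inspect_csv_sample` is a hypothetical helper that tries UTF-8 first, falls back to Latin-1, and then asks Python's `csv.Sniffer` to guess the delimiter from a small set of candidates.

```python
import csv


def inspect_csv_sample(raw: bytes, candidates: str = ",;\t|") -> tuple[str, str]:
    """Guess (encoding, delimiter) from the first lines of a CSV file.

    Hypothetical helper; both guesses are heuristics and can be wrong.
    """
    try:
        # utf-8-sig also accepts plain UTF-8 and strips a leading BOM.
        text = raw.decode("utf-8-sig")
        encoding = "utf-8"
    except UnicodeDecodeError:
        # Latin-1 decodes any byte sequence, so treat it as a fallback
        # guess rather than a confirmed encoding.
        text = raw.decode("latin-1")
        encoding = "latin-1"

    # Raises csv.Error if no consistent delimiter is found.
    dialect = csv.Sniffer().sniff(text, delimiters=candidates)
    return encoding, dialect.delimiter


if __name__ == "__main__":
    sample = b"id;name;score\n1;ann;3.5\n2;bob;4.0\n"
    print(inspect_csv_sample(sample))
```

Pass whole lines rather than an arbitrary byte slice, since cutting a multi-byte character in half would trigger the Latin-1 fallback even for a valid UTF-8 file.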