Setup guide

This is the end-to-end walkthrough, from a blank AWS account to a running Fargate task you can connect to from your game client, plus the optional Discord bot. Allow ~30 minutes the first time; most of that is waiting for terraform apply.

The submodule guide covers the alternative workflow of vendoring this repo inside a private parent that holds terraform.tfvars and state. Come back here afterwards for the per-step detail.

Prerequisites

On the machine that will run terraform apply and the management app:

Tool	Version	Notes
Node.js	20+	Enforced by both setup scripts and the Nest server boot.
npm	10+	Ships with Node 20.
Terraform	1.5+	Installed automatically by `setup.sh` (Debian/Ubuntu) or `setup.ps1` (Windows via winget).
AWS CLI	v2	Installed automatically by `setup.sh` (Linux) or `setup.ps1` (Windows via MSI).
Docker	24+	Only if you plan to run the app via `docker compose`.

On the AWS side you need:

An AWS account you control (pure personal use is fine).
A Route 53 hosted zone you already own — e.g. yourdomain.com. Terraform looks it up as a data source and will not create it for you. If you use an external registrar, delegate the zone's NS records to Route 53 before running Terraform or DNS updates will go nowhere.

1. Create and authorise an IAM user

In the AWS IAM console → Users → Create user, give it a name like game-server-deploy.
On the permissions step, choose Attach policies directly and skip through without selecting any managed policy. Create the user.
Open the new user → Permissions → Add permissions → Create inline policy → JSON. Paste the policy below, name it GameServerDeployAll, and save.
Security credentials → Create access key → Command Line Interface (CLI). Copy the Access Key ID and Secret Access Key. Treat the secret like a password — AWS will not show it again.

{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "GameServerDeploy",
      "Effect": "Allow",
      "Action": [
        "ecs:*",
        "elasticfilesystem:*",
        "ec2:*",
        "lambda:*",
        "logs:*",
        "cloudwatch:*",
        "events:*",
        "route53:*",
        "ce:*",
        "elasticloadbalancing:*",
        "acm:*",
        "dynamodb:*",
        "secretsmanager:*",
        "s3:*",
        "cloudfront:*"
      ],
      "Resource": "*"
    },
    {
      "Sid": "GameServerIAM",
      "Effect": "Allow",
      "Action": "iam:*",
      "Resource": [
        "arn:aws:iam::*:role/game-servers-*",
        "arn:aws:iam::*:policy/game-servers-*"
      ]
    }
  ]
}

Why one inline policy instead of stacking managed policies? AWS caps each user at 10 directly-attached managed policies, and this stack touches ~14 services. One inline policy also keeps the full blast radius visible in one place. Trade-off: you lose AWS's auto-maintenance of action lists, but since everything is {service}:* there is nothing to maintain.

iam:* is scoped to project-prefixed ARNs, not Resource: *, to avoid granting iam:PassRole on every role in the account. The game-servers-* prefix matches the default project_name. If you change project_name in terraform.tfvars, update the two ARN patterns in GameServerIAM to match.

Two permission areas used by Terraform are not covered by any AWS managed policy and are explicitly included above to avoid AccessDenied during terraform apply:

EventBridge tag operations — the AWS provider tags EventBridge rules on creation, which requires events:TagResource, events:UntagResource, and events:ListTagsForResource. events:* above already grants these — if you tighten the policy later, keep those three actions in.
CloudFront — the Discord interactions endpoint is fronted by a CloudFront distribution. cloudfront:* above covers creation, updates, tagging, and deletion of distributions.

This policy is the single source of truth for IAM permissions. If you need to add or remove permissions, edit it here — do not create separate inline policies or update the README independently.

2. Configure the AWS CLI

aws configure
#   AWS Access Key ID:     AKIA...
#   AWS Secret Access Key: ****
#   Default region name:   us-east-1          # must match terraform.tfvars
#   Default output format: json

aws sts get-caller-identity                   # verify

Both Terraform and the management app read ~/.aws/credentials and ~/.aws/config automatically. If you prefer environment variables, export AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, and AWS_DEFAULT_REGION instead — the management app will pick them up too.

3. Clone and bootstrap

Linux / macOS:

git clone https://github.com/codercoco/game-server-deploy.git
cd game-server-deploy
chmod +x setup.sh
./setup.sh

Windows (PowerShell 5.1+):

git clone https://github.com/codercoco/game-server-deploy.git
cd game-server-deploy
.\setup.ps1

If PowerShell blocks the script with an execution-policy error, run Set-ExecutionPolicy RemoteSigned -Scope CurrentUser once, then retry.

Both scripts are idempotent — safe to re-run at any time. They:

Checks for Node 20+, and installs Terraform and the AWS CLI if missing (setup.sh uses apt on Debian/Ubuntu; setup.ps1 uses winget + the AWS MSI installer on Windows; macOS users should install those tools manually first).
Runs npm ci from app/ so all workspaces are installed.
Runs npm run build:lambdas to produce app/packages/lambda/*/dist/handler.cjs — Terraform's archive_file data sources zip these at apply time, so the bundles must exist on disk before terraform apply or init will fail.
Copies terraform/terraform.tfvars.example to terraform/terraform.tfvars if the latter doesn't exist yet.
Creates the S3 state bucket ({project_name}-tf-state) and DynamoDB lock table ({project_name}-tf-locks) if they don't already exist. The bucket gets versioning, public-access blocking, and AES-256 encryption enabled. The script waits for the DynamoDB table to reach ACTIVE status before continuing. Both names are derived from project_name in terraform.tfvars (default: game-servers). This step requires the s3:* permissions in the inline policy above.
Runs terraform init inside terraform/, passing the bucket and table as -backend-config flags. If a local terraform.tfstate is present (migrating from a previous local-backend setup), it automatically migrates state to S3 without prompting.

4. Configure your servers

Open terraform/terraform.tfvars in your editor and fill in:

aws_region       = "us-east-1"
project_name     = "game-servers"
hosted_zone_name = "yourdomain.com"    # must already exist in Route 53

# Watchdog knobs (defaults shown)
watchdog_interval_minutes = 15
watchdog_idle_checks      = 4          # 15 × 4 = 60 min grace before auto-stop
watchdog_min_packets      = 100

# One entry per game. Everything downstream iterates over this map.
game_servers = {
  palworld = {
    image  = "thijsvanloef/palworld-server-docker:latest"
    cpu    = 2048
    memory = 8192
    ports = [
      { container = 8211,  protocol = "udp" },
      { container = 27015, protocol = "udp" },
    ]
    environment = [
      { name = "PLAYERS",        value = "8" },
      { name = "SERVER_NAME",    value = "My Palworld Server" },
      { name = "ADMIN_PASSWORD", value = "CHANGE_ME" },
    ]
    volumes = [
      { name = "saves", container_path = "/palworld" },
    ]
    https = false
    # Optional: Discord message shown when the server reaches RUNNING.
    # Supports {host}, {ip}, {port} (first port), and {game} placeholders.
    # connect_message = "connect in game at {host}:{port}"
  }
}

Rules worth knowing before you save:

volumes is a list of EFS mount points for the game. Each entry creates a dedicated EFS access point rooted at /${game}/${name} and mounts it at container_path inside the container. Most games need one entry; add more if the image expects multiple distinct paths. All access points use UID/GID 1000 ownership — game images that run as a different UID will fail to mount.
file_seeds (optional) pre-populates files on the EFS volume during terraform apply. Each seed needs an in-container path and either content (UTF-8 text) or content_base64 (binary, e.g. mod .pak files — encode with base64 -w0 MyMod.pak). An optional mode sets the file permissions (default "0644"). The seeder runs once per unique seed content and is a no-op on re-apply when nothing changes. Removed entries are not deleted from EFS. Do not put secrets in file_seeds — content is stored in Terraform state.
```
file_seeds = [
  {
    path    = "/palworld/Pal/Saved/Config/LinuxServer/PalWorldSettings.ini"
    content = <<-INI
      [/Script/Pal.PalGameWorldSettings]
      OptionSettings=(Difficulty=None,DayTimeSpeedRate=1.0,NightTimeSpeedRate=1.0)
    INI
  },
  {
    path           = "/palworld/Pal/Content/Paks/MyMod.pak"
    content_base64 = "UEsDBBQAAAAI..."  # base64 -w0 MyMod.pak
  },
]
```
https = true routes the game through an ALB + ACM + Route 53 ALIAS. Only set it on games that actually serve HTTP(S); UDP games (most game servers) must stay false. The ALB is only created if at least one game has https = true.
CPU / memory must be a valid Fargate pair (see the Fargate task size table).
Do not write aws_route53_record resources — the update-dns Lambda owns that.

Optionally seed Discord credentials here too. If you leave them out, you can paste them into the dashboard later:

discord_application_id = "123456789012345678"
discord_bot_token      = "xxxx.yyyy.zzzz"          # sensitive
discord_public_key     = "abcd...ef01"             # sensitive

terraform.tfvars is gitignored, so these stay on your machine. Rotation after the first apply takes one terraform taint; see the submodule guide for the pattern that puts this file in a private parent repo.

5. Apply the infrastructure

cd terraform
terraform plan
terraform apply

apply takes 5–10 minutes end-to-end. It creates the VPC, two public subnets, an ECS cluster, one task definition + EFS access point + CloudWatch log group per game, the four Lambdas, a DynamoDB table, two Secrets Manager secrets, the EventBridge rule + schedule, and (if any game has https = true) an ALB with an ACM certificate.

When it finishes, note two outputs:

interactions_invoke_url — the Lambda Function URL you'll paste into the Discord Developer Portal for the bot.
ecs_cluster_name / game_names — used by the dashboard (it reads terraform.tfstate directly, so you normally don't need to copy these by hand).

6. Run the management app

Pick one.

API token

The dashboard API is gated behind a bearer token; /api/* requests without a matching Authorization: Bearer … header return 401. There are two ways to configure the value, in priority order:

API_TOKEN environment variable — takes precedence over server_config.json when set, including when set to empty. An empty value is normalized to "no token configured" and prevents the config file from being consulted, but it is not a supported way to disable auth — NODE_ENV=production startup fails when neither source supplies a non-empty token.
api_token field in app/server_config.json — the persisted file bind-mounted by docker-compose.yml. Used when API_TOKEN is absent. Edit the file directly; the dashboard's /api/config endpoint only manages watchdog settings and does not write the token.

Generate a fresh token with openssl rand -hex 32. The dashboard prompts for it on first load (and any time the server returns 401); paste the value and click Save. It is stored in your browser's localStorage under the key apiToken — clear browser data to revoke client-side, or rotate the value in API_TOKEN / server_config.json to invalidate every browser at once.

In dev mode (NODE_ENV unset) the server logs a warning and allows unauthenticated requests when no token is configured — convenient for local iteration, not safe to expose.

Option A — dev mode

cd app
npm run dev

Serves the Nest API on :3001 and the Vite dev server on :5173 (with /api proxied to :3001). Open http://localhost:5173. In dev mode, if no API_TOKEN is configured the app logs a warning and allows unauthenticated requests — fine for local iteration, not safe to expose.

Option B — Docker (production-equivalent)

# First run only: ensure the persisted config file exists on the host so
# Compose can bind-mount it.  Without this the bind will error.
touch app/server_config.json

# REQUIRED: the app refuses to start in production without a bearer token.
export API_TOKEN="$(openssl rand -hex 32)"

docker compose up --build

Opens on http://localhost:5000. The dashboard will prompt you for the token on first load; paste the value of $API_TOKEN and click Save.

docker-compose.yml bind-mounts ./terraform read-only (for terraform.tfstate), ./app/server_config.json (persisted watchdog config), and ~/.aws (credentials). If you prefer AWS_ACCESS_KEY_ID/AWS_SECRET_ACCESS_KEY env vars, uncomment the corresponding block in docker-compose.yml.

7. (Optional) Wire up the Discord bot

The serverless bot is two Lambdas, one DynamoDB table, and two Secrets Manager secrets — all created by terraform apply in step 5. You now connect it to a Discord application.

Create a Discord application at discord.com/developers/applications → New Application → add a Bot. Copy three values from General Information:

Value	Where it goes	Used for
Application ID (Client ID)	DynamoDB `CONFIG#discord` row	Needed when the server registers slash commands for a guild. Public, not a secret.
Bot Token	Secrets Manager `${project_name}/discord/bot-token`	`Authorization: Bot <token>` for the REST call that registers commands. Treat like a password.
Application Public Key	Secrets Manager `${project_name}/discord/public-key`	The interactions Lambda verifies every incoming interaction against this Ed25519 key.

You do not need any Privileged Gateway Intents — HTTP interactions deliver the invoker's role IDs directly in the request body.

Seed the credentials. Either:
- Set discord_application_id, discord_bot_token, and discord_public_key in terraform.tfvars and re-apply. Terraform writes them once and then ignore_changes lets the dashboard edit them without being overwritten on subsequent applies. To rotate via tfvars later, terraform taint the relevant resource first.
- Or leave them empty and open the Credentials tab in the dashboard; paste and Save. The dashboard writes directly to DynamoDB and Secrets Manager.
Optionally set a base allowlist and admins in terraform.tfvars. These are written to a separate BASE#discord DynamoDB row on every terraform apply and cannot be removed via the dashboard UI — only a tfvars edit + re-apply can change them. Useful for locking in your own guild and user ID before handing the dashboard to others:
```
base_allowed_guilds = ["123456789012345678"]
base_admin_user_ids = ["987654321098765432"]
base_admin_role_ids = []
```
When discord_bot_token, discord_application_id, and at least one entry in base_allowed_guilds are all set, terraform apply also registers the slash commands in each base guild automatically — no manual "Register commands" click needed for those guilds.
Copy the interactions endpoint URL (the interactions_invoke_url Terraform output, also shown in the dashboard Credentials tab) into the Discord Developer Portal under General Information → Interactions Endpoint URL → Save. Discord sends a PING on save; the Lambda replies PONG and Discord accepts the URL.
Invite the bot to your server. In the Developer Portal:
- Installation → Installation Contexts: enable Guild Install, disable User Install.
- OAuth2 → URL Generator: tick scopes bot and applications.commands; under Bot Permissions, tick Send Messages and Use Slash Commands (Discord's UI name for the USE_APPLICATION_COMMANDS permission).
- Open the generated URL and add the bot to your server.
Enable Developer Mode in Discord (User Settings → Advanced → Developer Mode) so you can right-click servers/users/roles and Copy ID.
In the dashboard's Discord Bot panel:
- Guilds tab: guilds in base_allowed_guilds have their slash commands registered automatically by terraform apply (provided the bot token and application ID were set in tfvars). For any guild added via the UI, click Register commands to install /server-start, /server-stop, /server-status, /server-list. This is always a per-guild REST call; there are no global commands.
- Admins tab: user IDs and/or role IDs that can run everything on everything.
- Per-Game Permissions tab: for each game, which users/roles can invoke which subset of start / stop / status.

The user guide has the day-to-day command reference; the interactions/followup Lambda docs have the wire-level detail.

8. Smoke test

With infra applied, the app running, and (optionally) a Discord guild configured:

Open the dashboard → the game you configured should appear as stopped.
Click Start. Watch the card transition through PROVISIONING → PENDING → RUNNING. DNS is updated by the update-dns Lambda as soon as the task reaches RUNNING.
dig {game}.yourdomain.com should return the task's public IP within dns_ttl seconds (default 30). Connect your game client.
Click Stop, or type /server-stop {game} in Discord, or do nothing for watchdog_interval_minutes × watchdog_idle_checks minutes — any of the three stops the task and removes the DNS record.

9. Tear it down

Stop every server from the dashboard first (so the DNS updater gets a clean STOPPED event and removes records), then:

cd terraform
terraform destroy

The two Secrets Manager secrets use recovery_window_in_days = 0, so they are deleted immediately — you can terraform apply again tomorrow without hitting "already scheduled for deletion".

Troubleshooting

Symptom	Likely cause	Fix
`terraform apply` fails with "data source not found for zone"	`hosted_zone_name` doesn't exist in Route 53	Create the hosted zone first (or delegate your registrar's NS records).
`archive_file` fails during `terraform apply`	You didn't run `npm run build:lambdas`	`cd app && npm run build:lambdas`, then re-apply.
EFS seeder Lambda times out or returns `EFS mount failed`	Mount targets not ready or security group misconfigured	Ensure `terraform apply` completed fully (mount targets take ~30 s); check the seeder Lambda's CloudWatch log group `/aws/lambda/${project_name}-efs-seeder-{game}`.
`file_seeds` path error: "does not start with container_path"	Seed path doesn't share the first volume's `container_path` prefix	Check that `path` begins with `volumes[0].container_path` (e.g. `/palworld/…`).
App refuses to start under `NODE_ENV=production`	No bearer token configured	`export API_TOKEN=$(openssl rand -hex 32)` or set `api_token` in `app/server_config.json`.
Dashboard says terraform not applied in the Discord panel	`interactions_invoke_url` output missing	Re-run `cd app && npm run build:lambdas && cd ../terraform && terraform apply`.
Dashboard says awaiting credentials	Secrets still contain the Terraform `"placeholder"` seed	Paste the real bot token + public key in the Credentials tab and Save.
Discord rejects the interactions URL with "invalid interactions endpoint URL"	Public key in Secrets Manager doesn't match Discord's	Re-copy the Application Public Key from the Developer Portal and Save.
`/server-*` slash commands don't appear in Discord	Per-guild registration not done	For base guilds: ensure `discord_bot_token`, `discord_application_id`, and `base_allowed_guilds` are all set in tfvars, then re-run `terraform apply`. For UI-added guilds: Guilds tab → Register commands next to the guild ID.
`/server-start` says "You don't have permission"	Your user/role isn't in admins or per-game permissions, or the `start` action isn't ticked	Admins tab or Per-Game Permissions tab, then retry.
Task reaches RUNNING but DNS never updates	update-dns Lambda errored; EventBridge rule might be disabled	Check the Lambda's CloudWatch logs; verify the EventBridge rule is enabled.
Watchdog stops tasks too aggressively	Low `watchdog_min_packets`, short `watchdog_interval_minutes`, or low `watchdog_idle_checks`	Tune the three knobs via the dashboard Server Config panel and re-apply.

Prerequisites​

1. Create and authorise an IAM user​

2. Configure the AWS CLI​

3. Clone and bootstrap​

4. Configure your servers​

5. Apply the infrastructure​

6. Run the management app​

API token​

Option A — dev mode​

Option B — Docker (production-equivalent)​

7. (Optional) Wire up the Discord bot​

8. Smoke test​

9. Tear it down​

Troubleshooting​