What happens when port 18789 is already in use?

Local admin/diagnostic HTTP may fail to bind or fall back to a degraded mode; health probes can return connection refused. Use lsof -nP -iTCP:18789 -sTCP:LISTEN to find the owning PID, stop stale Gateway instances or conflicting services, then start cleanly.

Deployment Guide 2026-03-31

2026 OpenClaw Multi-Channel Gateway Troubleshooting: `openclaw doctor`, Health Probes & Telegram/Discord Failures—`openclaw.json` Hot Reload, Port 18789 Conflicts & Remote Physical Mac Runbook (FAQ)

Q: Telegram works but Discord fails—what three config buckets should I check first?

Bot token and intent permissions, whether enterprise egress blocks Discord API hosts, and whether the gateway process can resolve discord.com. Use curl -I plus per-channel doctor checks to separate DNS/TLS issues from application logic.

If you run Telegram and Discord channels on a remote physical Mac and see “one works, one is dead” or flapping health checks, this guide pins failures to config, ports, or the network using openclaw doctor and probes. You get a symptom decision matrix, a seven-step reproducible runbook, three on-call numbers you can paste into SLO docs, and an FAQ. For long-term stability patterns, see Optimizing Mac mini Stability for Long-Term OpenClaw Operation in 2026 and 2026 OpenClaw Global Deployment Comparison Guide.

2026 OpenClaw multi-channel gateway troubleshooting and network diagnostics

1. Three recurring pain points (where multi-channel gateways fail)

1) Unclear hot-reload boundaries: You edit openclaw.json and expect instant effect, but webhook secrets, TLS cert paths, or bot tokens may still live in the old process—so logs keep showing stale errors even though the file “looks right.”

2) Port 18789 and duplicate instances: Local diagnostic/admin HTTP (this article uses 18789 as the common default; substitute your actual port) collides when another Gateway, an old launchd job, or an interactive session still listens. Probes then flip between connection refused and 503 bursts, which masquerades as upstream Telegram/Discord outages.

3) Asymmetric channel failures: Telegram often uses long polling while Discord mixes WebSocket and REST. If corporate egress, system proxy, and HTTP(S)_PROXY differ between launchd and your login shell, you get believable “one channel always times out, the other is fine” false positives.

Before rewriting channel configs, confirm the Gateway install matches the vendor layout—version skew is the fastest way to misread doctor output as a network fault.

2. Symptom → action decision matrix (classify before you reboot)

Cross-check openclaw doctor with probe output against the table below to avoid “restart three times, then ask why.”

What you see	Do this first	Likely root-cause bucket
doctor says local HTTP is down; curl to 127.0.0.1:18789 fails	Map listeners, kill stale processes, reinstall plist via install-daemon	Port conflict / not listening
Both channels red; doctor network section fails wholesale	Verify DNS, proxies, and outbound firewall in a non-interactive environment	Egress / proxy
Discord only fails; Telegram OK	Validate token + intents; TLS/SNI probe to discord.com from the same user as launchd	Credentials / permissions / API path
JSON change “partially applies”	Bucket fields into cold-start vs. hot-reload; rerun doctor to diff behavior	Fields not covered by reload

3. Seven-step remote physical Mac runbook (paste into your playbook)

Baseline snapshot: Run openclaw doctor, store the full transcript plus Gateway semver, and hash openclaw.json (for example with shasum) so rollback diffs are trivial.
Stabilize health probes: Hit the readiness URL at least ten times with a 3s gap—one green sample is not recovery. Align observations with launchd KeepAlive backoff windows.
18789 listen matrix: Run lsof -nP -iTCP:18789 -sTCP:LISTEN (swap the port if yours differs). If multiple OpenClaw-related PIDs appear, keep exactly one primary instance.
Per-channel smoke tests: Telegram: lightweight getMe-style call; Discord: shard/socket state in logs or a minimal REST HEAD. Never hide single-channel failure behind a single “global healthy” boolean.
Hot reload vs. cold start: Tag each JSON key as runtime-refreshable or restart-required; for the latter, prefer controlled restart over hammering SIGHUP.
Controlled restart: Stop via launchctl or the project’s install-daemon flow, confirm zero listeners, boot, then tail the first 200 lines of structured logs.
Close the incident: Within 30 minutes, rerun doctor, probes, and one real user message on each channel—three screenshots in the ticket are enough to close.

If you need a deeper map between launchd, install-daemon, and openclaw health, stack this runbook with your existing 7×24 Gateway daemon guide—the probe cadence and KeepAlive backoff language should match line for line.

4. Quotable thresholds & checklist (drop into SLO docs)

Probe interval: For gateway-class readiness, poll every 30–60 seconds; under ~15s you will false-positive on GC spikes.
Consecutive success rule: Require at least five back-to-back HTTP 2xx responses before declaring “healthy,” or you will chase flap.
Regression window: After any config change, complete end-to-end messages on both channels within 30 minutes—minimum bar to close an incident.

5. FAQ

After editing openclaw.json, do I always need to restart the Gateway?

Channel credentials, webhooks, and TLS-related fields usually need a full process restart. Some versions hot-reload non-listener toggles, but production should still run openclaw doctor and then execute a controlled restart to avoid half-loaded listeners.

What does port 18789 contention look like in practice?

Admin/diagnostic HTTP fails to bind or degrades; probes return connection refused. Use lsof, terminate stale Gateway jobs, and restart once the port is free.

Telegram works but Discord fails—what three config buckets should I check first?

Bot token and intents, corporate proxy allowlists for Discord API endpoints, and DNS resolution for discord.com from the same account launchd uses. Combine curl -I with per-channel doctor output to isolate TLS/DNS from app bugs.

6. Why Mac mini makes this gateway stack easier to operate

Multi-channel triage fails when environments diverge: your interactive shell reaches the internet while the launchd job lacks proxy or custom CA trust. macOS on Apple Silicon gives a stable, long-running Node footprint with unified memory—fewer mystery memory spikes during overnight traffic. Mac mini M4 idles around 4W, which is ideal for unattended closet/rack gateways, and Gatekeeper plus SIP plus FileVault shrinks the attack surface versus a typical Windows utility host.

If you are fanning out Telegram and Discord for a global user base, anchoring gateways on audited, region-pinned physical Mac pools removes single-region egress drama and the classic “reproduces on my laptop, not on the server” debate.

To run this runbook on hardware that stays quiet, cool, and predictable, Mac mini M4 is one of the best price-to-stability entry points—grab a ZoneMac node now and align doctor probes with production-grade macOS from day one.

Remote Mac Nodes

Need a physical Mac to run OpenClaw 24/7?

ZoneMac offers region-selectable Mac mini capacity for always-on gateways, CI, and compliance-friendly operations.

💡 On demand ⚡ Native macOS 🔒 Auditable