docs(Sizing): rewrite as workload-driven guide by germangarces · Pull Request #7592 · Flagsmith/flagsmith

germangarces · 2026-05-25T13:30:00Z

Thanks for submitting a PR! Please check the boxes below:

I have read the Contributing Guide.
I have added information to docs/ if required so people know about the feature.
I have filled in the "Changes" section below.
I have filled in the "How did you test this code" section below.

Changes

Closes #7200

Rewrite Sizing and Scaling docs as a workload-driven guide

Mental model:

Sections 1–4: figure out your tier
Section 5: Day-1 cache setting
Sections 6–8: advanced tuning when something specific demands it
Sections 9–10: operate it
Sections 11–12: edges / specialty

How did you test this code?

N/A

Signed-off-by: germangarces <german.garces@flagsmith.com>

vercel · 2026-05-25T13:30:06Z

The latest updates on your projects. Learn more about Vercel for GitHub.

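| Project | Deployment | Actions          | Updated (UTC)       |
| ------- | ---------- | ---------------- | ------------------- |
| docs    | Ready      | Preview, Comment | May 25, 2026 1:50pm |

2 Skipped Deployments

| Project                    | Deployment | Actions | Updated (UTC)       |
| -------------------------- | ---------- | ------- | ------------------- |
| flagsmith-frontend-preview | Ignored    | Preview | May 25, 2026 1:50pm |
| flagsmith-frontend-staging | Ignored    | Preview | May 25, 2026 1:50pm |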

gemini-code-assist

Code Review

This pull request significantly expands the 'Sizing and Scaling' documentation for self-hosted Flagsmith, introducing workload-driven sizing patterns, worked examples, and specific infrastructure recommendations across four tiers. It also adds comprehensive sections on cache configuration, monitoring metrics, and a scaling decision tree. Feedback indicates that the removal of specific environment variables for database replication (such as REPLICA_DATABASE_URLS and REPLICA_READ_STRATEGY) is a regression in documentation quality, as these details are essential for users implementing the recommended scaling strategies.

Holmus

This is gold and will be incredibly helpful for our self-hosted customers. Great work!

Some very minor comments. Also, did you look for places in the existing documentation where we can link to this page?

Holmus · 2026-05-26T11:25:24Z

+
+### B: Server-side service with local cache
+
+Backend polls Flagsmith every 60 seconds for the full environment snapshot, then evaluates flags locally. No round-trip


Add a reference to local evaluation: https://docs.flagsmith.com/integrating-with-flagsmith/sdks#local-evaluation.

Maybe even as a tip or a note: "Unsure if you should be using local or remote evaluation? Learn more here"
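To make the pattern in the quoted hunk concrete, here is a minimal sketch of "fetch the snapshot on an interval, evaluate locally". It is not the Flagsmith SDK: `LocalFlagCache`, `fetch_snapshot`, and the flat `name -> bool` snapshot shape are hypothetical, and the refresh here happens lazily on read, whereas a real client would more likely poll in the background.

```python
import time
from typing import Callable, Dict, Optional


class LocalFlagCache:
    """Hypothetical illustration of poll-then-evaluate-locally (not the Flagsmith SDK)."""

    def __init__(
        self,
        fetch_snapshot: Callable[[], Dict[str, bool]],
        refresh_seconds: float = 60.0,
        clock: Callable[[], float] = time.monotonic,
    ) -> None:
        self._fetch_snapshot = fetch_snapshot
        self._refresh_seconds = refresh_seconds
        self._clock = clock
        self._snapshot: Dict[str, bool] = {}
        self._fetched_at: Optional[float] = None

    def is_enabled(self, flag_name: str) -> bool:
        now = self._clock()
        # Call the API only when there is no snapshot yet or it is older than the interval.
        if self._fetched_at is None or now - self._fetched_at >= self._refresh_seconds:
            self._snapshot = self._fetch_snapshot()
            self._fetched_at = now
        # Evaluation is a local dictionary lookup: no network round-trip per flag check.
        return self._snapshot.get(flag_name, False)
```

With this shape, the load on the API depends on the number of polling processes and the refresh interval, not on how often flags are evaluated.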

Holmus · 2026-05-26T11:28:08Z

+| If you change…                                  | New RPS | New tier |
+| ----------------------------------------------- | ------- | -------- |
+| Pods scale up to 300 (same one environment)     | 5 RPS   | Small    |
+| Poll interval dropped to 10 s (default is 60 s) | 3 RPS   | Medium   |


Double-check this: I guess the second row should be Small, or the first row should be Medium.
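The RPS figures in the two quoted rows can be checked with a simple polling-load calculation. This is a sketch under assumptions: the formula (pods × environments ÷ poll interval) is inferred from the first row, the 30-pod baseline is inferred from the second row (3 RPS × 10 s), and `polling_rps` is a hypothetical helper name. The tier thresholds are not in the quoted hunk, so only the RPS values are verified.

```python
def polling_rps(pods: int, environments: int, poll_interval_seconds: float) -> float:
    """Steady-state requests per second from pods polling on a fixed interval."""
    if poll_interval_seconds <= 0:
        raise ValueError("poll_interval_seconds must be positive")
    # Each pod makes one request per environment per poll interval.
    return pods * environments / poll_interval_seconds


# Row 1: 300 pods, one environment, default 60 s interval.
print(polling_rps(300, 1, 60))  # 5.0
# Row 2: assumed 30-pod baseline, interval dropped to 10 s.
print(polling_rps(30, 1, 10))   # 3.0
```

Both RPS values match the table. If tiers rise with RPS, 5 RPS mapping to Small while 3 RPS maps to Medium is the inconsistency flagged above.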

Holmus · 2026-05-26T11:37:47Z

+| `CACHE_ENVIRONMENT_DOCUMENT_MODE`       | `EXPIRING` | `PERSISTENT` at Large+                                       | Persistent mode survives pod restarts; warm-up cost amortised across the deployment.                                                                 |
+| `GET_IDENTITIES_ENDPOINT_CACHE_SECONDS` | `0` (off)  | `30–60`                                                      | Cache the personalised response from a _GET_ identity request. _POST_ identity (which updates traits) always bypasses the cache.                     |
+
+### Cache backend trade-offs


Maybe clarify that this section is directly tied to CACHE_ENVIRONMENT_DOCUMENT_BACKEND.

Holmus · 2026-05-26T11:38:14Z

+-   **Database (default).** Shared across pods. Cache hits still touch PostgreSQL. Fine through Medium.
+-   **LocMemCache.** Pod-local. Zero DB round-trip, but each pod warms separately and memory cost scales with pod count.
+    Best at Small / Medium with a small number of pods.
+-   **Redis / Memcached.** Shared, fast, off-DB. Adds a service you operate. Right at Large+.


Do we have docs on how to set up Redis or Memcached? I'm wondering whether we can reference the Helm charts or some existing docs.
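One way to read the three quoted bullets is as a small decision rule. The sketch below is only an illustration of that reading: `suggest_cache_backend` is a hypothetical helper, the tier labels are passed as plain strings, and `few_pods_threshold` is an invented placeholder because the hunk says "a small number of pods" without giving a number.

```python
def suggest_cache_backend(tier: str, pod_count: int, few_pods_threshold: int = 5) -> str:
    """Map a sizing tier and pod count to one of the cache backends in the quoted list."""
    normalised = tier.strip().lower()
    if normalised not in {"small", "medium", "large+"}:
        raise ValueError(f"unknown tier: {tier!r}")
    if normalised == "large+":
        # Shared, fast, off-DB; costs an extra service to operate.
        return "Redis / Memcached"
    if pod_count <= few_pods_threshold:
        # Pod-local: no DB round-trip, but every pod warms its own copy.
        return "LocMemCache"
    # Default: shared across pods, cache hits still touch PostgreSQL.
    return "Database"


print(suggest_cache_backend("Medium", pod_count=3))
print(suggest_cache_backend("Large+", pod_count=40))
```

At Small and Medium the bullets describe both Database and LocMemCache as acceptable, so preferring LocMemCache below the threshold is a judgment call in this sketch, not something the diff states.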

docs(Sizing): rewrite as workload-driven guide

a8ac180

Signed-off-by: germangarces <german.garces@flagsmith.com>

github-actions Bot added the docs Documentation updates label May 25, 2026

gemini-code-assist Bot reviewed May 25, 2026

Comment thread docs/docs/deployment-self-hosting/scaling-and-performance/sizing-and-scaling.md

transform sdk defaults into a collapsible

a011db3

Signed-off-by: germangarces <german.garces@flagsmith.com>

vercel Bot deployed to Preview – docs May 25, 2026 13:33

add back env vars

86fb12d

Signed-off-by: germangarces <german.garces@flagsmith.com>

vercel Bot deployed to Preview – docs May 25, 2026 13:37

add useful env vars

8c3cf67

Signed-off-by: germangarces <german.garces@flagsmith.com>

vercel Bot deployed to Preview – docs May 25, 2026 13:45

update section order

d14dbb4

Signed-off-by: germangarces <german.garces@flagsmith.com>

vercel Bot deployed to Preview – docs May 25, 2026 13:50

germangarces marked this pull request as ready for review May 25, 2026 13:51

germangarces requested a review from a team as a code owner May 25, 2026 13:51

germangarces requested review from Holmus and adamvialpando and removed request for a team May 25, 2026 13:51

Holmus reviewed May 26, 2026

Holmus removed the request for review from adamvialpando May 26, 2026 12:50

germangarces wants to merge 5 commits into main from feat/7200-sizing-scaling-docs
