New Job Invocations Page in 3.18.1 - Unusable due to performance

singularity001 · April 16, 2026, 11:30am

Problem:

We are a extremely heavy user of Foreman and Dynflow. I would guess maybe even the largest in the world. 70,000 hosts, around 2M facts processed daily, and around 150,000-300,000 daily jobs.
The new job invocations page is quite a bit (to put it nicely) slower in loading than the previous old page.
In addition, it doesn’t appear to update live like the old one did. I see hosts that show Pending, but if you click on the arrow to see the output, they have already finished their job.

When loading the Legacy page (32,000 job test) it loads in about 3-6 seconds, that includes all the hosts and their statuses. And it updates live, and quickly.

The new UI isn’t even useable. If I load the new UI on the same parent job (32,000 jobs), it loads and loads and loads and never seems to complete. Each load I have waited about 12-17 mins. After 2-3 mins it loads the job page itself. After another 10-15 mins the host lists still shows loading. It took me around 20 mins to write this post, the hosts are still stuck loading.

On occasion, we also see this error:

TypeError: Cannot read properties of undefined (reading 'length')     in g     in t     in main     in div     in Page     in div     in FlexItem     in div     in Flex     in p     in b     in t     in o     in a     in Connect(a)     in vo     in div     in h     in IntlProvider     in I18nProviderWrapper(h)     in d     in StoreProvider(I18nProviderWrapper(h))     in DataProvider(StoreProvider(I18nProviderWrapper(h)))     in Unknown

Expected outcome:

The new jobs page loads as quickly (if not quicker) than the current Legacy page.

Foreman and Proxy versions:

All latest (3.18.1)

$ rpm -qa | grep foreman
rubygem-foreman_salt-17.0.2-1.fm3_15.el9.noarch
rubygem-foreman_vault-3.0.0-1.fm3_15.el9.noarch
rubygem-foreman_statistics-2.1.0-4.fm3_16.el9.noarch
rubygem-hammer_cli_foreman_kubevirt-0.2.0-2.fm3_16.el9.noarch
rubygem-hammer_cli_foreman_salt-0.1.0-3.fm3_16.el9.noarch
rubygem-hammer_cli_foreman_ssh-0.0.3-2.fm3_16.el9.noarch
foreman-obsolete-packages-1.11-1.el9.noarch
rubygem-hammer_cli_foreman_tasks-0.0.24-1.fm3_17.el9.noarch
rubygem-foreman_puppet-9.1.0-1.fm3_17.el9.noarch
foreman-release-3.18.1-1.el9.noarch
foreman-proxy-3.18.1-1.el9.noarch
rubygem-hammer_cli_foreman-3.18.1-1.el9.noarch
foreman-selinux-3.18.1-1.el9.noarch
foreman-3.18.1-1.el9.noarch
rubygem-foreman-tasks-11.1.1-1.fm3_18.el9.noarch
rubygem-foreman_remote_execution-16.5.3-1.fm3_18.el9.noarch
foreman-dynflow-sidekiq-3.18.1-1.el9.noarch
foreman-libvirt-3.18.1-1.el9.noarch
foreman-postgresql-3.18.1-1.el9.noarch
foreman-redis-3.18.1-1.el9.noarch
foreman-service-3.18.1-1.el9.noarch
foreman-telemetry-3.18.1-1.el9.noarch
foreman-vmware-3.18.1-1.el9.noarch
rubygem-foreman_kubevirt-0.6.0-1.fm3_18.el9.noarch
rubygem-foreman_ovirt-2.0.3-1.fm3_18.el9.noarch
rubygem-foreman_templates-11.0.1-1.fm3_18.el9.noarch
rubygem-foreman_webhooks-5.0.2-1.fm3_18.el9.noarch
foreman-cli-3.18.1-1.el9.noarch
rubygem-hammer_cli_foreman_puppet-0.1.2-1.fm3_18.el9.noarch
rubygem-hammer_cli_foreman_remote_execution-0.4.2-1.fm3_18.el9.noarch
rubygem-foreman_maintain-1.14.3-1.el9.noarch
foreman-installer-3.18.1-1.el9.noarch

Distribution and version:

Alma Linux 9

jeremylenz · April 16, 2026, 12:35pm

What is your default per_page setting?

Do you see any errors or logs in the browser console?

singularity001 · April 16, 2026, 1:18pm

Default per page is 25.

Legacy:

entire page, along with all hosts and status updated in about 45 seconds.

New UI:

took about 6 mins this time just to load the page, but the hosts section below will never load (at least not for the 20 mins Ive attempted to wait 3-4 times)

jeremylenz · April 16, 2026, 1:31pm

@aruzicka @MariaAga any ideas here? I don’t recognize any of those console errors.. And while I’m sure we didn’t test with 34k hosts, it should only be dealing with 25 at a time.

singularity001 · April 16, 2026, 2:33pm

Incidentally, as I was working on this - we had a massive issue with the database (seemingly on the hosts table) and had to actually restart the database.

Foreman DB Slowness — Summary

Database: 56 GB PostgreSQL 17 on foreman-salt-control-pg17-admin.redacted

Root Causes

hosts table seq scans — 6.3M full table scans despite having 13 indexes. The planner isn’t using them, likely due to queries filtering on unindexed column combinations. This is the most probable cause of slowness.

Data bloat — Several tables have never been purged:

logs: 16 GB / 117M rows

audits: 12 GB / 28.7M rows

taxable_taxonomies: 9.7 GB / 57.9M rows

reports: 6.3 GB (5.8 GB is TOAST/report body content)

sessions: 1.5 GB / 3.9M stale sessions

229 idle connections from Foreman holding persistent PG backends open.

What’s Healthy

Cache hit ratio is 99.89% (good)

Autovacuum is running, dead tuple counts are low

No lock contention or long-running queries at time of check

No orphaned replication slots

singularity001 · April 16, 2026, 4:08pm

With DBA/AI help, we made some changes to the DB, in regards to hosts:

Foreman Database Slowness — Investigation Summary

Database: PostgreSQL 17 on foreman-salt-control-pg17-admin.redacted
Database size: 56 GB
Host count: ~70K managed hosts

Change Made

Created a functional index on the hosts table:
CREATE INDEX CONCURRENTLY index_hosts_on_lower_name_and_type ON hosts (type, lower(name));
Why: Foreman queries the hosts table using WHERE type = $1 AND LOWER(name) = '...' on every host checkin/task. The existing index_hosts_on_name is on name (case-sensitive), but the application queries use LOWER(name), so PostgreSQL couldn’t use the index and performed a full table scan every time. With 70K hosts and 34K+ concurrent task jobs, this resulted in millions of sequential scans.

Result:

Before: 20ms average per query, 6.3M sequential scans on the hosts table

After: 0.096ms per query (200x improvement), planner now uses the new index

2,580 index scans recorded within minutes of creation

So everything does actually load faster now. Our All Hosts page, the host page itself. No change in the new jobs UI though. Same issue.

singularity001 · April 16, 2026, 5:45pm

More information. The page seems to load somewhat normally immediately after kicking off a job. You still have to wait 4-6 mins, but it does eventually load and the hosts also load. However, if you attempt to refresh the page, or go from legacy back to new ui, that is where the hosts just never seem to load.

singularity001 · April 17, 2026, 2:43pm

And a bit more info. Not sure it helps. But the page slowness occurs even after the job/tasks are long done.

singularity001 · April 17, 2026, 3:09pm

Adding more fuel to the fire… The Create Report option is now greyed out in the Legacy, but not in the new UI. fyi.

pablomh · April 20, 2026, 6:40pm

Hi Jeff. Can I ask you if you’re comparing between different releases or are you using the same release and changing “live” between legacy and “new”? If so, can you tell me how so I can reproduce it?

singularity001 · April 20, 2026, 8:13pm

Thank you so much for replying. Not comparing between different releases, we are running all latest in our prod env (70,000 minions). Just switching between Legacy and New.

I will try to get some more data points for you. I will try running:

100 host job
1000 host job
10,000 host job

We can see if they exponentially get longer to load.

singularity001 · April 21, 2026, 11:25am

Another note.

When waiting for the hosts to load in said job, the entire Hosts/All Hosts page is unuseable. Completely locks up anything related to HOST.

I am running the above tests now.

singularity001 · April 23, 2026, 11:34am

100 hosts = 1-2 second load times for both the page, and the hosts list.
1000 hosts = 2-4 seconds load times for the page, 1-2 second load time for the hosts list.
30,000 hosts = fail

I’ll try to run 10,000 - It does seem there might be some threshold it hits and then fails.

singularity001 · April 23, 2026, 11:42am

singularity001 · April 23, 2026, 2:46pm

3,000 host run about 10 seconds.
So we can see a pattern here. The more hosts, the longer it takes.

ekohl · April 29, 2026, 7:56am

I tried to look up where that query exactly happens to see where that happens but I couldn’t find it. I had hoped to find out if this was a regression in the Ruby code. Anyone has more luck than I did?

Now I also wonder if this is a pattern we have in other places: query on a field that has an index, but is lowered at runtime. Worse, it may also fail some validations if we rely on database-level unique indexes.

singularity001 · April 29, 2026, 12:59pm

One thing I can tell you is that when the jobs page get stuck loading, anything related to Hosts gets backed up and slow. You cannot even load the All Hosts page (set to 25) if youre waiting for the job page to load.

singularity001 · June 16, 2026, 3:39pm

@pablomh @ekohl

Wanted to follow up on this and see if there was ever a bug/issue created for this, and if there are any fixes for it? We run in to this daily. Best thing we can do for now is force the source code to load the legacy page, since its the only one that works.

singularity001 · June 23, 2026, 6:30pm

Bump.

ekohl · June 23, 2026, 6:31pm

When I look at Pull requests · theforeman/foreman_remote_execution · GitHub there are a few PRs open.

github.com/theforeman/foreman_remote_execution

Performance

master ← adamruzicka:perf

opened 12:36PM - 26 May 26 UTC

adamruzicka

+113 -94

Includes https://github.com/theforeman/foreman_remote_execution/pull/1034 Ope…ning as a draft until I actually read through it. Whether the individual fixes should be included is up to debate. I'd expect these two to yield the biggest benefits: - [Stop serializing all hosts and template invocations in job invocation show](https://github.com/theforeman/foreman_remote_execution/commit/f4641bc6bac8c86a312844c58047d439eba07058) - [Wait for previous poll response before scheduling the next one](https://github.com/theforeman/foreman_remote_execution/commit/123676660ea7720515147059b3ef738a98e2f32b) The rest are sort of nice to haves

github.com/theforeman/foreman_remote_execution

Stop serializing all hosts and template invocations in job invocation

master ← adamruzicka:api-cleanup

opened 12:33PM - 27 May 26 UTC

adamruzicka

+33 -48

Includes #1034 , just look at the last commit please. > The show endpoint ser…ialized every host (via hosts/base) and every template invocation on each request, including the 1-second polling requests from the detail page. None of this data is used by the frontend — hosts are fetched separately by the host table, and per-host template invocations are fetched on row expand. The biggest problem with this is that removing it changes a public api. To offer some alternatives, we could: - add a parameter that would just disable the parts that I'm proposing to remove - add a new endpoint, deprecate the old one and eventually remove it, but then we'd eventually end up with an oddly shaped api