Slow product page

Problem:
When I open the product page in foreman (/products) it is very slow. Sometimes it takes almost a minute to show the list (20 of 37 total).

If I go back soon after that it’s faster but still taking a few seconds.

Expected outcome:
Faster load of page.

Foreman and Proxy versions:

foreman-3.11.4-1.el9.noarch
katello-4.13.1-1.el9.noarch

up-to-date.

Distribution and version:
AlmaLinux 9.4

Other relevant data:

I guess it’s due to a long running sql query:

2024-11-19 17:17:22 CET LOG:  duration: 49207.298 ms  execute <unnamed>: SELECT "audits".* FROM "audits" WHERE "audits"."auditable_id" IN (SELECT "katello_repositories"."id" FROM "katello_repositories" INNER JOIN "katello_root_repositories" ON "katello_repositories"."root_id" = "katello_root_repositories"."id" WHERE "katello_root_repositories"."product_id" = $1) AND "audits"."auditable_type" = $2 AND "audits"."action" = $3 ORDER BY "audits"."created_at" DESC LIMIT $4
2024-11-19 17:17:22 CET DETAIL:  parameters: $1 = '116', $2 = 'Katello::Repository', $3 = 'sync', $4 = '1'

Could try running foreman-rake audits:expire to clean up your old audits.

Makes me wonder why we’re loading audits. Is there any chance that other pages are slow too with an audit-related query? I won’t assume we’re doing it without reason, but worth questioning.

I have got plenty of audits:

foreman=# select count(*) from audits;
  count  
---------
 3061306
(1 row)

but I suppose this is a problem:

[root@foreman8 ~]# foreman-rake audits:expire
The interval for keeping the Audits is not defined in the settings, exiting...

I have set it to 7 days now and that clear up a lot:

[root@foreman8 ~]# foreman-rake audits:expire
Deleting audits older than 2024-11-12 17:19:49 UTC. This might take a few minutes...
Successfully deleted 3034244 audits!

Still the query isn’t really fast:

2024-11-19 18:21:37 CET LOG:  duration: 1843.762 ms  execute <unnamed>: SELECT "audits".* FROM "audits" WHERE "audits"."auditable_id" IN (SELECT "katello_repositories"."id" FROM "katello_repositories" INNER JOIN "katello_root_repositories" ON "katello_repositories"."root_id" = "katello_root_repositories"."id" WHERE "katello_root_repositories"."product_id" = $1) AND "audits"."auditable_type" = $2 AND "audits"."action" = $3 ORDER BY "audits"."created_at" DESC LIMIT $4
2024-11-19 18:21:37 CET DETAIL:  parameters: $1 = '116', $2 = 'Katello::Repository', $3 = 'sync', $4 = '1'

That is about the same amount of time as before for the second and later visits of the product page after the extremely slow first one. However, now, following refreshes of the page are faster and don’t log a long running query anymore. Thus, at the moment it seems to have helped.

That query does not return any rows anyway:

foreman=# SELECT "audits".* FROM "audits" WHERE "audits"."auditable_id" IN (SELECT "katello_repositories"."id" FROM "katello_repositories" INNER JOIN "katello_root_repositories" ON "katello_repositories"."root_id" = "katello_root_repositories"."id" WHERE "katello_root_repositories"."product_id" = 116 ) AND "audits"."auditable_type" = 'Katello::Repository' AND "audits"."action" = 'sync' ORDER BY "audits"."created_at" DESC limit 1;
 id | auditable_id | auditable_type | user_id | user_type | username | action | 
audited_changes | version | comment | associated_id | associated_type | request_
uuid | created_at | remote_address | auditable_name | associated_name 
----+--------------+----------------+---------+-----------+----------+--------+-
----------------+---------+---------+---------------+-----------------+---------
-----+------------+----------------+----------------+-----------------
(0 rows)

Product 116 is our company internal product, containing some local repositories with custom build rpms which are uploaded to foreman. None of the repositories are synced from elsewhere which is, I guess, why there are not audit entries. The status of the product is always (correctly) “Never synced”. Thus, it probably has to scan the full table which takes a while because the closest index would be

    "index_audits_on_auditable_type_and_auditable_id_and_version" btree (auditable_type, auditable_id, version)

which doesn’t help because of the lack of the version value.

Thanks for the analysis. Looking at the code, we fall back to searching Audits if we cannot find a sync task to relate to the Product. An interesting experiment might be to add some synced repo to your upload-only product, I wonder if the product index loading time would decrease…

It makes me wonder if storing the last sync time on the product record itself wouldn’t be better. I feel like there must be some reason that work was put into searching tasks and audits.

I created Bug #38086: Products index page is slow for products that have no synced repositories - Katello - Foreman.

1 Like