Problem: Cannot boot anymore Discovery image.
Had a working installation of foreman (1.24.x, foreman discovery image 3.5.1, do not remember the version of foreman discovery plugin). With that version I discovered and provisioned 30/40 bare metal servers.
Upgraded to v2.0 (+ installed foreman datacenter plugin) a couple of months ago.
Tried to discover new bare metal servers in the last week.
Initially, discovery image was booting, but facts were not sent to foreman. Had this error: Validation failed: Name has already been taken
While trying to troubleshoot that error, I decided to delete /var/lib/tftpboot and let foreman recreate the directory on both foreman master AND foreman proxy servers (did this be re-running the foreman-installer command with the same parameters used during the installation/upgrade).
Since that moment, I was no more able to boot the discovery image. I see the Grub2 boot screen, select Foreman Discovery Image EFI, screen become black, and after a few seconds, this error appears: error: timeout reading ‘boot/fdi-image/initrd0.img’. Press any key to continue. Boot process continues but then it fails.
On the smart proxy tftp server, I see this in the logs
Aug 10 12:13:28 s157p dhcpd: DHCPDISCOVER from 48:df:37:4d:f0:6c via enp70s0f0
Aug 10 12:13:29 s157p dhcpd: none: host unknown.
Aug 10 12:13:29 s157p dhcpd: DHCPOFFER on 192.168.240.254 to 48:df:37:4d:f0:6c via enp70s0f0
Aug 10 12:13:31 s157p dhcpd: DHCPREQUEST for 192.168.240.254 (192.168.240.157) from 48:df:37:4d:f0:6c via enp70s0f0
Aug 10 12:13:31 s157p dhcpd: DHCPACK on 192.168.240.254 to 48:df:37:4d:f0:6c via enp70s0f0
Aug 10 12:13:31 s157p in.tftpd[2499]: RRQ from 192.168.240.254 filename grub2/shim.efi
Aug 10 12:13:31 s157p in.tftpd[2499]: Error code 8: User aborted the transfer
Aug 10 12:13:31 s157p in.tftpd[2500]: RRQ from 192.168.240.254 filename grub2/shim.efi
Aug 10 12:13:31 s157p in.tftpd[2500]: Client 192.168.240.254 finished grub2/shim.efi
Aug 10 12:13:31 s157p in.tftpd[2501]: RRQ from 192.168.240.254 filename grub2/grubx64.efi
Aug 10 12:13:31 s157p in.tftpd[2501]: Client 192.168.240.254 finished grub2/grubx64.efi
Aug 10 12:13:32 s157p in.tftpd[2502]: RRQ from 192.168.240.254 filename grub2/grub.cfg-01-48-df-37-4d-f0-6c
Aug 10 12:13:32 s157p in.tftpd[2502]: Client 192.168.240.254 File not found grub2/grub.cfg-01-48-df-37-4d-f0-6c
Aug 10 12:13:32 s157p in.tftpd[2503]: RRQ from 192.168.240.254 filename grub2/grub.cfg-8BBFF0FE
Aug 10 12:13:32 s157p in.tftpd[2503]: Client 192.168.240.254 File not found grub2/grub.cfg-8BBFF0FE
Aug 10 12:13:32 s157p in.tftpd[2504]: RRQ from 192.168.240.254 filename grub2/grub.cfg-8BBFF0F
Aug 10 12:13:32 s157p in.tftpd[2504]: Client 192.168.240.254 File not found grub2/grub.cfg-8BBFF0F
Aug 10 12:13:32 s157p in.tftpd[2505]: RRQ from 192.168.240.254 filename grub2/grub.cfg-8BBFF0
Aug 10 12:13:32 s157p in.tftpd[2505]: Client 192.168.240.254 File not found grub2/grub.cfg-8BBFF0
Aug 10 12:13:32 s157p in.tftpd[2506]: RRQ from 192.168.240.254 filename grub2/grub.cfg-8BBFF
Aug 10 12:13:32 s157p in.tftpd[2506]: Client 192.168.240.254 File not found grub2/grub.cfg-8BBFF
Aug 10 12:13:32 s157p in.tftpd[2507]: RRQ from 192.168.240.254 filename grub2/grub.cfg-8BBF
Aug 10 12:13:32 s157p in.tftpd[2507]: Client 192.168.240.254 File not found grub2/grub.cfg-8BBF
Aug 10 12:13:32 s157p in.tftpd[2508]: RRQ from 192.168.240.254 filename grub2/grub.cfg-8BB
Aug 10 12:13:32 s157p in.tftpd[2508]: Client 192.168.240.254 File not found grub2/grub.cfg-8BB
Aug 10 12:13:32 s157p in.tftpd[2509]: RRQ from 192.168.240.254 filename grub2/grub.cfg-8B
Aug 10 12:13:32 s157p in.tftpd[2509]: Client 192.168.240.254 File not found grub2/grub.cfg-8B
Aug 10 12:13:32 s157p in.tftpd[2510]: RRQ from 192.168.240.254 filename grub2/grub.cfg-8
Aug 10 12:13:32 s157p in.tftpd[2510]: Client 192.168.240.254 File not found grub2/grub.cfg-8
Aug 10 12:13:32 s157p in.tftpd[2511]: RRQ from 192.168.240.254 filename grub2/grub.cfg
Aug 10 12:13:32 s157p in.tftpd[2511]: Client 192.168.240.254 finished grub2/grub.cfg
Aug 10 12:13:32 s157p in.tftpd[2512]: RRQ from 192.168.240.254 filename /EFI/centos/x86_64-efi/command.lst
Aug 10 12:13:32 s157p in.tftpd[2512]: Client 192.168.240.254 File not found /EFI/centos/x86_64-efi/command.lst
Aug 10 12:13:32 s157p in.tftpd[2513]: RRQ from 192.168.240.254 filename /EFI/centos/x86_64-efi/fs.lst
Aug 10 12:13:32 s157p in.tftpd[2513]: Client 192.168.240.254 File not found /EFI/centos/x86_64-efi/fs.lst
Aug 10 12:13:32 s157p in.tftpd[2514]: RRQ from 192.168.240.254 filename /EFI/centos/x86_64-efi/crypto.lst
Aug 10 12:13:32 s157p in.tftpd[2514]: Client 192.168.240.254 File not found /EFI/centos/x86_64-efi/crypto.lst
Aug 10 12:13:32 s157p in.tftpd[2515]: RRQ from 192.168.240.254 filename /EFI/centos/x86_64-efi/terminal.lst
Aug 10 12:13:32 s157p in.tftpd[2515]: Client 192.168.240.254 File not found /EFI/centos/x86_64-efi/terminal.lst
Aug 10 12:13:32 s157p in.tftpd[2516]: RRQ from 192.168.240.254 filename grub2/grub.cfg
Aug 10 12:13:32 s157p in.tftpd[2516]: Client 192.168.240.254 finished grub2/grub.cfg
Aug 10 12:13:32 s157p in.tftpd[2517]: RRQ from 192.168.240.254 filename /httpboot/grub2/grub.cfg-48:df:37:4d:f0:6c
Aug 10 12:13:32 s157p in.tftpd[2517]: Client 192.168.240.254 File not found /httpboot/grub2/grub.cfg-48:df:37:4d:f0:6c
Aug 10 12:13:32 s157p in.tftpd[2518]: RRQ from 192.168.240.254 filename /grub2/grub.cfg-48:df:37:4d:f0:6c
Aug 10 12:13:32 s157p in.tftpd[2518]: Client 192.168.240.254 File not found /grub2/grub.cfg-48:df:37:4d:f0:6c
Aug 10 12:13:32 s157p in.tftpd[2519]: RRQ from 192.168.240.254 filename grub2/grub.cfg-48:df:37:4d:f0:6c
Aug 10 12:13:32 s157p in.tftpd[2519]: Client 192.168.240.254 File not found grub2/grub.cfg-48:df:37:4d:f0:6c
Aug 10 12:13:32 s157p in.tftpd[2520]: RRQ from 192.168.240.254 filename grub.cfg-48:df:37:4d:f0:6c
Aug 10 12:13:32 s157p in.tftpd[2520]: Client 192.168.240.254 File not found grub.cfg-48:df:37:4d:f0:6c
Aug 10 12:13:39 s157p in.tftpd[2521]: RRQ from 192.168.240.254 filename boot/fdi-image/vmlinuz0
Aug 10 12:13:39 s157p in.tftpd[2521]: Client 192.168.240.254 finished boot/fdi-image/vmlinuz0
Aug 10 12:13:39 s157p in.tftpd[2522]: RRQ from 192.168.240.254 filename boot/fdi-image/initrd0.img
Aug 10 12:14:16 s157p in.tftpd[2522]: Client 192.168.240.254 finished boot/fdi-image/initrd0.img
The timeout error appears when the message Client 192.168.240.254 finished boot/fdi-image/initrd0.img
is written in the server log.
In addition, I have tried the following things
- tried to upgrade foreman/smart-proxy to v2.1.1. Nothing changed
- I made sure TFTP works, by booting a live cd on the server to discover, and after it was loaded, tried to retrieve the image from the smart proxy using TFTP. The download was working.
- I tried to Network boot using a different port of the same NIC. Nothing changed
- I tried to discover another new server. Same timeout issue.
What could be the issue? Any suggestion? Many thanks in advance
Expected outcome: Discovery image boot
Foreman and Proxy versions:
Foreman master 2.1.1
Foreman proxy (in a different subnet): 2.1.1
Foreman and Proxy plugin versions:
Foreman discovery plugin 16.1.0
Foreman discovery image 3.5.7
Foreman datacenter 2.0.1
Distribution and version:
CentOS 7.8