No GRUB since last update

, , ,

Hello to all the askfedoraproject members and all the best for 2023,

I am not at all an expert in Linux distributions but I am an enthusiast user of them for a few years. To briefly summarize my current configuration :

Setup :
AMD Ryzen 5 3600
X570
AMD GPU
sda : sdd with Windows OS
sdb : HDD for pure storage
sdc : sdd with Fedora 35
sdd : nvme M.2 with Fedora 36

All this setup was working easily and properly for a while but it drastically changed last Friday after an update on the Fedora 36. The day after, when I started the computer, the GRUB did not appear at all. All the Fedora OS were down
and only the Windows system is booting since. I checked in the boot menu setting to set only one disk and still only Windows works fine. The first boot, I got a message indicating :
Unexpected return from roots volume corrupt "buffer 1009???" (not really sure about the end)
Failed to load image start_image() returned volume corrupt

After a few starting sequences I got another message
Failed to open grub64.exi not found

I already had a look through different topics in the french community and tried a few things like monitoring the disks with smartmontool package.
smartctl -aA /dev/sdX

Excepted an issue related to power losses coming from the previous PSU, there was no clear indications. Those information are available here below :

sda : (Windows 10)

smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.15.0-33-generic] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke

=== START OF INFORMATION SECTION ===
Model Family:     Crucial/Micron Client SSDs
Device Model:     CT480BX500SSD1
Serial Number:    2011E3EF49A3
LU WWN Device Id: 0 000000 000000000
Firmware Version: M6CR022
User Capacity:    480,103,981,056 bytes [480 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
TRIM Command:     Available
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-3 T13/2161-D revision 4
SATA Version is:  SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sun Jan  8 06:44:31 2023 CST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)	Offline data collection activity
					was never started.
					Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		(  120) seconds.
Offline data collection
capabilities: 			 (0x11) SMART execute Offline immediate.
					No Auto Offline data collection support.
					Suspend Offline collection upon new
					command.
					No Offline surface scan supported.
					Self-test supported.
					No Conveyance Self-test supported.
					No Selective Self-test supported.
SMART capabilities:            (0x0002)	Does not save SMART data before
					entering power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 (  10) minutes.

SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   000   100   000    Pre-fail  Always       -       0
  5 Reallocate_NAND_Blk_Cnt 0x0032   100   100   010    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       7185
 12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       1910
171 Program_Fail_Count      0x0032   100   100   000    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   000   000   000    Old_age   Always       -       0
173 Ave_Block-Erase_Count   0x0032   006   006   000    Old_age   Always       -       132
174 Unexpect_Power_Loss_Ct  0x0032   100   100   000    Old_age   Always       -       334
180 Unused_Reserve_NAND_Blk 0x0033   100   100   000    Pre-fail  Always       -       217
183 SATA_Interfac_Downshift 0x0032   100   100   000    Old_age   Always       -       2
184 Error_Correction_Count  0x0032   100   100   000    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   087   030   000    Old_age   Always       -       13 (Min/Max 5/70)
196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       0
197 Current_Pending_ECC_Cnt 0x0032   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   100   100   000    Old_age   Always       -       0
202 Percent_Lifetime_Remain 0x0030   094   094   001    Old_age   Offline      -       6
206 Write_Error_Rate        0x000e   000   000   000    Old_age   Always       -       0
210 Success_RAIN_Recov_Cnt  0x0032   100   100   000    Old_age   Always       -       0
246 Total_LBAs_Written      0x0032   100   100   000    Old_age   Always       -       29122563035
247 Host_Program_Page_Count 0x0032   100   100   000    Old_age   Always       -       910080094
248 FTL_Program_Page_Count  0x0032   100   100   000    Old_age   Always       -       1078680608

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

Selective Self-tests/Logging not supported


***sdb : (storage space)***

smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.15.0-33-generic] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke

=== START OF INFORMATION SECTION ===
Model Family:     Seagate BarraCuda 3.5
Device Model:     ST2000DM008-2FR102
Serial Number:    WFL4B5EL
LU WWN Device Id: 5 000c50 0d3ea6ea3
Firmware Version: 0001
User Capacity:    2,000,398,934,016 bytes [2.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    7200 rpm
Form Factor:      3.5 inches
TRIM Command:     Available
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-3 T13/2161-D revision 5
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sun Jan  8 06:47:01 2023 CST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)	Offline data collection activity
					was never started.
					Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		(    0) seconds.
Offline data collection
capabilities: 			 (0x73) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					command.
					No Offline surface scan supported.
					Self-test supported.
					Conveyance Self-test supported.
					Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   1) minutes.
Extended self-test routine
recommended polling time: 	 ( 198) minutes.
Conveyance self-test routine
recommended polling time: 	 (   2) minutes.
SCT capabilities: 	       (0x30a5)	SCT Status supported.
					SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   084   064   006    Pre-fail  Always       -       232421735
  3 Spin_Up_Time            0x0003   099   098   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   094   094   020    Old_age   Always       -       6902
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   074   060   045    Pre-fail  Always       -       24208912
  9 Power_On_Hours          0x0032   092   092   000    Old_age   Always       -       7141 (168 131 0)
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   099   099   020    Old_age   Always       -       1722
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0 0 0
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   066   052   040    Old_age   Always       -       34 (Min/Max 31/34)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       203
193 Load_Cycle_Count        0x0032   090   090   000    Old_age   Always       -       21758
194 Temperature_Celsius     0x0022   034   048   000    Old_age   Always       -       34 (0 20 0 0 0)
195 Hardware_ECC_Recovered  0x001a   084   064   000    Old_age   Always       -       232421735
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       1909h+35m+55.051s
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       10121552832
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       484994725

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

***sdc : (Fedora 35)***

smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.15.0-33-generic] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke

=== START OF INFORMATION SECTION ===
Model Family:     Crucial/Micron Client SSDs
Device Model:     CT120BX500SSD1
Serial Number:    2018E3FA9CB5
LU WWN Device Id: 0 000000 000000000
Firmware Version: M6CR013
User Capacity:    120,034,123,776 bytes [120 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
TRIM Command:     Available, deterministic, zeroed
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 T13/2015-D revision 3
SATA Version is:  SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sun Jan  8 06:47:44 2023 CST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
See vendor-specific Attribute list for marginal Attributes.

General SMART Values:
Offline data collection status:  (0x00)	Offline data collection activity
					was never started.
					Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		(  120) seconds.
Offline data collection
capabilities: 			 (0x11) SMART execute Offline immediate.
					No Auto Offline data collection support.
					Suspend Offline collection upon new
					command.
					No Offline surface scan supported.
					Self-test supported.
					No Conveyance Self-test supported.
					No Selective Self-test supported.
SMART capabilities:            (0x0002)	Does not save SMART data before
					entering power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 (  10) minutes.

SMART Attributes Data Structure revision number: 1
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   100   100   050    Pre-fail  Always       -       0
  5 Reallocate_NAND_Blk_Cnt 0x0032   100   100   010    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   050    Old_age   Always       -       6063
 12 Power_Cycle_Count       0x0032   100   100   050    Old_age   Always       -       1658
171 Program_Fail_Count      0x0032   100   100   050    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   100   100   050    Old_age   Always       -       0
173 Ave_Block-Erase_Count   0x0032   100   100   050    Old_age   Always       -       49
174 Unexpect_Power_Loss_Ct  0x0032   100   100   050    Old_age   Always       -       290
180 Unused_Reserve_NAND_Blk 0x0032   100   100   050    Old_age   Always       -       100
183 SATA_Interfac_Downshift 0x0032   100   100   050    Old_age   Always       -       1
184 Error_Correction_Count  0x0032   100   100   050    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   050    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   072   039   050    Old_age   Always   In_the_past 28 (Min/Max 20/61)
196 Reallocated_Event_Count 0x0032   100   100   050    Old_age   Always       -       0
197 Current_Pending_ECC_Cnt 0x0032   100   100   050    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0030   100   100   050    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0032   100   100   050    Old_age   Always       -       0
202 Percent_Lifetime_Remain 0x0030   097   097   001    Old_age   Offline      -       97
206 Write_Error_Rate        0x002e   100   100   050    Old_age   Always       -       0
210 Success_RAIN_Recov_Cnt  0x0032   100   100   050    Old_age   Always       -       0
246 Total_LBAs_Written      0x0032   100   100   050    Old_age   Always       -       2988639655
247 Host_Program_Page_Count 0x0032   100   100   050    Old_age   Always       -       93394989
248 FTL_Program_Page_Count  0x0032   100   100   050    Old_age   Always       -       121261408

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

Selective Self-tests/Logging not supported

sdd : (Fedora 36)

smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.15.0-33-generic] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke

=== START OF INFORMATION SECTION ===
Model Family:     WD Blue / Red / Green SSDs
Device Model:     WDC  WDS100T2B0B-00YS70
Serial Number:    210639441511
LU WWN Device Id: 5 001b44 4a7047b8a
Firmware Version: 401020WD
User Capacity:    1,000,204,886,016 bytes [1.00 TB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Form Factor:      M.2
TRIM Command:     Available, deterministic, zeroed
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-4 T13/BSR INCITS 529 revision 5
SATA Version is:  SATA 3.3, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sun Jan  8 06:48:36 2023 CST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00)	Offline data collection activity
					was never started.
					Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		(    0) seconds.
Offline data collection
capabilities: 			 (0x11) SMART execute Offline immediate.
					No Auto Offline data collection support.
					Suspend Offline collection upon new
					command.
					No Offline surface scan supported.
					Self-test supported.
					No Conveyance Self-test supported.
					No Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   2) minutes.
Extended self-test routine
recommended polling time: 	 (  10) minutes.

SMART Attributes Data Structure revision number: 4
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  5 Reallocated_Sector_Ct   0x0032   100   100   ---    Old_age   Always       -       0
  9 Power_On_Hours          0x0032   100   100   ---    Old_age   Always       -       5200
 12 Power_Cycle_Count       0x0032   100   100   ---    Old_age   Always       -       1457
165 Block_Erase_Count       0x0032   100   100   ---    Old_age   Always       -       55861772612
166 Minimum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       2
167 Max_Bad_Blocks_per_Die  0x0032   100   100   ---    Old_age   Always       -       42
168 Maximum_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       7
169 Total_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       539
170 Grown_Bad_Blocks        0x0032   100   100   ---    Old_age   Always       -       0
171 Program_Fail_Count      0x0032   100   100   ---    Old_age   Always       -       0
172 Erase_Fail_Count        0x0032   100   100   ---    Old_age   Always       -       0
173 Average_PE_Cycles_TLC   0x0032   100   100   ---    Old_age   Always       -       3
174 Unexpected_Power_Loss   0x0032   100   100   ---    Old_age   Always       -       226
184 End-to-End_Error        0x0032   100   100   ---    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   ---    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   ---    Old_age   Always       -       0
194 Temperature_Celsius     0x0022   058   060   ---    Old_age   Always       -       42 (Min/Max 19/60)
199 UDMA_CRC_Error_Count    0x0032   100   100   ---    Old_age   Always       -       0
230 Media_Wearout_Indicator 0x0032   001   001   ---    Old_age   Always       -       0x0040001e0040
232 Available_Reservd_Space 0x0033   100   100   004    Pre-fail  Always       -       100
233 NAND_GB_Written_TLC     0x0032   100   100   ---    Old_age   Always       -       3984
234 NAND_GB_Written_SLC     0x0032   100   100   ---    Old_age   Always       -       7687
241 Host_Writes_GiB         0x0030   253   253   ---    Old_age   Offline      -       6438
242 Host_Reads_GiB          0x0030   253   253   ---    Old_age   Offline      -       6495
244 Temp_Throttle_Status    0x0032   000   100   ---    Old_age   Always       -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]

After this, I also tried to boot on a live USB stick with Fedora 37 installed on it. I wanted to follow this procedure based on another topic :

mkdir -p /mnt/fedora

mount /dev/fedora-root /mnt/fedora

mount /dev/sdd2 /mnt/fedora/boot
mount /dev/sdd1 /mnt/fedora/boot/efi

mount -o bind /dev /mnt/fedora/dev
mount -o bind /proc /mnt/fedora/proc
mount -o bind /sys /mnt/fedora/sys
mount -t tmpfs tmpfs /mnt/fedora/tmp

chroot /mnt/fedora

dnf reinstall grub2-efi shim

But when I reach the second line, I get this message.

[liveuser@localhost-live mnt]$ sudo mount /dev/fedora-root /mnt/fedora
mount: /mnt/fedora: special device /dev/fedora-root does not exist.
       dmesg(1) may have more information after failed mount system call.

I am kind of stuck right and do not know what to do exactly or what am I doing wrong. I am wondering if installing another distribution on the sdc (Fedora 35) would not be efficient to “rebuild” the Grub and maybe find the Fedora 36 system back?

Thanks a lot for your time and help and I apologize for the (too?) long message.

Regards,

CM

Is your root partition a LVM volume? Maybe even encrypted?

Excepted if by default the LVM volume is selected I don’t think so but surely not encrypted.

Using verbatim the suggestions on a web site is often not possible. The devices named are appropriate for their system but usually not correct for yours.

One has to know the device and partition names on their own system then insert as appropriate in the suggested commands.

For example, one could use lsblk -f to identify all devices and partitions on the machine in use and once the partition to be used is identified then the needed info is available.
If using lvm then the name similar to what was posted above may be found by doing ls /dev/mapper. If using btrfs it is different but can be found with the lsblk command given above and the UUID as well as the subvolume name is required. If using ext4 it can be found with both lsblk -f and ls /dev

Booting to a live media allows one to identify the device and once the device has been identified then judicious editing of the suggested commands will work. The first 3 of those mount commands are the ones where you need to get the appropriate info to perform the mounts. You also need to edit the dnf reinstall command to read dnf reinstall grub2-efi* shim