comm systemd-journal: Detected aborted journal - Even on a new cloned nvme

BeastBurst

New Member
Aug 29, 2023
7
0
1
My server spec:

Code:
MB: Supermicro H12SSL-i
CPU: AMD Epyc 7742
GPU: NVIDIA RTX 3090
RAM: 512GB DDR4 RDIMM ECC


Today my Proxmox 8 nded up having the following error:

Code:
[40.294701] EXT-fs error (device dm-1): ext_journal_check_start:84: comm systemd-journal: Detected aborted journal
EXT4-fs (dm-1): Remounting filesystem read-only


This error appears like ~30 seconds after I load proxmox and even I can write my username and password and In a few seconds BOOM I get the error shown above.

First thing that I thought it might be the 2TB samsung NVME is dying on me. I run only one drive in the server. Luckaly I have two brand new nvme from Samsung again and they are even 4TBs. So I downloaded Clonezilla and initiated a clone from the old "suposadly" broken nvme of 2TB to the brand new 4TB nvme. The clone went sucessfully!

So I installed the new 4TB nvme and BOOM, the same error even on the brand new hard drive.
To make the things even more wierder I took that 4TB hard drive and place it on another machine where I had Ubuntu 22.04 installed. I remove the Ubuntu nvme and installed the 4TB cloned Proxmox on the machine and there Proxmox loads and runs fine on my PC and there Proxmox starts without showing me this error, but on the SuperMicro server it does not.

There on the PC which used to run my Ubuntu when I installed the 2TB initial drive from the server for even greater suprise it was runing without any kind of errors the same way the 4TB cloned was running.

So I decided to run test command to see if the disk is healthy and I executed `sudo smartctl -a /dev/nvme0n1` on which I've got:


Code:
SMART overall-health self-assessment test result: PASSED

If that result is acurate the disk should be fine, and why not it runs on my regular PC just fine. Why it does not on my SuperMicro Server ?

To test the server I took the nvme of my Ubuntu PC and place it on the server just to conclude even weirder result, the ubuntu loads in the same nvme slot in which it refuses to load the inital 2TB or the cloned 4TB nvme drives containing Proxmox.

How these strange outcomes can happen, anyone have a clue ?

This setup was running like this for months wihtout problems. What should I do not to make my Proxmox back online without this nasty `comm systemd-journal: Detected aborted journal`, any advice ?
 
Last edited:
Hi,

i have the same problem. Same MB with Hardware Raid. Have you found a solution?
 
Last edited:
i had this error today, too. I occurred after I replaced the motherboard on one of my two node cluster machines. really not sure what's going on. i'm fairly certain the drive is fine.

I should note I do not have anywhere near this spec of hardware. Using consumer grade hardware. 13th gen intel, 32gb memory, 250gb ssd
 
Last edited:

About

The Proxmox community has been around for many years and offers help and support for Proxmox VE, Proxmox Backup Server, and Proxmox Mail Gateway.
We think our community is one of the best thanks to people like you!

Get your subscription!

The Proxmox team works very hard to make sure you are running the best software and getting stable updates and security enhancements, as well as quick enterprise support. Tens of thousands of happy customers have a Proxmox subscription. Get yours easily in our online shop.

Buy now!