Search found 24 matches

by markmh
2012/03/06 22:28:41
Forum: CentOS 4 - General Support
Topic: software RAID-1 data recovery
Replies: 45
Views: 40514

Re: software RAID-1 data recovery

Today, I went to the cluster head node and I still couldn't boot it. I got these error messages: "BMC system error log (SEL) Full" "PXE-NFS: Exiting Intel Boot Agent Operating System not found" Then I have inserted a DVD with Puppy Linux. I was able to boot and the good thing is that I was able to m...
by markmh
2012/03/06 00:01:04
Forum: CentOS 4 - General Support
Topic: software RAID-1 data recovery
Replies: 45
Views: 40514

Re: software RAID-1 data recovery

Unfortunately, I cannot access the head node through the network anymore. I have to go to the cluster tomorrow and connect the monitor again. But today I did connect the monitor and keyboard to the head node for sure. And there was no response, not even the numpad light on. So I decided to reboot th...
by markmh
2012/03/05 21:51:41
Forum: CentOS 4 - General Support
Topic: software RAID-1 data recovery
Replies: 45
Views: 40514

Re: software RAID-1 data recovery

[quote]you're probably only at the equivalent of runlevel 1 now anyway so I'd not worry to much about it![/quote] I have executed the command as root. Now I went to the cluster and the headnode didn't respond to the connected keyboard or the monitor. So I have rebooted the head node. Now it shows th...
by markmh
2012/03/04 21:26:09
Forum: CentOS 4 - General Support
Topic: software RAID-1 data recovery
Replies: 45
Views: 40514

Re: software RAID-1 data recovery

Ok, I have tried to kill those processes using
[code]fuser -km /compute[/code]
The connection closed and now I'm not able to login again. Is it possible that I closed some important processes? I'm a little bit worried now :-(
by markmh
2012/03/04 18:41:52
Forum: CentOS 4 - General Support
Topic: software RAID-1 data recovery
Replies: 45
Views: 40514

Re: software RAID-1 data recovery

I have found the command [code]/sbin/service nfs stop[/code] But since there are several devices shared with nfs I was wondering if I can specifically stop one device? Also are the commands for stopping the nfs server and client identical? Do I have to execute the same command on the clients first a...
by markmh
2012/03/04 14:57:41
Forum: CentOS 4 - General Support
Topic: software RAID-1 data recovery
Replies: 45
Views: 40514

Re: software RAID-1 data recovery

Thanks, but I have unmounted /compute already on the head node where the discs /dev/sda and /dev/sdb physically are. (See post #20 [quote]We unmounted /compute by using umount -l.[/quote]) Now I want to mount /dev/sda in order to access the data but 1) the RAID is still running although I have tried...
by markmh
2012/03/03 19:38:39
Forum: CentOS 4 - General Support
Topic: software RAID-1 data recovery
Replies: 45
Views: 40514

Re: software RAID-1 data recovery

Thanks again, Trevor!

Could you be more specific about the commands:

1) How to unmount the file system?

2) How to make sure that nobody is using it?
by markmh
2012/03/03 01:56:41
Forum: CentOS 4 - General Support
Topic: software RAID-1 data recovery
Replies: 45
Views: 40514

Re: software RAID-1 data recovery

Unfortunately, stopping the RAID didn't work, after using your command I got: [code] [root@lcpp-cluster /]# /sbin/mdadm --stop /dev/md0 mdadm: fail to stop array /dev/md0: Device or resource busy [/code] However, I was able to deactivate the faulty device /dev/sdb by: [code] [root@lcpp-cluster mario...
by markmh
2012/03/02 01:37:04
Forum: CentOS 4 - General Support
Topic: software RAID-1 data recovery
Replies: 45
Views: 40514

Re: software RAID-1 data recovery

Thanks for the quick reply, Trevor! I think what happened is that the disk /dev/sdb is broken and that's why the data on RAID is not available. From the analysis that I have posted earlier it looks like /dev/sda has still all the data. I would like to access is somehow. For this I was trying to moun...
by markmh
2012/03/01 17:16:40
Forum: CentOS 4 - General Support
Topic: software RAID-1 data recovery
Replies: 45
Views: 40514

Re: software RAID-1 data recovery

the command gives: [root@lcpp-cluster ~]# echo "check" > /sys/block/md0/md/sync_action [code] bash: /sys/block/md0/md/sync_action: No such file or directory And if I go to the /sys/block/md0/ directory and do list -l I find this: -r--r--r-- 1 root root 4096 Mar 1 10:11 dev -r--r--r-- 1 root root 409...

Go to advanced search