-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Better way to detect failed disks #25
Comments
It's basically a function of which RAID controller is in use - if it's soft raid or a hardware RAID that http://hwraid.le-vert.net/ has support for then we get notifications. If it's a hardware RAID with only proprietary tools and no wrapper script then we don't. |
Looks like ramoth should be able to detect them, but the status command is bombing out:
|
The yevaud card is an Areca which hwraid doesn't have a status daemon for. |
The error on ramoth seems to be because megacli is claiming there are four arrays but will only return information about the first one. |
Ah I was wrong about that, the actual problem is array 2, which doesn't report a number of disks:
|
I've patched it now but it seems to be reporting everything as optimal anyway. |
Indeed - the arrays are optimal. But there's a global hot-spare, which has been pulled into the array to stand in for the failed drive. So there's a failed physical disk, which isn't part of any array... but we still want to know about it. |
So send a patch to https://github.com/eLvErDe/hwraid then ;-) |
eLvErDe/hwraid#13 may actually be what we want? |
I've opened eLvErDe/hwraid#17 for the CacheCade issue. |
Is this still an issue, or are we getting much better alerting these days? |
Closing as very old. If still an issue can be re-opened. |
Recently we've lost disks in ramoth and yevaud without notifications (at least - not that I can find). However, failed disk notifications appear to be working fine on orm and pummelzacken.
The text was updated successfully, but these errors were encountered: