Welcome to Linux Forums! With a comprehensive Linux Forum, information on various types of Linux software and many Linux Reviews articles, we have all the knowledge you need a click away, or accessible via our knowledgeable members.
Write an article for LinuxForums Today! Win Great Prizes!
For the past two days I've been having serious, but difficult to pin down problems with my gentoo box. It boots up, but at intermitant times after boot up my system completely locks up. No mouse, no keyboard, no dumping into a terminal, nothing, nathen, nada. I have to ctl-alt-sysreq- RSEIUB just to reboot.
These problems started not long after an emerge -upDN world && revdep-rebuild so I spent quite a bit of time running through different versions of packages to see if it helped, to no avail. Has anyone heard of any recent bug reports that might do this (especially on amd64 arch)?
So, I started thinking that it was hardware. The processor temperature isn't getting high, so I did a ram test. After running multiple passes of the extended RAM tests off of a CD, NO ERRORS. My next course of action, when I have some time available is to run off of a boot cd and see if the problems are reproducible then...
I am really at a loss here, anyone have any other ideas? I'm used to solving linux related problems, but I am not used to a complete system lock-up like this (in fact, come to think of it, I don't think I've ever had a linux system do that in a semi-regular fashion).
__________________
Linux since: 2001
Gentoo since: 2004
- - - - - - - -
Translation:
I fix things until they break.
Assuming that you are using stable settings and kernel and no external modules that could introduce bugs, then random lockups are a symptom of faulty hardware.
The fact that you never observed it before with any other OS might have a simple explanation, like for example:
a ram stick just broke, they are sensible enough and any electric peak can send them to hell
in the rest of distros you used the cpu and ram usage was't that high (and I am speaking about emerge here, because compiling is one of the heaviest task in all regards: cpu, ram, i/o, etc.)
First off, we need to go through the regular drill of posting your emerge --info.
Do you use ~arch or stable?
Most of the time when I get lockups like that it is a kernel or video driver. Usually hardware problems will appear in your compiling as a random error.
Although, I have had a problem before that when I loaded a certain module, the whole system locked. I had to push the reset button on the front. Maybe a module problem? Did your updates include a kernel update?
As you can see, I am stabbing in the dark here, and you have probably considered most of this, but hey, it will be great if we can get it resolved.
Also, which DE are you using? Window Manager? Compositing? Does it just lock up in X or can you be in VT1 -6 while it does it? Any log files?
First of all, thanks for the fast responses. I did a run through my package.keywords and have pruned out those things which I don't actually need to be unstable. So now I'm trying an emerge -eDN world, we'll see if that will help anything
Now to go through the suggestions:
No compositing, KDE (but I also tried to run under XFCE4 just to see if it was something kde related... still locked). DE?
I'm actually doing my system rebuild from a terminal without x running. So far that has not locked up, so I guess that sort of qualifies as evidence of a problem with my video drivers or X itself.
I was able to try running a boot cd for a couple hours this morning without the problem cropping up, but I'm still not excluding hardware error. i92goboj, you are right about the complete randomness making it sound like a hardware thing, and I think this weekend I'll be able to dig a little deeper into the hardware possibilities.
I did like the comment about the video drivers, but I have no proof about why that would be the culprit. I didn't do a kernel update, but I did emerge the nvidia-drivers (kernel module). I've tried removing, eselecting, then reloading all before starting an x session, but to no avail. I should also try switching back to no acceleration (vga or nv driver) to see if it could be specifically the nvidia drivers.
Emerge information. I've got about 30 packages ~amd64, the rest are using unmasked.
Problem solved, sort of. Lets compromise and say diagnosed.
After a weekend of too much testing, and increasing numbers of problems, I've determined that my harddrive is going bad. Now the fun of trying to get all my things off of there without too much corruption... then starting a new install.
Very disheartening...
However, I want to thank everyone for all the help. It helped me reallly narrow in on some of the important issues.
__________________
Linux since: 2001
Gentoo since: 2004
- - - - - - - -
Translation:
I fix things until they break.
For the sake of completeness I thought I would fill in a few more gaps before completely closing this thread as solved:
I did indeed find unrecoverable harddrive errors. These were tested by putting the drive into another machine and trying to repair it. I thought for awhile that this was the only problem. However, after buying two new drives and performing a full, I still encountered the same lockups that I had seen previously.
After doing some more testing, I think I've finally pinned down the source of the lockups.
1.) Rebuilding without X running never caused the system to lock up but did result in more problems -- Reason: Harddrive problems
2.) X locking up -- Reason: Failing video card
I bought a new video card and things seem to be working well. I am still a little worried that after all of this there could be something up with my motherboard, but I'll just be crossing my fingers.
__________________
Linux since: 2001
Gentoo since: 2004
- - - - - - - -
Translation:
I fix things until they break.
Open Source Security Myths Dispelled Dispel the five major myths surrounding Open Source Security and gain the tools necessary to make a truly informed decision for your IT organization subscribe
InformationWeek InformationWeek is the only newsweekly you'll need to stay on top of the latest developments in information technology. subscribe