  1. #1
    Just Joined! DusteD's Avatar
    Join Date
    May 2006
    Location
    Denmark
    Posts
    35

    Randomly disappearing server

    This problem is really getting to me.

    The server
    Debian Stable
    NIS/NFS/Samba/Apache

    The Problem
    The server suddenly and randomly disappears, like, it's just poof, gone!
    I can't log in over ssh, NFS clients freeze, and it doesn't respond to ping.
    Then, sometimes up to 10 minutes later, pop, there it is: it starts responding to ping
    and people can log in again.
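Since the outages come and go on their own, it may help to record exactly when each one starts and ends, so the times can be compared against cron jobs, backups, or client activity. The following is only a sketch: it assumes some other machine is already probing the server at regular intervals (for example with ping) and saving a timestamp plus a reachable/unreachable flag. The function name `outage_windows` and the sample data are made up for illustration.

```python
from datetime import datetime, timedelta

def outage_windows(samples):
    """Collapse (timestamp, reachable) samples into (start, end) outage windows.

    `samples` must be sorted by timestamp. An outage that is still open at
    the last sample is reported with end=None.
    """
    windows = []
    start = None
    for ts, reachable in samples:
        if not reachable and start is None:
            start = ts                   # first failed probe of an outage
        elif reachable and start is not None:
            windows.append((start, ts))  # first good probe after the outage
            start = None
    if start is not None:
        windows.append((start, None))
    return windows

# Hypothetical samples, one probe per minute (not real measurements).
t0 = datetime(2006, 5, 1, 12, 0)
flags = [True, False, False, False, True, True]
samples = [(t0 + timedelta(minutes=i), ok) for i, ok in enumerate(flags)]

for start, end in outage_windows(samples):
    print(start, "->", end)
```

If the windows turn out to line up with a recurring job or a particular client coming online, that narrows the search considerably.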

    I haven't been able to provoke it into disappearing (by stressing NFS, logging in, and trying to brute-force NIS accounts :P).

    There's NOTHING in the logs, on the terminal, or in dmesg.
    ifconfig shows no lost packets, even after a "freeze".

    When I'm in front of a client and it freezes, and I run into the server room and log in as root, there are no messages. I can't ping out from the server either, but ifconfig shows it has an IP address.

    I have tried moving the server to another room with better networking, I've tried changing the NIC (3 times!), and I've tried different cables. I'm totally beat here...

    Please, if you find it in your heart to offer ANY advice, I'll be very happy.

    Thank you.

  2. #2
    Linux Guru antidrugue's Avatar
    Join Date
    Oct 2005
    Location
    Montreal, Canada
    Posts
    3,212
    Quote Originally Posted by DusteD
    but ifconfig shows it has an IP address.
    Is it always the same?
    "To express yourself in freedom, you must die to everything of yesterday. From the 'old', you derive security; from the 'new', you gain the flow."

    -Bruce Lee

  3. #3
    Just Joined! DusteD's Avatar
    Join Date
    May 2006
    Location
    Denmark
    Posts
    35
    Hi, thank you for your answer. Yes, it is the same.

    I encountered (once) this error message:
    RPC: bad TCP reclen 0x39366436 (non-terminal)
    A little googling tells me it has something to do with NFS and/or TCP.
    I wouldn't expect this kind of behavior from a sarge install.
    The clients are running Debian testing, but still, you shouldn't be able to crash
    a server just by talking wrongly to it.
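The number in that message can be picked apart. RPC over TCP prefixes each record fragment with a 4-byte big-endian header: the top bit marks the last fragment and the remaining 31 bits give the fragment length. The sketch below decodes the value from the log line under that assumption; the helper name `decode_record_mark` is made up for illustration.

```python
import struct

def decode_record_mark(value):
    """Split a 32-bit RPC record mark into (last_fragment, length, raw_bytes)."""
    raw = struct.pack(">I", value)            # the 4 bytes as they appeared on the wire
    last_fragment = bool(value & 0x80000000)  # top bit: is this the final fragment?
    length = value & 0x7FFFFFFF               # low 31 bits: fragment length in bytes
    return last_fragment, length, raw

last, length, raw = decode_record_mark(0x39366436)
print("last fragment:", last)
print("claimed length:", length)
print("raw bytes:", raw)
```

The top bit is clear, which seems consistent with the "(non-terminal)" wording, and the claimed length is close to a gigabyte. The four bytes are also all printable ASCII characters, which could mean the server read ordinary text or the middle of a stream where it expected a record header, though that is a guess and not a diagnosis.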

  4. #4
    Linux Guru antidrugue's Avatar
    Join Date
    Oct 2005
    Location
    Montreal, Canada
    Posts
    3,212
    Quote Originally Posted by DusteD
    A little googling tells me it has something to do with NFS and/or TCP.
    I wouldn't expect this kind of behavior from a sarge install.
    I remember having some networking issues with the default 2.6 Sarge kernel. I fixed them by compiling my own kernel (I used 2.6.12.x, the last one that works well without udev).

  5. #5
    Linux User DThor's Avatar
    Join Date
    Jan 2006
    Location
    Ca..na...daaa....
    Posts
    319
    Can you repro without running NFS? Maybe you're not in a situation where you can practically take that offline, though. I'm hearing about NFS issues too, especially with large file transfers. When you changed the NIC, was it replaced with one that uses the same driver type? I'm reading about issues with aic7xx drivers in the kernel around that version.

    Certainly a kernel upgrade would be the next thing I'd investigate. Sure sounds like a kernel/driver issue, anyway.

    DT
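To answer the driver question, one way to see which kernel driver is bound to a network interface is to follow the `device/driver` symlink under sysfs. This is a sketch that assumes a 2.6-series kernel with sysfs mounted at /sys and an interface named eth0; the helper name `nic_driver` is made up, and virtual interfaces without a backing device simply return None.

```python
import os

def nic_driver(iface, sysfs_root="/sys/class/net"):
    """Return the name of the driver bound to `iface`, or None if unknown."""
    link = os.path.join(sysfs_root, iface, "device", "driver")
    if not os.path.exists(link):
        return None  # no such interface, or no physical device behind it
    # The symlink points at the driver's directory; its basename is the driver name.
    return os.path.basename(os.path.realpath(link))

# "eth0" is an assumed interface name; substitute the real one.
print(nic_driver("eth0"))
```

Running this before and after a card swap shows whether the replacement really used a different driver or just different hardware behind the same one.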
