High CPU on multiple syncs on Win2K
jw schultz
jw at pegasys.ws
Fri Aug 23 11:10:00 EST 2002
On Fri, Aug 23, 2002 at 09:39:03PM +0200, bart.coninckx at watco.be wrote:
> >Some things that would help in evaluation would be the
> >rsync command-line and the disk configuration (are you running
> >RAID, what level and HW vs. SW, etc), approximate file counts
> >and size, and it also might help if we knew if the overloaded
> >server were low powered.
>
> this is the command line of the server:
> rsync --daemon --config=rsyncd.conf
>
> client:
> rsync -rtv --delete --modify-window=2 --stats /cygdrive/r/
> bee2bs01::d/backup/servername/data/
>
> We're using hardware RAID 5.
>
> >As a start i would check to be sure that you aren't using
> >the --checksum option unnecessarily.
>
> Nope, we use the default...
>
> >That was apparently the
> >problem the last time someone reported excessive load.
> >Also, don't run software RAID-5.
>
> No, we indeed prefer hardware RAID. We use a SCSI to ATA RAID configuration
> (a device with a SCSI connector, but with IDE disks in it), but this should
> not influence CPU performance, since the machine sees natice SCSI.
OK we've covered the obvious. I only mentioned SW RAID
because that can turn IO bound to CPU bound.
A server change might affect the number of client
connections that are supportable. If you are CPU bound,
changing hardware will probably have more effect than
the OS.
> >The other thing that would be worth doing is to manage the
> >load better by reducing the number of simultaneous
> >connections.
>
> True, but we have a limited timeframe (about 10 hours), to sync everything.
> It could well be that Rsync manages to do an avarage site in less than 2
> hours, but we don't know yet. There are a lot of variables there: what is
> changed in data on a particular day on a particular site, how fast is the
> WAN link, ... Since the clients are scattered, it's also hard to
> orchestrate a steady scheme. This would have been a lot easier in a pull
> configuration, but as I mentionned, for some reason the Rsync service
> (daemon) won't work with Netware mappings.
Look into the rsyncd.conf "max connections" parameter. I
haven't used it myself but expect that it will require
wrapping the job in a script to do a
collision_detect+backoff+retry loop. That would allow you
to have much less wasted window-time than you would with
pure scheduling.
--
________________________________________________________________
J.W. Schultz Pegasystems Technologies
email address: jw at pegasys.ws
Remember Cernan and Schmitt
More information about the rsync
mailing list