Sam Couter sam at couter.id.au
Wed Jan 20 20:33:20 MST 2010

steve jenkin <sjenkin at canb.auug.org.au> wrote:
> PS: There is a whole class of network problem which I've never heard a
> good name for ("cascade fault"?) like Martin's - some 'error recovery'
> mechanism kicks in and compounds the fault.
> Often causing accelerating damage and turning a recoverable/borderline
> situation into a guaranteed (massive) failure.

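Reminds me of a time in the dim past when I used to tunnel PPP
connections over SSH. Bad idea. It works fine while the link is
underutilised, but as soon as packets start dropping there are two TCP
stacks trying to do error recovery. The outer one (SSH) retransmits the
lost data, and while it does, the inner connections see a stall, time
out and retransmit as well, piling more traffic onto a link that is
already struggling.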
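
A rough way to see the pile-up, as a toy model and nothing like a real
TCP implementation: treat the outer connection as stalled for a fixed
time while it recovers, and count how often each inner connection's
retransmission timer fires in that window. The function name, the
1 second initial timeout, the doubling backoff and the 60 second cap
are all assumptions picked for illustration.

```python
def redundant_retransmits(stall_seconds, connections=1,
                          initial_rto=1.0, max_rto=60.0):
    """Count inner-TCP retransmissions queued during an outer stall.

    Toy model: the outer (tunnel) TCP delivers nothing for
    `stall_seconds`. Each inner connection retransmits its oldest
    unacknowledged segment whenever its timer expires, doubling the
    timeout each time (exponential backoff, capped at `max_rto`).
    Every one of those copies is redundant, because the outer TCP
    will deliver the original once it recovers.
    """
    elapsed = 0.0
    rto = initial_rto
    fired = 0
    # Keep firing the inner timer for as long as it expires mid-stall.
    while elapsed + rto < stall_seconds:
        elapsed += rto
        fired += 1
        rto = min(rto * 2, max_rto)
    return fired * connections


if __name__ == "__main__":
    for stall in (0.5, 10.0, 30.0):
        print(stall, redundant_retransmits(stall, connections=20))
```

With those assumed numbers, a 10 second stall costs each inner
connection 3 redundant retransmissions, and the outer TCP will still
faithfully deliver every one of them once it catches up.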


The upshot is that you want an error-correcting, reliable delivery
protocol at exactly *one* place in the stack. If you're doing stuff
over TCP, such as fetching .debs via HTTP, just trust TCP to get the
job done right. If TCP is busted, you have bigger problems, and it's
not something you can easily work around anyway.
