Corrupt HTML files

Andrew Bartlett abartlet at
Mon May 20 20:13:45 EST 2002

Matthew Hawkins wrote:
> Jan-Yun Lou (loujanyun at wrote:
> > Hi
> >
> > I've been experiencing some strange problems lately.
> > Sometimes when I save an HTML document from the web
> > onto my hard drive and then try to open it later, all I
> > get is rubbish symbols.
> [--- snip ---]
> Are you sure it's actual corruption?  Sounds a lot to me like it's simply
> saving the pages in an alternate character set; I'd hazard a guess at
> UTF-8 or UTF-16.

Or possibly gzipped?  Some sites are starting to send pages gzip-compressed,
so it might be that the page isn't being decompressed before it's saved.
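
One rough way to tell the two guesses apart is to look at the first few bytes
of the saved file. The Python sketch below is only an illustration: the
function names diagnose and recover are made up for this example, and it
assumes the file is either a gzip stream (which starts with the bytes 1f 8b)
or text that begins with a Unicode byte order mark. A file with neither
signature is reported as "unknown" and returned untouched.

```python
import gzip


def diagnose(data: bytes) -> str:
    """Guess why a saved page looks like rubbish (hypothetical helper)."""
    # Every gzip stream starts with the two magic bytes 0x1f 0x8b.
    if data[:2] == b"\x1f\x8b":
        return "gzip"
    # A UTF-8 byte order mark is the three bytes EF BB BF.
    if data[:3] == b"\xef\xbb\xbf":
        return "utf-8-bom"
    # A UTF-16 byte order mark is FF FE (little-endian) or FE FF (big-endian).
    if data[:2] in (b"\xff\xfe", b"\xfe\xff"):
        return "utf-16"
    # No recognisable signature; this says nothing about the real cause.
    return "unknown"


def recover(data: bytes) -> bytes:
    """Decompress the data if it looks gzipped, else return it unchanged."""
    if diagnose(data) == "gzip":
        return gzip.decompress(data)
    return data
```

If diagnose reports "gzip", running the file through recover (or gunzip on
the command line) should give back readable HTML. If it reports "utf-16",
the file is probably fine and just needs a viewer that understands that
encoding.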

Andrew Bartlett

Andrew Bartlett                                 abartlet at
Manager, Authentication Subsystems, Samba Team  abartlet at
Student Network Administrator, Hawker College   abartlet at
