[linux-cifs-client] Re: fsx-linux failing with latest cifs-2.6 git tree

Jeff Layton jlayton at redhat.com
Mon Dec 1 11:55:19 GMT 2008


On Mon, 1 Dec 2008 12:32:26 +0100
Nick Piggin <npiggin at suse.de> wrote:

> On Mon, Dec 01, 2008 at 06:28:49AM -0500, Jeff Layton wrote:
> > On Mon, 1 Dec 2008 09:44:35 +0100
> > Nick Piggin <npiggin at suse.de> wrote:
> > > > I think it actually is a problem. Suppose PageChecked is never cleared
> > > > like you say, we flush the page and then do a partial page write again.
> > > > We do a readpage this time and it fails, but the copy of data to the
> > > > page works. Now we hit cifs_write_end and PageChecked is set, but
> > > > the unwritten parts of the page actually aren't up to date. Data
> > > > corruption ensues...
> > > > 
> > > > I agree that we should drop that patch. We might be able to make
> > > > cifs_write_end more efficient, but we'll need to be more careful
> > > > with PageChecked.
> > > 
> > > Oh? I admittedly haven't looked at the source code after applying
> > > your latest patch, but I thought it should not be possible to have
> > > a leaking PageChecked. The page is under the page lock the whole
> > > time, so a concurrent write should not be an issue...?
> > > 
> > 
> > But a concurrent write and read is, right?
> > 
> > Suppose we do a successful cifs_write_begin and set PageChecked. Another
> > thread then incurs a page fault and does a readpage before we copy the
> > data to the page. Won't we then call write_end with both PageChecked and
> > PageUptodate set?
> > 
> > That write will be fine, of course. PageChecked is still true though,
> > and I think that sets up the problem I was describing...
> 
> Unless cifs is doing something different from the usual case, it should
> lock the page over the readpage operation (the end IO handler would
> typically unlock the page after doing a SetPageUptodate).
> 
> So concurrent reads should be protected with page lock as well.
> 

Ahh good point...the page would be locked there. If it's impossible for
PageUptodate to be flipped on while the page lock is held then this is
probably safe enough.

I'd still prefer that we handle the situation where both bits are set
in cifs_write_end. Some defensive coding is warranted here I think.
That can wait until 2.6.29 though. For now, the patch in Steve's tree
should be fine, IMO.

--
Jeff Layton <jlayton at redhat.com>


More information about the linux-cifs-client mailing list