[Samba] Windows SMB2 client doing excessive, inefficient SMB2 Find (and other) requests

awl1 awl1 at mnet-online.de
Thu Aug 24 13:37:07 UTC 2017


Hello Jeremy, Andrew, All,

I can now indeed confirm that my test scenario runs MUCH faster when 
executed against a Windows 10 SMB2 server than against Samba (test 
scenario takes ~ 18 seconds using a Windows SMB2 server as opposed to ~ 
300 seconds when the server is Samba 4.6.5).

But, most interestingly, the reason for these performance gains is NOT - 
as assumed by Jeremy - that the number of client-server request/response 
cycles over the network goes down - Wireshark pcapng file size is even 
slightly larger!!! (so it is NOT client-side caching on the Win SMB2 
client side that does the trick), but the Windows SMB2 server "only" 
responds much faster to the infamous (and unnecessary/imperformant) 
SMB2_FIND_ID_BOTH_DIRECTORY_INFO requests with Pattern "*" - which 
probably means that the Windows SMB2 server does some caching here.

In order to write the ~ 1000 files in my old test scenario, the total 
packet capture size for a SMB1 client against both a Samba or a Windows 
SMB1 server is about 10 MB, while the total packet caapture size for a 
SMB2 client against a SMB2 server (regardless whether Samba or Windows 
SMB2 server!!!) is about 32 MB, which clearly points to the inefficient 
use of Find requests with "*" pattern. (Unfortunately, the Windows SMB2 
server can still handle this 32MB communication in 18 seconds, while the 
Samba 4.6.5 server takes ~ 300 seconds.)

Nevertheless, comparing this with SMB1 behaviour (where the number of 
Find requests was equal to the number of files to be written, and none 
of the Find requests used a "*" Find pattern) this means that the whole 
scenario can (and probably should) be optimized by Microsoft for both 
cases as a "low hanging fruits" performance optimisation. Also, the way 
a Linux SMB2 client behaves (mount.cifs vers=2.x or 3.x) shows that it 
might even be possible to complete the write scenario without any Find 
requests (not sure about Windows' case insensitive file system, though...).

Of course, in addition, on the long run, the Samba server should also be 
improved to behave similarly to the Windows SMB2 server and cache the 
results of Find calls (probably this means implementing the infamous 
"directory handle leases"), but IMO that's a second step...


Before I follow your advice to move this whole issue/topic to the 
samba-technical alias and start over there with a clean description of 
the scenario and the findings, I have two simple questions:

1) I plan to use a new, reproducible test scenario with 2000 random 
small files with a file length between 1 and 2048 bytes, created along 
the lines of the following:

for i in $(seq -f "%04g" 1 2000) ; do
   length=`shuf -i 1-2048 -n 1`
   head -c $length < /dev/urandom > file${i}.rnd
done

in order to make test data non-confidential (unfortunately, my previous 
test files/packet traces were confidential). Do you agree that the above 
procedure is fine to create the test scenario?

2) What about confidential data from SMB/SMB2 sessions (i.e. Samba 
usernames/passwords)? What do I meed to do to filter all information 
from Wireshark traces that points to users and passwords?

More specifically, would filtering all SMB2 "SessionSetup" request and 
response packets from Wireshark traces be sufficient to do so? What 
about machine names/IP addresses from my LAN? Any other such IP 
addresses/machine names contained in any packets other than (obviously) 
those of my particular SMB2 client and Samba server (and Windows SMB2 
server to compare)?

Many thanks one more time for your feedback and replies to questions 1) 
and 2) above...

Best regards
Andreas



Am 18.08.2017 um 23:57 schrieb awl1:
> Ah, ok, "directory handle leases"... Ouch, I see... :-(
>
> In this case, I will first repeat my test scenario with a Windows SMB2 
> server and report back here.
>
> Based on the results of this exercise, you can then advise whether you 
> still want to move this to smb-technical and raise this with Microsoft 
> folks (who still might have a simple workaround "fix" to improve their 
> SMB2 client performance with a Samba server that does not [yet] 
> implement directory handle leases), or whether the only way to fix 
> this (on the long run, I am perfectly aware that this is a hugely 
> complex task) will be implementing those infamous "directory handle 
> leases" in Samba... ;-)
>
> Many thanks so far for the progress we made today - I'll call it a day 
> for today now (close to midnight over here), and get back once I have 
> the Wireshark recordings of the Windows SMB2 share server...
>
> Best regards
> Andreas
>
>
> Am 18.08.2017 um 23:42 schrieb Jeremy Allison:
>> On Fri, Aug 18, 2017 at 11:35:04PM +0200, awl1 wrote:
>>> Am 18.08.2017 um 23:17 schrieb Jeremy Allison via samba:
>>>> This might be hidden against Windows due to directory handle
>>>> leases, which we don't yet support.
>>> Are you saying that when I replace the Samba server by a Windows
>>> SMB2 share server, I should see better performance? I can perfectly
>>> test that out and record a Wireshark trace for this if you like...
>> Yes, that would be my guess.
>>
>>> Layperson question: Would such a "directory handle lease" be
>>> something like a cache for SMB2 Find responses?
>>>
>>> This would have to be a client-side cache in order to avoid sending
>>> back all 1000 file names in the directory from server to client over
>>> the network with every single response.
>>>
>>> Also, in case it indeed were a client-side cache, this cache would
>>> also need to be silently concurrently updated with every Create
>>> request/response cycle, because the number of files in the
>>> server-side directory always grows by the one file just written
>>> between one call to SMB2_FIND_ID_BOTH_DIRECTORY_INFO and the next...
>> That's exactly what directory handle leases are. They're
>> oplocks for directories.
>
>




More information about the samba mailing list