[Samba] Spotlight indexing with fscrawler for multiple shares

Kees van Vloten keesvanvloten at gmail.com
Thu Aug 10 13:01:43 UTC 2023


Hi Matthias,

On 10-08-2023 at 14:46, Matthias Kühne | Ellerhold
Aktiengesellschaft via samba wrote:
> Hey Kees,
>
> disclaimer: shameless self-plug!!
>
> If you don't need content indexing you can use my indexer:
> https://github.com/Ellerhold/fs2es-indexer

I have looked at it because of my troubles with FScrawler, and I love your
solution because it does not need heavyweight Java.

But there is one thing FScrawler is good at: it indexes all kinds of
file metadata (such as EXIF data in photos), and it can even do OCR.
As far as I understand, fs2es-indexer does not do this.

That is the reason why I am stuck with FScrawler for now.

Hopefully I am wrong and you are going to tell me that fs2es-indexer has 
all the functionality of FScrawler but not the issues :-)

The other thing is that I am pushing data to OpenSearch, which requires
me to patch and compile FScrawler, another complexity I don't like very
much.

- Kees

>
> I've created it because I couldn't get FScrawler to work correctly.
>
> You can add as many directories as you like in the config; it'll crawl
> them all through one daemon service.
>
> I'm planning on adding smb.conf parsing, so you don't even have to add
> these directories to the YAML file and can just use Samba as you would.
>
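
To illustrate the smb.conf parsing idea mentioned above, here is a
minimal sketch of how share paths could be read out of an smb.conf with
Python's standard configparser. This is not fs2es-indexer's code: the
function name share_paths, the list of skipped sections, and the sample
shares and paths are all hypothetical, and the sketch ignores "include"
directives and backslash line continuations.

```python
import configparser

# Assumption: these sections are not ordinary file shares and are skipped.
SPECIAL_SECTIONS = {"global", "printers", "print$", "homes"}


def share_paths(smb_conf_text):
    """Return {share_name: path} for every share section that sets `path`."""
    parser = configparser.ConfigParser(
        interpolation=None,           # smb.conf uses % macros such as %U
        strict=False,                 # tolerate repeated sections/options
        delimiters=("=",),            # smb.conf separates key and value with "="
        comment_prefixes=("#", ";"),  # both comment styles occur in smb.conf
    )
    parser.read_string(smb_conf_text)

    shares = {}
    for name in parser.sections():
        if name.lower() in SPECIAL_SECTIONS:
            continue
        path = parser.get(name, "path", fallback=None)
        if path:
            shares[name] = path
    return shares


# Hypothetical sample configuration, for illustration only.
EXAMPLE = """
[global]
workgroup = EXAMPLE

[projects]
path = /srv/samba/projects
read only = no

[photos]
path = /srv/samba/photos
"""

if __name__ == "__main__":
    for name, path in share_paths(EXAMPLE).items():
        print(f"{name}: {path}")
```

The resulting mapping could then be turned into the list of directories
to crawl, so that every share is covered by a single service.
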
> Let me know if you need some help setting it up or otherwise.
>
> Have a nice day,
>
> Matthias.
>
> On 04.08.23 at 19:56, Kees van Vloten via samba wrote:
>> Hi Team,
>>
>>
>> Did anybody solve the issue of FScrawler crawling over multiple
>> shares, preferably from a single job or from a single service?
>>
>> Setting up a service for FScrawler per share does not scale very well...
>>
>>
>> - Kees.
>>
>>


