Version 2.10
New
* Using the REST API _document endpoint, you can now fetch a document from the local dir, from an HTTP website,
or from an S3 bucket. Thanks to dadoonet.
* You can now remove a document in Elasticsearch using the FSCrawler _document
endpoint. Thanks to dadoonet.
* Implement our own HTTP Client for Elasticsearch. Thanks to dadoonet.
* Add an option to set the path to a custom Tika config file. Thanks to iadcode.
* Support for Index Templates. Thanks to dadoonet.
* Support for Aliases. You can now index to an alias. Thanks to dadoonet.
* Support for Access Tokens and API Keys instead of Basic Authentication. Thanks to dadoonet.
* Allow loading external jars. This adds a new external directory from which jars can be loaded
into the FSCrawler JVM. For example, you could provide your own Custom Tika Parser code. Thanks to dadoonet.
* Add temporal information in folder index. Thanks to bdauvissat.
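
The two _document features listed above (asking FSCrawler to fetch a document itself, and removing a document) can be sketched as plain HTTP requests. This is a minimal illustration, not the documented API: the base URL, the JSON body layout, and the filename query parameter are all assumptions that should be checked against the FSCrawler REST documentation. The sketch only builds the requests; nothing is sent.

```python
import json
import urllib.parse
import urllib.request

# Assumption: address of the FSCrawler REST service; adjust to your own setup.
BASE_URL = "http://127.0.0.1:8080/fscrawler"


def build_fetch_request(provider: str, settings: dict) -> urllib.request.Request:
    """Build a POST to _document asking FSCrawler to fetch a document itself.

    Assumption: the body has the shape {"type": <provider>, <provider>: {...}}.
    """
    body = json.dumps({"type": provider, provider: settings}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/_document",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def build_delete_request(filename: str) -> urllib.request.Request:
    """Build a DELETE to _document.

    Assumption: the document is identified by a 'filename' query parameter.
    """
    query = urllib.parse.urlencode({"filename": filename})
    return urllib.request.Request(f"{BASE_URL}/_document?{query}", method="DELETE")


if __name__ == "__main__":
    fetch = build_fetch_request("local", {"url": "/path/to/document.docx"})
    print(fetch.get_method(), fetch.full_url)
    print(fetch.data.decode("utf-8"))
    # To actually send a request: urllib.request.urlopen(fetch)
```

Building the request separately from sending it makes it easy to inspect the method, URL, and body before pointing the code at a running FSCrawler instance.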
Fix
* fs.ocr.enabled was always false. Thanks to ywjung.
* Do not hide YAML parsing errors. Thanks to dadoonet.
Deprecated
* The _upload REST endpoint has been deprecated. Please now use the _document endpoint. Thanks to dadoonet.
* Support for Elasticsearch 6.x is deprecated. Thanks to dadoonet.
* Support for Basic Authentication is deprecated. You should use API keys instead. Thanks to dadoonet.
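
To illustrate what moving from Basic Authentication to API keys means at the HTTP level, the sketch below builds both Authorization headers as Elasticsearch expects them. It covers only the header format; how the key is supplied in FSCrawler's own settings is not shown here, and the function names and sample credentials are hypothetical.

```python
import base64


def basic_auth_header(username: str, password: str) -> dict:
    """Deprecated style: 'Basic' followed by base64 of 'username:password'."""
    token = base64.b64encode(f"{username}:{password}".encode("utf-8")).decode("ascii")
    return {"Authorization": f"Basic {token}"}


def api_key_header(key_id: str, api_key: str) -> dict:
    """Preferred style: 'ApiKey' followed by base64 of 'id:api_key'."""
    token = base64.b64encode(f"{key_id}:{api_key}".encode("utf-8")).decode("ascii")
    return {"Authorization": f"ApiKey {token}"}


if __name__ == "__main__":
    # Hypothetical credentials, for illustration only.
    print(basic_auth_header("elastic", "changeme"))
    print(api_key_header("my-key-id", "my-key-secret"))
```

An API key can be scoped and revoked independently of any user password, which is the usual reason to prefer it for a long-running crawler.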
Updated
* Add full support for Elasticsearch 8.15.3, Elasticsearch 7.17.23, and Elasticsearch 6.8.23. Thanks to dadoonet.
* Update to Tika 2.9.2. Thanks to dadoonet.
Removed
* Remove the distributions that were specific to each Elasticsearch version. Thanks to dadoonet.
Thanks to @dadoonet, @ywjung, @iadcode, and @bdauvissat for this release!