Briefly, this error occurs when Elasticsearch fails to clear the scroll context associated with a specific jobId. This could be due to a network issue, a timeout, or an internal server error. To resolve this issue, you can try the following: 1) Check the server’s health and logs for any underlying issues. 2) Increase the timeout value if the operation is taking longer than expected. 3) Ensure that the jobId is correct and exists. 4) If the problem persists, consider restarting the Elasticsearch cluster, but be aware that this should be a last resort as it can impact running operations.
This guide will help you check for common problems that cause the log ” [” + context.jobId + “] Failed to clear scroll ” to appear. To understand the issues related to this log, read the explanation below about the following Elasticsearch concepts: plugin, scroll.
Overview
In Elasticsearch, the concept of scroll comes into play when you have a large set of search results. Large search results are exhaustive for both the Elasticsearch cluster and the requesting client in terms of memory and processing. The scroll API enables you to take a snapshot of a large number of results from a single search request.
Examples
To perform a scroll search, you need to add the scroll parameter to a search query and specify how long Elasticsearch should keep the search context viable.
GET mydocs-2019/_search?scroll=40s { "size": 5000, "query": { "match_all": {} }, "sort": [ { "_doc": { "order": "asc" } } ] }
This query will return a maximum of 5000 hits. If the scroll is idle for more than 40 seconds, it will be deleted. The response will return the first page of the results and a scroll ID. You can use the scroll ID to get additional documents from the scroll. You’ll be able to keep retrieving the documents until you have all of them.
Notes
- Changes made to documents after the scroll will not show up in your results.
- When you are done with the scroll, you can delete it manually using the scroll ID.
DELETE _search/scroll/<scroll_id>
Log Context
Log “[” + context.jobId + “] Failed to clear scroll” classname is ScrollDataExtractor.java.
We extracted the following from Elasticsearch source code for those seeking an in-depth context :
try { innerClearScroll(scrollId); } catch (Exception e) { // This method is designed to be called from exception handlers; so just logs this exception // in the cleanup process so that the original exception can be propagated logger.error(() -> "[" + context.jobId + "] Failed to clear scroll"; e); } } private void innerClearScroll(String scrollId) { if (scrollId != null) {