[Extractor] Make extractor crash resilient
ClosedPublic
Actions

Authored by bruns on Oct 16 2018, 10:47 PM.

Details

Reviewers

poboiko
ngraham

Group Reviewers

Baloo
Frameworks

Commits

R293:1509ca51c5ed: [Extractor] Make extractor crash resilient

Summary

Connect to QProcess::finished to detect the exit status. In case the
process has crashed, signal the indexer.

On a crash, restart the process and feed it a smaller batch. If the
crashing batch contains only a single file, mark the file as failed, i.e.
add it to the "failedid" db and remove it from the content indexing db
to avoid further indexing attempts.

CCBUG: 375131

Test Plan

start balooctl monitor
add a file known to crash the extractor to an indexable path
touch an unproblematic file
-> indexer crashes on first file and continues with the second

Diff Detail

Repository

R293 Baloo

Lint

Automatic diff as part of commit; lint not applicable.

Unit

Automatic diff as part of commit; unit tests not applicable.

bruns created this revision.Oct 16 2018, 10:47 PM

Restricted Application added projects: Frameworks, Baloo. · View Herald TranscriptOct 16 2018, 10:47 PM

Restricted Application added a subscriber: kde-frameworks-devel. · View Herald Transcript

bruns requested review of this revision.Oct 16 2018, 10:47 PM

Harbormaster completed remote builds in B3975: Diff 43768.Oct 16 2018, 10:47 PM

apol added a subscriber: apol.Oct 16 2018, 11:12 PM

apol added inline comments.

src/file/extractorprocess.cpp
45	Shouldn't it check the exitCode too?
55	Do you really need to waitForStarted?
src/file/filecontentindexer.cpp
74	`= false;`

bruns marked 3 inline comments as done.Oct 16 2018, 11:20 PM

bruns added inline comments.

src/file/extractorprocess.cpp
45	the exitCode is "only valid for normal exits" (Qt docu), and 0 otherwise.
55	Copied from the old code. Should not hurt, as done from a runner thread.

coding style

Harbormaster completed remote builds in B3976: Diff 43769.Oct 16 2018, 11:20 PM

rebase

Harbormaster completed remote builds in B3978: Diff 43777.Oct 17 2018, 2:06 AM

poboiko added inline comments.Oct 17 2018, 10:01 AM

src/file/filecontentindexer.cpp
75	Is it OK to use `QObject::connect` inside a `while` loop? Those are not `Qt::UniqueConnection`, won't they fire multiple times (more and more, actually)?
91	Do I understand correctly, that's a binary-search-like way to find the file actually causing `extractor` to crash, and it will reindex some of the files in a batch ~log2(batchsize) times? Why don't we rely instead on its progress reporting, via `startedIndexingFile` / `finishedIndexingFile`?

broulik added a subscriber: broulik.Oct 17 2018, 10:08 AM

broulik added inline comments.

src/file/filecontentindexer.cpp
75	given `loop` lives on the stack it is destroyed when the scope is left and the connection severed

bruns added inline comments.Oct 17 2018, 12:21 PM

src/file/filecontentindexer.cpp
91	Yes, its a binary search. Because we only have IDs here, and the progress reporting works on strings.

poboiko added inline comments.Oct 17 2018, 1:16 PM

src/file/filecontentindexer.cpp
91	There is `Transaction::documentId(const QByteArray& path)`, which can resolve it using `DocumentUrlDB`. I believe it still would be cheaper than reindexing several files multiple times.

bruns marked 3 inline comments as done.Oct 17 2018, 1:42 PM

bruns added inline comments.

src/file/filecontentindexer.cpp
91	But it is racy - if the file is replaced in the meantime, inode and filename no longer match. This is not completely unlikely when dealing with temporary files. It would make the code also significantly more complex, and I want it simple here. It should only be hit in exceptional cases. Also, it is not to uncommon to have a batch of size one from the start, i.e. when adding/modifying files.

bruns marked 3 inline comments as done.Oct 17 2018, 1:43 PM

poboiko added inline comments.Oct 20 2018, 9:16 AM

src/file/filecontentindexer.cpp
91	OK, we can simply count how many times we got `finishedIndexingFile`, and just go to the corresponding position in the batch. It's just the binary search here does look a bit unnecessary to me...

bruns marked an inline comment as done.Oct 24 2018, 12:09 PM

bruns added inline comments.

src/file/filecontentindexer.cpp
91	Counting would give a good hint which one failed, but still no guarantee. Files may be added or removed during the indexer run, so the position is only approximate. The indexer may also have sent a "finished" message, and crash afterwards in a destructor call. Doing a binary search is straight forward and avoids any dependencies or assumptions about other parts of the code. Also, this code should be only temporary anyway - if the extractor is run in a separate process which only receives the file using a readonly file descriptor (for sandboxing) and passes back the result, the problematic documents id is known by the parent process.

How about accepting this as-is, and postponing any enhancements to another patch?

This has three benefits:

no further delay
it is bisectable in case some bug appears
it is easy to spot how much code such a change requires

Yeah go ahead I think.

This revision is now accepted and ready to land.Oct 25 2018, 2:12 PM

Closed by commit R293:1509ca51c5ed: [Extractor] Make extractor crash resilient (authored by bruns). · Explain WhyOct 25 2018, 2:18 PM

This revision was automatically updated to reflect the committed changes.

bruns mentioned this in T9867: Handle crashing indexers.Oct 27 2018, 7:01 PM

Revision Contents
Changeset List

			Path	Packages
M			src/file/extractorprocess.h (2 lines)
M			src/file/extractorprocess.cpp (20 lines)
M			src/file/filecontentindexer.cpp (18 lines)
M			src/file/filecontentindexerprovider.h (1 line)
M			src/file/filecontentindexerprovider.cpp (10 lines)

Diff	ID	Base	Description	Created	Lint	Unit
Base			Base
Diff 1	43768	d33965e		Oct 16 2018, 10:47 PM	★	★
Diff 2	43769	d33965e	coding style	Oct 16 2018, 11:20 PM	★	★
Diff 3	43777	5d3d6a5	rebase	Oct 17 2018, 2:06 AM	★	★
Diff 4	44212	b62e76f	R293:1509ca51c5ed5b78d56a794e466eb4b9d0bd3f3b	Oct 25 2018, 2:17 PM	★	★