[TermGenerator] Leave single term phrases out of the PositionDB
ClosedPublic

Authored by bruns on Jun 16 2019, 11:57 PM.

Details

Summary

Any search phrase consisting of a single term only can be handled by
the PostingDB alone, and a document with single terms can never match
a real phrase (consisting a multiple terms).

Storing these terms also in the PositonDB just wastes space, so remove
these from the DB, and delegate single term phrases to the PostingDB.

While the document content and even filenames (basename + suffix) consist
of multiple terms typically, folder names, tags and properties are often
a single term.

Test Plan

ctest
ctest -R querytest

Diff Detail

Repository
R293 Baloo
Lint
Automatic diff as part of commit; lint not applicable.
Unit
Automatic diff as part of commit; unit tests not applicable.
bruns created this revision.Jun 16 2019, 11:57 PM
Restricted Application added projects: Frameworks, Baloo. · View Herald TranscriptJun 16 2019, 11:57 PM
Restricted Application added a subscriber: kde-frameworks-devel. · View Herald Transcript
bruns requested review of this revision.Jun 16 2019, 11:57 PM
ngraham accepted this revision.Jun 17 2019, 12:05 AM
This revision is now accepted and ready to land.Jun 17 2019, 12:05 AM
This revision was automatically updated to reflect the committed changes.