Currently, both XML and SVG documents are indexed as plain text due
to mimetype inheritance. This fills the content index with meaningless
data (tags, attributes, attribute values ...).
Use QDomElement::text() for generic XML documents and <text/> nodes
for SVG to extract the content. Also try do find Dublin Core metadata
and add the relevant properties.
Depends on D16488