Differential D16490

[XmlExtractor] Add unittest for XML extractor
ClosedPublic
Actions

Authored by bruns on Oct 28 2018, 5:40 PM.

Details

Reviewers

astippich

Group Reviewers

Frameworks

Commits

R286:dbb34ea5039c: [XmlExtractor] Add unittest for XML extractor

Test Plan

make && ./xmlextractortest

Diff Detail

Repository

R286 KFileMetaData

Branch

xml_extractor

Lint

No Linters Available

Unit

No Unit Test Coverage

Build Status

Buildable 4302
Build 4320: arc lint + arc unit

bruns created this revision.Oct 28 2018, 5:40 PM

Restricted Application added projects: Frameworks, Baloo. · View Herald TranscriptOct 28 2018, 5:40 PM

Restricted Application added subscribers: Baloo, kde-frameworks-devel. · View Herald Transcript

bruns requested review of this revision.Oct 28 2018, 5:40 PM

Harbormaster completed remote builds in B4279: Diff 44375.Oct 28 2018, 5:40 PM

bruns edited the test plan for this revision. (Show Details)Oct 28 2018, 5:41 PM

bruns added reviewers: Frameworks, astippich.

bruns added a dependency: D16489: [KFileMetaData] Add extractor for generic XML and SVG.Oct 28 2018, 5:43 PM

Generally looks OK to me, one note on QDir::separator() usage.

autotests/xmlextractortest.cpp
38 ↗	(On Diff #44375)	IIRC you shouldn't use `QDir::separator()`. See http://agateau.com/2015/qdir-separator-considered-harmful/

Replace QDir::Separator with "/"

Harbormaster completed remote builds in B4302: Diff 44409.Oct 29 2018, 9:27 AM

bruns marked an inline comment as done.Oct 29 2018, 9:40 AM

bruns added inline comments.

autotests/xmlextractortest.cpp
38 ↗	(On Diff #44375)	Thx, fixed here and elsewhere (D16505)

Only one minor thing: please also check that the mimetype is in the list of supported mimetypes

In D16490#351662, @astippich wrote:

Only one minor thing: please also check that the mimetype is in the list of supported mimetypes

This can actually happen and is completely valid, due to mimetype inheritance.

So the check would be for supported in supportedMimetypes { if QMimeType(input->mimeType()).inherits(supported) return true; }; return false. But this is already done from the calling code ...

In D16490#351799, @bruns wrote:

In D16490#351662, @astippich wrote:

Only one minor thing: please also check that the mimetype is in the list of supported mimetypes

This can actually happen and is completely valid, due to mimetype inheritance.

So the check would be for supported in supportedMimetypes { if QMimeType(input->mimeType()).inherits(supported) return true; }; return false. But this is already done from the calling code ...

Hmmm, I don't understand. When I change the code to return an empty stringlist of supported mimetypes for the xmlextractor, the tests still pass.
This should imho be covered by the tests.

In D16490#351935, @astippich wrote:

In D16490#351799, @bruns wrote:

In D16490#351662, @astippich wrote:

Only one minor thing: please also check that the mimetype is in the list of supported mimetypes

This can actually happen and is completely valid, due to mimetype inheritance.

So the check would be for supported in supportedMimetypes { if QMimeType(input->mimeType()).inherits(supported) return true; }; return false. But this is already done from the calling code ...

Hmmm, I don't understand. When I change the code to return an empty stringlist of supported mimetypes for the xmlextractor, the tests still pass.
This should imho be covered by the tests.

This is one level above these tests. The surrounding code ensures the right extractor is called for each file, see ExtractorCollection::fetchExtractors(...).

These are unit tests. The test itself is responsible to call an extractor with a suitable file and a matching mimetype.

What you are calling for are system tests.

In D16490#352109, @bruns wrote:

In D16490#351935, @astippich wrote:

In D16490#351799, @bruns wrote:

In D16490#351662, @astippich wrote:

Only one minor thing: please also check that the mimetype is in the list of supported mimetypes

This can actually happen and is completely valid, due to mimetype inheritance.

So the check would be for supported in supportedMimetypes { if QMimeType(input->mimeType()).inherits(supported) return true; }; return false. But this is already done from the calling code ...

Hmmm, I don't understand. When I change the code to return an empty stringlist of supported mimetypes for the xmlextractor, the tests still pass.
This should imho be covered by the tests.

This is one level above these tests. The surrounding code ensures the right extractor is called for each file, see ExtractorCollection::fetchExtractors(...).

Right, and if e.g. the list of supported mimetypes is empty, the corresponding extractor will never be selected because ExtractorCollection doesn't know that the mimetype is supported by this extractor.
Hence we should ensure and test imho that the list of supported mimetypes provided to the ExtractorCollection is correct for this extractor. I'm not calling for testing that the right extractor is selected.

These are unit tests. The test itself is responsible to call an extractor with a suitable file and a matching mimetype.

What you are calling for are system tests.

In D16490#352199, @astippich wrote:

In D16490#352109, @bruns wrote:

In D16490#351935, @astippich wrote:

In D16490#351799, @bruns wrote:

In D16490#351662, @astippich wrote:

Only one minor thing: please also check that the mimetype is in the list of supported mimetypes

This can actually happen and is completely valid, due to mimetype inheritance.

So the check would be for supported in supportedMimetypes { if QMimeType(input->mimeType()).inherits(supported) return true; }; return false. But this is already done from the calling code ...

Hmmm, I don't understand. When I change the code to return an empty stringlist of supported mimetypes for the xmlextractor, the tests still pass.
This should imho be covered by the tests.

This is one level above these tests. The surrounding code ensures the right extractor is called for each file, see ExtractorCollection::fetchExtractors(...).

Right, and if e.g. the list of supported mimetypes is empty, the corresponding extractor will never be selected because ExtractorCollection doesn't know that the mimetype is supported by this extractor.
Hence we should ensure and test imho that the list of supported mimetypes provided to the ExtractorCollection is correct for this extractor. I'm not calling for testing that the right extractor is selected.

The unit tests do not use ExtractorCollection, because they test the extractors, not ExtractorCollection. The extractor unit tests explicitly pass the mime type to the extractor.

We don't want to double the checks.

bruns retitled this revision from [KFileMetaData] Add unittest for XML extractor to [XmlExtractor] Add unittest for XML extractor.Nov 1 2018, 4:08 PM

bruns added a dependent revision: D16591: [XmlExtractor] Use QXmlStreamReader for better performance.

I understand that ExtractorCollection is not used, and that the extractor gets the mimetype directly set in the test.
But the list retrieved via XmlExtractor::mimetypes() is part of the extractor itself, isn't it? So imho that should be tested here.
All I'm asking for is something similar to https://phabricator.kde.org/source/kfilemetadata/browse/master/autotests/embeddedimagedatatest.cpp$48

In D16490#352244, @astippich wrote:

I understand that ExtractorCollection is not used, and that the extractor gets the mimetype directly set in the test.
But the list retrieved via XmlExtractor::mimetypes() is part of the extractor itself, isn't it? So imho that should be tested here.
All I'm asking for is something similar to https://phabricator.kde.org/source/kfilemetadata/browse/master/autotests/embeddedimagedatatest.cpp$48

For very specialized media types, this works out ok, but this is not correct in general:

Assume you have a filetype which is a specialization of some other, supported mime type. This specialization is not in the mimetypes list, but can still be handled by the extractor.

For XML, you will not be able to add every specialized mimetype. SVG, which is a specialization, is listed specifically because it has specific supporting code, but every other XML variant can be handled by the generic code.

Alright, you convinced me.

This revision is now accepted and ready to land.Nov 1 2018, 4:40 PM

Closed by commit R286:dbb34ea5039c: [XmlExtractor] Add unittest for XML extractor (authored by bruns). · Explain WhyNov 1 2018, 5:59 PM

This revision was automatically updated to reflect the committed changes.

This causes the Qt5.9 build to fail

https://build.kde.org/job/Frameworks/job/kfilemetadata/job/kf5-qt5%20SUSEQt5.9/workflow-stage/

Showing Only Differences

This revision modifies 3 more files that are hidden because they were not modified between selected diffs and they have no inline comments.

Revision Contents
Changeset List

			Path	Packages
A	M		autotests/xmlextractortest.cpp (117 lines)

Diff	ID	Base	Description	Created	Lint	Unit
Base			Base
Diff 1	44375	0bba318		Oct 28 2018, 5:40 PM	★	★
Diff 2	44409	0bba318	Replace QDir::Separator with "/"	Oct 29 2018, 9:27 AM	★	★
Diff 3	44656	9c58ae5	R286:dbb34ea5039c508728d824218ce10d91e1deab82	Nov 1 2018, 5:59 PM	★	★

Commit	Tree	Parents	Author	Summary	Date
8b270dddc9d3	784d5512d549	0bba318b8bff	Stefan Brüns	[KFileMetaData] Add unittest for XML extractor (Show More…)	Oct 28 2018, 1:59 AM

Status	Author	Revision
Closed	bruns	D16591 [XmlExtractor] Use QXmlStreamReader for better performance
Closed	bruns	D16490 [XmlExtractor] Add unittest for XML extractor
Closed	bruns	D16489 [KFileMetaData] Add extractor for generic XML and SVG
Closed	bruns	D16488 [KFileMetaData] Add helper for XML encoded Dublin Core metadata

Diff 44409

View Options

autotests/xmlextractortest.cpp