Website audit: Find and report broken links
Open, NormalPublic

Description

Go through the https://www.kde.org website (and its subpages) and test if all the links work properly.

If you find some links that don't work, please report them in this task.

You can post them in a comment, or create a shared note on https://notes.kde.org and paste the link to it here.
Make sure to indicate where each link is (on which page).

This task is suitable for new contributors.

Related Objects

skadinna created this task.Jan 31 2018, 7:56 PM
skadinna triaged this task as Normal priority.
davidc added a subscriber: davidc.Jan 31 2018, 8:16 PM

Many links under International Sites are broken.

Russia, Ukraine, Iran, Turkey, China, Argentina.

Hungary redirects to KDE.org
Poland redirects to Planet KDE

johnpaulding added a comment.EditedFeb 1 2018, 5:03 PM

There's no shortage of broken link checking tools out there. They won't pick up on links that just lead to wrong pages/sites, but they will pick up dead links. I've got wc3's validator tool checking links right now. https://validator.w3.org/checklink

I think a visual sitemap should be created for the kde overhaul before too much is done here. If the sitemap is to change much, we might be tracking down links that won't be needed in the final site.

There's currently 112 subdomains although some are for internal use. I cleaned up my list and am editing this version to include only typical public visitor type sites. (still just a subdomain list though)

https://kde.org
https://accessibility.kde.org
https://identity.kde.org
https://neon.kde.org
https://freebsd.kde.org
https://kdesrc-build.kde.org

https://jointhegame.kde.org (same as relate.kde.org)
https://relate.kde.org (same as jointhegame.kde.org)

https://dot.kde.org
https://news.kde.org

https://store.kde.org

https://kde.org/applications
https://umbrello.kde.org
https://rkward.kde.org
https://edu.kde.org
https://marble.kde.org
https://labplot.kde.org
https://kst-plot.kde.org

https://games.kde.org
https://rekonq.kde.org (oldish web browser)
https://konversation.kde.org
https://multimedia.kde.org
https://kmplayer.kde.org
https://amarok.kde.org
https://phonon.kde.org
https://juk.kde.org
https://minuet.kde.org (music)
https://babe.kde.org

https://okular.kde.org (doc reader)
https://kpdf.kde.org (old pdf reader 2008)
https://dolphin.kde.org
https://konsole.kde.org
https://yakuake.kde.org

https://utils.kde.org
https://zanshin.kde.org (to do)
https://notes.kde.org (web app)
https://simon.kde.org (speech reader)
https://peruse.kde.org (comic book reader)
https://atelier.kde.org

https://community.kde.org
https://planet.kde.org


**From xml sitemap tool - broken links

https://www.kde.org/info/plasma-5.12.0.php ( parent page: https://www.kde.org/announcements/plasma-5.11.95.php )
https://www.kde.org/www.qt.io/ ( parent page: https://www.kde.org/community/whatiskde/kdefreeqtfoundation.php )
https://www.kde.org/www.digia.com/ ( parent page: https://www.kde.org/community/whatiskde/kdefreeqtfoundation.php )
https://www.kde.org/community/whatiskde/lydia@kde.org ( parent page: https://www.kde.org/community/whatiskde/impressum.php )
https://community.kde.org/KDE_Community_Wiki:Current_events ( parent page: https://community.kde.org/Distributions )
https://community.kde.org/Special:WhatLinksHere/Distributions ( parent page: https://community.kde.org/Distributions )
https://community.kde.org/Special:RecentChangesLinked/Distributions ( parent page: https://community.kde.org/Distributions )
https://community.kde.org/index.php?title=Development/Git&action=edit&redlink=1 ( parent page: https://community.kde.org/Infrastructure/Get_a_Developer_Account )
https://api.kde.org/frameworks/kirigami/html/namespaceQtQuick.html ( parent page: https://api.kde.org/frameworks/kirigami/html/index.html )
https://api.kde.org/frameworks/kirigami/html/namespaceQtQuick_1_1Controls.html ( parent page: https://api.kde.org/frameworks/kirigami/html/index.html )
https://api.kde.org/frameworks/kirigami/html/namespaceorg.html ( parent page: https://api.kde.org/frameworks/kirigami/html/index.html )
https://api.kde.org/frameworks/kirigami/html/namespaceorg_1_1kde.html ( parent page: https://api.kde.org/frameworks/kirigami/html/index.html )
https://api.kde.org/frameworks/kirigami/html/namespaceorg_1_1kde_1_1kirigami.html ( parent page: https://api.kde.org/frameworks/kirigami/html/index.html )
https://api.kde.org/frameworks/kirigami/html/namespaceKirigami.html ( parent page: https://api.kde.org/frameworks/kirigami/html/index.html )
https://community.kde.org/Special:RecentChangesLinked/KDE_Visual_Design_Group/KirigamiHIG ( parent page: https://community.kde.org/KDE_Visual_Design_Group/KirigamiHIG )
https://git.reviewboard.kde.org/r/366430 ( parent page: https://www.kde.org/announcements/plasma-5.7.5-5.8.0-changelog.php )
https://www.kde.org/announcements/5.42.0 ( parent page: https://www.kde.org/announcements/kde-frameworks-5.42.0.php?site_locale=ca )
https://dot.kde.org/2014/07/07/kde-frameworks-5-makes-kde-software-more-accessible-all-qt- ( parent page: https://www.kde.org/announcements/kde-frameworks-5.42.0.php?site_locale=pt )
https://www.kde.org/announcements/community/donations/ ( parent page: https://www.kde.org/announcements/announce-applications-17.12.1.php?site_locale=nl )


**From wc3 link checker - only on kde.org, no subdomains run yet.

error Line: 197 https://www.kde.org/community/donations/index.php

Status: 200 OK

Some of the links to this resource point to broken URI fragments (such as index.html#fragment). 
Broken fragments:

    https://www.kde.org/community/donations/index.php#money (line 197)

info Line: 283 mailto:kde-webmaster@kde.org

Status: (N/A) Access to 'mailto' URIs has been disabled

Accessing links with this URI scheme has been disabled in link checker.

info Line: 275 https://www.facebook.com/kde/

Status: (N/A) Forbidden by robots.txt

The link was not checked due to robots exclusion rules. Check the link manually.

error Line: 8 https://www.kde.org/../../favicon.ico

Status: 400 Bad Request

This is usually the sign of a malformed URL that cannot be parsed by the server. Check the syntax of the link.

List of redirects

The links below are not broken, but the document does not use the exact URL, and the links were redirected. It may be a good idea to link to the final location, for the sake of speed.

warning Line: 241 https://techbase.kde.org/ redirected to https://techbase.kde.org/Welcome_to_KDE_TechBase

Status: 301 -> 200 OK

This is a permanent redirect. The link should be updated.

warning Line: 251 https://planetkde.org/ redirected to https://planet.kde.org/

Status: 301 -> 200 OK

This is a permanent redirect. The link should be updated.

warning Line: 256 https://userbase.kde.org/ redirected to https://userbase.kde.org/Welcome_to_KDE_UserBase

Status: 301 -> 200 OK

This is a permanent redirect. The link should be updated.

warning Line: 56 https://www.kde.org/products redirected to https://www.kde.org/products/

Status: 301 -> 200 OK

The link is missing a trailing slash, and caused a redirect. Adding the trailing slash would speed up browsing.

From multimedia.kde.org a link to juk leads to http://developer.kde.org/~wheeler/juk.html but there is a subdomain site for juk here https://juk.kde.org/

Thank you so much for your effort, John! This will help us A LOT.

No problem. icefox.net ala kaudioCreator linked from multimedia/kde.org appears to be down.

apol added a subscriber: apol.Feb 2 2018, 12:59 AM

FWIW, this has the information of the svn repository to use: https://websvn.kde.org/trunk/www/

The best way at the moment to help fixing these dead links is by providing a patch with the links fixed.

johnpaulding added a comment.EditedFeb 2 2018, 4:32 AM
In T7844#125991, @apol wrote:

FWIW, this has the information of the svn repository to use: https://websvn.kde.org/trunk/www/

The best way at the moment to help fixing these dead links is by providing a patch with the links fixed.

I just looked into that but it's beyond my capabilities. I can find the files in websvn and track down the line# so I'll at least be able to make it quick and easy for people familiar with subversion/git/anarcist to do.

I ran linkchecker for a few hours today. It made it through 3.5 million links before my system near froze. I'll do more and do it one directory at a time. I managed to export an html report with approx 4-5k 404s, whittled down from 21,000 because a lot were false errors of the same pattern that were actually resolving ok -- kde.org/events pulling from google calendar.

favicon.ico is another common one but I see it in the browser so it's probably being called up by some other means and the older code is still there. Other links to image files as well and those, I'll report as I have seen empty spots on the few pages I checked, where something should go.

What I'll do is verify the links are bad, provide the websvn url with line# of the offending link in the file, the bad link, and what the good link should be if possible. In most cases I seem to be able to figure out what the good link should be.

This comment was removed by johnpaulding.
johnpaulding added a comment.EditedFeb 2 2018, 4:15 PM

Should I be bothering with some of these really old announcement pages? Just ran across 1.1.2 from 1999
UPDATE 01/03/18
I didn't bother going through anything much earlier than 4.0 for announcements pages as they're probably not priority. I'm still running subdomains that are aimed at the public and will filter out the false positives, track down the associated file and line# and keep posting them to https://notes.kde.org/p/john

I've been running the linkchecker on subdomains which is probably outside the scope of this task. I've been self employed most of my life and usually as a one man show so this task concept is not something I'm used to. Should I just report bad outside of the kde.org domain link as bugs?