Details

Reviewers

Group Reviewers

Konsole
VDG

Commits

R319:e74cf6c36642: Use new character width code based on Unicode 11

Summary

Adds code for getting character width, together with LUTs generated
using uni2characterwidth from the Unicode 11 lists.
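
As a rough illustration of what a LUT-based width lookup can look like, here is a minimal sketch. It is not the generated Konsole table or its actual layout: the WidthRange struct, the kRanges table and sketchCharWidth() are hypothetical names, the table holds only a handful of hand-picked ranges, and the -1/0/1/2 values follow the wcwidth() convention as an assumption.

```cpp
#include <algorithm>
#include <cstdint>
#include <iterator>

// Hypothetical range entry: all code points in first..last share one width.
struct WidthRange {
    char32_t first;
    char32_t last;
    int8_t width; // assumed convention: -1 non-printable, 0 combining, 1 narrow, 2 wide
};

// Tiny illustrative table, sorted by `first`. NOT the generated Konsole data.
static const WidthRange kRanges[] = {
    {0x0000, 0x001F, -1},  // C0 control characters
    {0x0020, 0x007E, 1},   // printable ASCII
    {0x0300, 0x036F, 0},   // combining diacritical marks
    {0x4E00, 0x9FFF, 2},   // CJK Unified Ideographs block
    {0x1F3FB, 0x1F3FF, 2}, // emoji skin tone modifiers
    {0x1F44D, 0x1F44D, 2}, // thumbs up
};

// Binary search over the sorted ranges; code points that fall outside
// every range default to narrow (1) in this sketch.
int sketchCharWidth(char32_t ucs)
{
    const WidthRange *begin = std::begin(kRanges);
    const WidthRange *end = std::end(kRanges);
    // Find the first range that starts after `ucs`, then step back one.
    const WidthRange *it = std::upper_bound(
        begin, end, ucs,
        [](char32_t value, const WidthRange &range) { return value < range.first; });
    if (it == begin) {
        return 1;
    }
    --it;
    return ucs <= it->last ? it->width : 1;
}
```

A generated table would cover the whole Unicode range; the point here is only that a sorted range table keeps the lookup logarithmic and lets the data be regenerated without touching the search code.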

Skin tone, flag, gender, and other emoji with a modifier are not
joined (you will see e.g. a skin tone square + generic yellow emoji).
I think joining them would cause problems in most editors, command line
prompts, and other programs which use character width data, as the
characters would behave as combining or as emoji depending on context
(like ligatures).

Examples:

light thumb up: 👍🏻
dark thumb up: 👍🏿
Polish flag: 🇵🇱

This behavior is allowed:

It is possible to add support for sequences, but it would work
only in string width functions.
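
To make the consequence of not joining concrete, here is a small sketch of a string width computed one code point at a time. exampleCharWidth() and exampleStringWidth() are hypothetical helpers written for this illustration, not Konsole functions, and the width function only knows the two ranges needed for the thumbs-up example.

```cpp
#include <string>

// Hypothetical per-code-point width covering only what the example needs.
int exampleCharWidth(char32_t ucs)
{
    if (ucs >= 0x1F3FB && ucs <= 0x1F3FF) {
        return 2; // emoji skin tone modifiers
    }
    if (ucs == 0x1F44D) {
        return 2; // thumbs up
    }
    return 1; // everything else is treated as narrow in this sketch
}

// Width without joining: every code point is measured independently,
// so an emoji followed by a modifier occupies the cells of both.
int exampleStringWidth(const std::u32string &text)
{
    int total = 0;
    for (char32_t ucs : text) {
        total += exampleCharWidth(ucs);
    }
    return total;
}
```

With these helpers, exampleStringWidth(U"\U0001F44D\U0001F3FB") (thumbs up followed by the light skin tone modifier) yields 4 cells: two for the emoji and two for the modifier square. A sequence-aware variant could only live at this string level, because a function that sees a single code point cannot know what precedes or follows it.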

Some characters which can be presented as emoji are narrow (e.g. ✖️, ©️).
Those characters are listed without "presentation" mode, which means
they should be rendered as text by default (real presentation depends on
renderer and/or font). Noto Sans Color Emoji renders them as wide,
DejaVu Sans as narrow. Vim, bash and zsh treat them as narrow, so I made
them narrow.

https://unicode.org/reports/tr51/#Presentation_Style

BUG: 396435
BUG: 378124
BUG: 392171
BUG: 339439

FIXED-IN: 18.12

Depends on D15757

Test Plan

Look at emoji_test.txt - emojis should look "normal" (two characters wide).
Look at GLASS.txt - character widths should look correct.
CharacterWidthTest should pass.
perl -XCSDL -e 'print map{chr($_), " "} 1..0xffff'

Diff Detail

Repository

R319 Konsole

Lint

Automatic diff as part of commit; lint not applicable.

Unit

Automatic diff as part of commit; unit tests not applicable.

mglb requested review of this revision. (Sep 26 2018, 12:56 AM)

mglb created this revision.

Harbormaster completed remote builds in B3217: Diff 42337. (Sep 26 2018, 12:56 AM)

ngraham edited the summary of this revision. (Sep 26 2018, 1:02 AM)

ngraham added a dependency: D15757: Add a tool for generating character width tables.

You won't get any VDG objection from something as cool as this!

broulik added a subscriber: broulik. (Sep 26 2018, 7:43 AM)

broulik added inline comments.

src/CharacterWidth.cpp
24	"This is a generated file", or "This file is generated".
src/CharacterWidth.src.cpp
5	What if someone else re-generates the file/updates it?

mglb added inline comments. (Sep 26 2018, 9:46 PM)

src/CharacterWidth.src.cpp
5	Regeneration using other source files will change some numbers in the arrays. This is the same as changing constants or something like that in C++ code, so the same policy applies.

Language fix

Harbormaster completed remote builds in B3252: Diff 42398. (Sep 26 2018, 9:47 PM)

mglb marked an inline comment as done. (Sep 26 2018, 9:48 PM)

git rebase arc/396435/Add-a-tool-for-generating-character-width-tables

Harbormaster completed remote builds in B3306: Diff 42514. (Sep 28 2018, 5:57 PM)

This needs a rebase as well

git rebase master

Harbormaster completed remote builds in B3400: Diff 42674. (Oct 1 2018, 3:33 PM)

Set upstream to master

Harbormaster completed remote builds in B3402: Diff 42676. (Oct 1 2018, 3:39 PM)

Thanks, I don't see anything obviously wrong; let me test it a bit more and we'll get it into master for more testing.

hindenburg edited the summary of this revision. (Oct 3 2018, 3:03 PM)

hindenburg edited the test plan for this revision. (Oct 3 2018, 3:05 PM)

hindenburg accepted this revision. (Oct 3 2018, 3:11 PM)

This revision is now accepted and ready to land. (Oct 3 2018, 3:11 PM)

Closed by commit R319:e74cf6c36642: Use new character width code based on Unicode 11 (authored by mglb, committed by hindenburg). (Oct 3 2018, 3:11 PM)

This revision was automatically updated to reflect the committed changes.

pppschmitt added a subscriber: pppschmitt. (Nov 16 2020, 12:32 PM)

		Path
D	M	COPYING.Unicode (64 lines)
M		src/CMakeLists.txt (2 lines)
M		src/Character.h (4 lines)
A	M	src/CharacterWidth.h (8 lines)
A	M	src/CharacterWidth.cpp (159 lines)
A	M	src/CharacterWidth.src.cpp (102 lines)
M		src/Filter.cpp (2 lines)
M		src/TerminalCharacterDecoder.cpp (2 lines)
M		src/TerminalDisplay.cpp (2 lines)
M		src/autotests/CharacterWidthTest.cpp (8 lines)
D	M	src/konsole_wcwidth.h (16 lines)
D	M	src/konsole_wcwidth.cpp (238 lines)
A	M	tools/uni2characterwidth/overrides.txt (3 lines)

Diff	ID	Base	Description	Created	Lint	Unit
Base			Base
Diff 1	42337	6f3ee1f		Sep 26 2018, 12:56 AM	★	★
Diff 2	42398	7421a8f	Language fix	Sep 26 2018, 9:47 PM	★	★
Diff 3	42514	d04fac2	git rebase arc/396435/Add-a-tool-for-generating-character-width-tables	Sep 28 2018, 5:57 PM	★	★
Diff 4	42674	0f33ee5	git rebase master	Oct 1 2018, 3:32 PM	★	★
Diff 5	42676	bfb91aa		Oct 1 2018, 3:37 PM	★	★
Diff 6	42800	bfb91aa	R319:e74cf6c36642247f3f79194da373d01a00645d36	Oct 3 2018, 3:11 PM	★	★

Use new character width code based on Unicode 11
Closed, Public