First up I have to state that I'm putting my toes here into a territory that I might not really understand, as for me the simple (7-bit) ASCII character set is sufficient. So I'm lacking any prior knowledge.
Having said that I wanted to see for myself what the difference in the character set between the console (which flahaie needs to get fixed) and a 'xterm' might be. For that purpose I created a little 'awk' script (also as an exercise in unfamiliar territory):
#!/bin/sh
# show printable characters: 0x20 to 0xFF
echo | awk ' {
H = ""
for (j = 0; j <= 15; j++) {
H=H sprintf(" %02X ", j)
}
printf("\n\t%s\n", H)
S = ""
for (i = 32; i <= 256; i++) {
if (length(S) == 0) {
S=sprintf("0x%02X\t", (i / 16 ) * 16)
}
S=S sprintf(" %c ", i)
if ((i % 16) == 15) {
printf("%s\n", S)
S = ""
}
}
}'
When run in 'xterm' it showed for the range 0xA0 - 0xFF all sorts of characters and comparing it with this
wikipedia page I concluded that it represents the 'latin-1' (AKA: IEC 8859-1) character set.
OTOH when run in a console window the entire range 0x80 - 0xFF showed up as solid little rectangles, and I guess this is what constitutes the reported problem. BTW, using the same little script in the console of a Arch Linux systems also shows the 'latin-1' characters, so indeed there might be something amiss in TC.
EDIT: I should have also mentioned that I'm now aware that (at least in theory) it should be possible to change the console font via
'loadfont < /path/to/consolefont.psf'
but so far I had no luck with my limited attempts using some files from the most recent sources I could find (i.e. 'kbd-1.15.3.tar.bz2').