From a weird Dolphin bug 从一个奇怪的 Dolphin bug 说起

One day when I opened Dolphin I found that it is reporting the file does not exist when I tried to open one with a Chinese file name. "Well, that's bad," I thought, and instead opened it in the terminal. Recently I went through the bug list for Dolphin and KIO and saw that encoding-related bugs are all mentioning that file cannot be trashed. Which is not the case for me. So I opened a new bug and started investigating it.
某日我打开 Dolphin,发觉它在我试图打开一个文件名带中文的文件的时候告诉我那文件不存在。 「行吧,太差了。」我想着,然后在终端里打开了那个文件。最近我遍览了 Dolphin 和 KIO 的 bug 列表,看着编码相关的 bug 全在说文件不能被扔垃圾桶。而我这里却不然。所以我开了个 新 bug 开始钻研它。
A note: if your Qt is not configured to use icu, then this does not apply to you.
提示:如果您的 Qt 没有被配置为使用 icu,那么这事对您便不适用。

From Dolphin to KIO从 Dolphin 到 KIO

So I checked Dolphin's code, found that the actual code for opening a file is with KIO. The implementation is in KFileItemActions. However, that seems not to be the real source of error. The KFileItemList there is already wrong, with all non-ascii characters turned into question marks. Further I looked into the logic of obtaining the content of a directory: KIO invokes a plugin called file.so, for local filesystem, and that, which is actually a kdeinit program, run as another process, does the real work. Then the data is transferred back to Dolphin using a QDataStream. And in file.so it seems that everything looks correct.
所以啊我就查了查 Dolphin 的代码,发觉真正打开文件的代码 在 KIO 里。实现 KFileItemActions。然而,那好像并不是 真的错因。那里的 KFileItemList 已经错了,非 ascii 的字符全变问号了。再往下,我 看了看获取目录内容的逻辑:对于本地文件系统,KIO 调用了一个唤作 file.so 的插件, 而那插件实际上是个 kdeinit 程序,作为另一个进程,做了实事的了。然后数据通过一个 QDataStream 传到 Dolphin 里。在 file.so 里,一切看着都很对。
Why is that? Is it the QDataStream is bad? I looked into the code for how QDataStream transfers strings and, no, it cannot be. Also I noticed that the filename is displaying correctly in Dolphin, just I cannot open it. I got some support on the #kde-devel channel, and someone reminded me of the LegacyCodec in file.so. That is not the cause of the problem, of cource, as in there it is nothing wrong. But it reminded me of the QTextCodec problem. There is something Qt used to convert its UTF-16 representation into other encodings. And that is called QTextCodec::codecForLocale(). An observation is that in file.so, the codec name is called UTF-8; while in Dolphin, it is called US-ASCII, unless I set LC_ALL to a UTF-8 locale.
为啥啊?QDataStream 坏了吗?我看了下 QDataStream 是怎么传字串的,啊这, 不可能错的。又注意到文件名在 Dolphin 里显示啥问题都没有,只是我打不开。 在 #kde-devel 上得到了些帮助,有人提醒我 file.so 里有一 LegacyCodec。 当然这不是真的错因,因为那边没啥问题啊。但是这教我注意到 QTextCodec 的问题了。 有个玩意儿是 Qt 用来把它自己的 UTF-16 表示法转成别的编码的。那玩意儿叫 QTextCodec::codecForLocale()。观察到 file.so 里的 codec 名字叫 UTF-8;跑 Dolphin 里它变成 US-ASCII 了,除非我把 LC_ALL 设成一个 UTF-8 的 locale。

icu and Qt bugsicu 和 Qt 的 bug 们

I digged into the code for that, and it turns out that it ultimately uses ucnv_getDefaultName() to obtain the name of the encoding. That function is from icu. In their bug tracker, I found one that says:

You may need to call setlocale(LC_ALL, ""); from your own code before ICU.

我钻进代码里一看,它用了个叫 ucnv_getDefaultName() 的去获取编码的名字。 这函数 icu 里的。在伊们的 bug 追踪器上,我看到一个说:

在 ICU 之前,您可能得从您自己的代码里调用 setlocale(LC_ALL, "");

And that reminds me. Does Qt call it for us? I searched on Qt's bug tracker and got this:

When using ICU, Qt may call ucnv_getDefaultName() without calling setlocale(.., "") first, which is required before calling ucnv_getDefaultName(). One such possible path is: QString::fromLocal8Bit() -> QTextCodec::codecForLocale() -> QIcuCodec::defaultCodecUnlocked() -> ucnv_getDefaultName() setlocale() is called in QCoreApplicationPrivate::initLocale(), which may not have been called.

But unfortunately, it seems the Qt person does not want to fix it. Ok, I guess, then what is calling ucnv_getDefaultName()? At least, if they do not fix it, I must fix it.
这又给我提了个醒了。Qt 给咱们调用这玩意儿了吗?我搜搜 Qt 的 bug 追踪器,看到

用 ICU 时,Qt 可能会调用 ucnv_getDefaultName() 而先前却没有调用过 setlocale(.., "")。后者则是在调用 ucnv_getDefaultName() 之前必须得调用的. 一个可能的途径是: QString::fromLocal8Bit() -> QTextCodec::codecForLocale() -> QIcuCodec::defaultCodecUnlocked() -> ucnv_getDefaultName() setlocale() 是在 QCoreApplicationPrivate::initLocale() 里调用的。但这玩意儿可能还没被调用呢。

How to track down a function call怎么追踪一个函数调用

I did not know. First intuition was to search for fromLocal8Bit in the KIO source. Yields nothing useful. Then it occurred to me that I could use LD_PRELOAD trick. From proxychains to valgrind, they all use LD_PRELOAD to override an existing function. If I override ucnv_getDefaultName() to have it call std::terminate(), I could get a backtrace using gdb. So here it is:
我不知道啊。第一个直觉是在 KIO 源码里搜 fromLocal8Bit。并没有用。然后我突然想起来 LD_PRELOAD 的伎俩了。从 proxychainsvalgrind,都用了 LD_PRELOAD 去覆盖 一个已有的函数。如果我把 ucnv_getDefaultName() 覆盖了,让它调用 std::terminate(), 我便能用 gdb 整个 backtrace 出来。所以就这样了:
1
2
3
4
5
6
7
#include <exception>

extern "C" const char *ucnv_getDefaultName(void)
{
std::terminate();
return "";
}
Compile that file with g++ ucnv-override.cpp -shared -fPIC -o ucnv-override.so and set LD_PRELOAD=/path/to/ucnv-override.so. Use gdb to invoke dolphin. Got a backtrace. And it's there:
g++ ucnv-override.cpp -shared -fPIC -o ucnv-override.so 编译那文件, 设下 LD_PRELOAD=/path/to/ucnv-override.so。用 gdb 启动 dolphin。 弄出一个 backtrace。搁这儿呢:
1
2
3
4
5
6
#26 0x00007ffff582370b in QLoggingCategory::init(char const*, QtMsgType) () from /usr/lib64/libQt5Core.so.5
#27 0x00007ffff772d29b in __static_initialization_and_destruction_0 (__initialize_p=1, __priority=65535)
at /home/tusooa/Code/kio/src/widgets/kdirmodel.cpp:39
#28 0x00007ffff772d2ce in _GLOBAL__sub_I_kdirmodel.cpp(void) ()
at /home/tusooa/Code/kio/build/src/widgets/KF5KIOWidgets_autogen/include/moc_kdirmodel.cpp:155
#29 0x00007ffff7fe1c1e in ?? () from /lib64/ld-linux-x86-64.so.2
No statement from main, no statement from Dolphin. It is in the initialization of a global variable. And that is:
没有 main 的语句。没有 Dolphin 的语句。这玩意是在初始化全局变量的。 呐看:
1
static QLoggingCategory category("kf.kio.widgets.kdirmodel", QtInfoMsg);
The QLoggingCategory. But Krita uses that for debug too, while I do not have any issues dealing with Chinese file names? It turns out that we have been using the Q_LOGGING_CATEGORY macro, and "The implicitly-defined QLoggingCategory object is created on first use, in a thread-safe manner." (from Qt doc) So in that case, the initialization is always run after entering main(), where setlocale() has been called already.
QLoggingCategory。但 Krita 也用它做调试啊,我也没发现处理中文文件名有 啥问题啊?原来是我们用的是 Q_LOGGING_CATEGORY 宏,而「这隐式定义的 QLoggingCategory 对象在初次使用时给创建的,这过程还线程安全。」(来自 Qt 文档)所以那情况下,初始化肯定在进了 main()setlocale() 被调用过了的时候才运行了的。
So, replace that with Q_LOGGING_CATEGORY(category, "kf.kio.widgets.kdirmodel", QtInfoMsg), and everything went back to normal. 200 IQ.
所以啊,把那换成 Q_LOGGING_CATEGORY(category, "kf.kio.widgets.kdirmodel", QtInfoMsg), 啥东西都正常了。智商 200。